Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression

@article{Deleforge2015CoLocalizationOA,
  title={Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression},
  author={Antoine Deleforge and Radu Horaud and Yoav Y. Schechner and Laurent Girin},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  year={2015},
  volume={23},
  pages={718--731}
}
This paper addresses the problem of localizing audio sources using binaural measurements. We propose a supervised formulation that simultaneously localizes multiple sources at different locations. The approach is intrinsically efficient because, contrary to prior work, it relies neither on source separation, nor on monaural segregation. The method starts with a training stage that establishes a locally linear Gaussian regression model between the directional coordinates of all the sources and…

Citations

Publications citing this paper.

Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion

  • IEEE Transactions on Pattern Analysis and Machine Intelligence
  • 2018

Localization of sound sources in robotics: A review

  • Robotics and Autonomous Systems
  • 2017

Audio-Visual Speech-Turn Detection and Tracking

Hearing in a shoe-box: Binaural source position and wall absorption estimation using virtually supervised learning

  • 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2016

Exploiting the Complementarity of Audio and Visual Data in Multi-speaker Tracking

  • 2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
  • 2017

VAST: The Virtual Acoustic Space Traveler Dataset

Accurate Target Annotation in 3D from Multimodal Streams

  • ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2019
