Corpus ID: 237532792

A Survey of Sound Source Localization with Deep Learning Methods

Pierre-Amaury Grumiaux, Srdjan Kitić, Laurent Girin, Alexandre Guérin
This article is a survey of deep learning methods for single and multiple sound source localization. We are particularly interested in sound source localization in indoor/domestic environments, where reverberation and diffuse noise are present. We provide an exhaustive topography of the neural-based localization literature in this context, organized according to several aspects: the neural network architecture, the type of input features, the output strategy (classification or regression), the…
1 Citation

Estimation of Azimuth and Elevation for Multiple Acoustic Sources Using Tetrahedral Microphone Arrays and Convolutional Neural Networks
A method for multiple acoustic source localization using a tetrahedral microphone array and a convolutional neural network (CNN) is presented. Our method presents a novel approach for the estimation…


SSLIDE: Sound Source Localization for Indoors Based on Deep Learning
This paper presents SSLIDE (Sound Source Localization for Indoors using DEep learning), which applies deep neural networks with an encoder-decoder structure to localize sound sources at random positions in a continuous space. In reverberant environments, it outperforms multiple signal classification, steered response power with phase transform, sparse Bayesian learning, and a competing convolutional neural network approach.
Sound Source Localization Using Deep Learning Models
This study shows that, with end-to-end training and generic preprocessing, deep residual networks not only surpass the block-level accuracy of linear models in nearly clean environments but also show robustness to challenging conditions by exploiting the time delay on power information.
Learning Multiple Sound Source 2D Localization
The results show that the method improves upon the previous baseline approach for this problem, and new metrics relying on resolution-based multiple source association are developed.
A deep learning method for grid-free localization and quantification of sound sources.
The investigation reveals a method that quickly and accurately renders the position and strength of an unknown source; the accuracy of the position estimation is higher than the grid resolution of the beamforming map.
Discriminative multiple sound source localization based on deep neural networks using independent location model
The experiments indicated that SSL based on DNNs trained with the proposed training method outperformed a conventional SSL method by a maximum of 18 points in terms of block-level correctness.
Unsupervised adaptation of deep neural networks for sound source localization using entropy minimization
  • Ryu Takeda, Kazunori Komatani
  • Computer Science
  • 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2017
This paper describes an unsupervised method of adapting deep neural networks (DNNs) for sound source localization (SSL) that improved localization accuracy by a maximum of 20 points for unknown positions and reverberant data.
Towards End-to-End Acoustic Localization Using Deep Learning: From Audio Signals to Source Position Coordinates
This paper presents a novel approach for indoor acoustic source localization using microphone arrays, based on a Convolutional Neural Network designed to directly estimate the three-dimensional position of a single acoustic source using the raw audio signal as the input information and avoiding the use of hand-crafted audio features.
Deep Ranking-Based Sound Source Localization
A novel weakly supervised deep-learning localization method is presented that exploits only a few labeled (anchor) samples with known positions, together with a larger set of unlabeled samples for which only the relative physical ordering is known.
Source localization in reverberant rooms using Deep Learning and microphone arrays
This paper presents an efficient tensor GPU-based computation of synthetic room impulse responses using fractional delays for image source models, and analyzes the localization performance of the proposed neural network fed with this dataset, which yields a significant improvement in SSL accuracy over the traditional MUSIC and SRP-PHAT methods.
Sound Localization Based on Phase Difference Enhancement Using Deep Neural Networks
A DNN-based phase difference enhancement for DoA estimation is proposed, which turned out to be better than direct estimation of the DoAs from the input interchannel phase differences (IPDs).