A Modified SRP-PHAT Functional for Robust Real-Time Sound Source Localization With Scalable Spatial Sampling

  title={A Modified SRP-PHAT Functional for Robust Real-Time Sound Source Localization With Scalable Spatial Sampling},
  author={Maximo Cobos and Amparo Marti and Jos{\'e} J. L{\'o}pez},
  journal={IEEE Signal Processing Letters},
The Steered Response Power - Phase Transform (SRP-PHAT) algorithm has been shown to be one of the most robust sound source localization approaches operating in noisy and reverberant environments. However, its practical implementation is usually based on a costly fine grid-search procedure, making the computational cost of the method a real issue. In this letter, we introduce an effective strategy that extends the conventional SRP-PHAT functional with the aim of considering the volume… 

Figures and Tables from this paper

Iterative Modified SRP-PHAT with Adaptive Search Space for Acoustic Source Localization
The proposed method achieves higher accuracy with only a marginal increase in computation expense than that of iterative modified SRP-PHAT and is compared to the state-of-the-art techniques on a simulated dataset and SMARD database to validate its reliability.
Real-time sound source localization using hybrid framework
A note on the modified and mean-based steered-response power functionals for source localization in noisy and reverberant environments
  • M. Cobos
  • Computer Science
    2014 6th International Symposium on Communications, Control and Signal Processing (ISCCSP)
  • 2014
The modified SRP (M-SRP) was proposed to tackle the localization of one or multiple speakers in reverberant and/or noisy scenarios, improving the performance of the original method when coarser spatial resolutions are considered.
A Steered-Response Power Algorithm Employing Hierarchical Search for Acoustic Source Localization Using Microphone Arrays
This paper proposes a new method for sound source localization (called H-SRP), which applies the SRP approach to space regions instead of grid points, and attains high performance with manageable complexity.
Steered sample algorithm for acoustic source localization
A steered sample algorithm (SSA) based on the reciprocity of wave propagation for acoustic source localization is proposed, similar to the hyperbolic Radon transform, which is theoretically analyzed and is the most essential difference form the SRP/SRP-PHAT.
Exploiting a geometrically sampled grid in the steered response power algorithm for localization improvement.
A spatial grid design procedure, called the geometrically sampled grid (GSG), which aims at computing the spatial grid by taking into account the discrete sampling of time difference of arrival (TDOA) functions and the desired spatial resolution.
Real-Time Sound Source Localization in Videoconferencing Environments
Sound Source Localization (SSL) mechanisms have been extensively studied. Many applications like teleconferencing or speech enhancement systems require the localization of one or more acoustic
A two-stage approach for passive sound source localization based on the SRP-PHAT algorithm
The aim of this paper is to develop a computationally viable approach to find the coordinate location of a sound source with acceptable accuracy and the proposed approach was successful in obtaining good results compared to the conventional SRP-PHAT algorithm.
An Iterative Approach to Source Counting and Localization Using Two Distant Microphones
A time difference of arrival (TDOA) estimation framework based on time-frequency inter-channel phase difference (IPD) and an iterative contribution removal (ICR) algorithm that successfully solves the GCC peak ambiguities between multiple sources and multiple reverberant paths.


A Fast Microphone Array SRP-PHAT Source Location Implementation using Coarse-To-Fine Region Contraction(CFRC)
  • Hoang Do, H. Silverman
  • Computer Science
    2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
  • 2007
This paper introduces the use of coarse-to-fine region contraction (CFRC) to make computing the SRP practical, and shows that CFRC costs about the same as SRC overall, but requires only about 63% of SRC's cost under very noisy conditions.
Performance of real-time source-location estimators for a large-aperture microphone array
It is found that many small features are required to make a useful location estimating algorithm work and work well in real-time, and the current LEMSalg is being used successfully in a representative environment where microphone SNRs are below 0 dB.
The Fusion of Distributed Microphone Arrays for Sound Localization
The recently proposed sound localization technique, known as SRP-PHAT, is shown to be a special case of the more general microphone array integration mechanism presented here, which utilizes spatial likelihood functions produced by each microphone array and integrates them using a weighted addition of the individual SLFs.
Optimal microphone placement for source localization using time delay estimation
It is shown that discrete TDOAs result in a non-uniformly distributed spatial resolution across the room furthermore it is shown how the sample frequency and the microphone distance effect the spatial resolution.
Time Delay Estimation in Room Acoustic Environments: An Overview
A systematic overview of the state-of-the-art of time-delay-estimation algorithms ranging from the simple cross-correlation method to the advanced blind channel identification based techniques is presented.
Robust Localization in Reverberant Rooms
This chapter summarizes the current field and comments on the general merits and shortcomings of each genre, and presents a new localization method that is significantly more robust to acoustical conditions, particularly reverberation effects, than the traditional localization techniques in use today.
The generalized correlation method for estimation of time delay
A maximum likelihood (ML) estimator is developed for determining time delay between signals received at two spatially separated sensors in the presence of uncorrelated noise. This ML estimator can be
Microphone Arrays - Signal Processing Techniques and Applications
This paper presents a meta-modelling architecture for microphone Array Processing that automates the very labor-intensive and therefore time-heavy and expensive process of manually shaping Microphone Arrays for Speech Input in Automobiles.
A Novel Efficient Cluster-Based MLSE Equalizer for Satellite Communication Channels with-QAM Signaling
A cluster-based sequence equalizer (CBSE) that takes advantage of the symmetries of the-ary quadrature amplitude modulation (-QAM) constellation and exhibits enhanced performance compared to other techniques, including the conventional linear transversal equalizer, Volterra equalizers, and RBF network equalizers.
A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments Using Microphone Arrays
He went on to Brown University in Providence, Rhode Island to study signal processing and began research on microphone arrays and received a Master of Science degree in Electrical Engineering in 1993 and continued to pursue his work towards a Doctor of Philosophy degree.