A Modified SRP-PHAT Functional for Robust Real-Time Sound Source Localization With Scalable Spatial Sampling

@article{Cobos2011AMS,
  title={A Modified SRP-PHAT Functional for Robust Real-Time Sound Source Localization With Scalable Spatial Sampling},
  author={M. Cobos and A. Marti and Jos{\'e} J. L{\'o}pez},
  journal={IEEE Signal Processing Letters},
  year={2011},
  volume={18},
  pages={71-74}
}
The Steered Response Power - Phase Transform (SRP-PHAT) algorithm has been shown to be one of the most robust sound source localization approaches operating in noisy and reverberant environments. However, its practical implementation is usually based on a costly fine grid-search procedure, making the computational cost of the method a real issue. In this letter, we introduce an effective strategy that extends the conventional SRP-PHAT functional with the aim of considering the volume… Expand
Real-time sound source localization using hybrid framework
Abstract Real-time sound source localization with microphone array still remains as a difficult task. The steered response power–phase transform (SRP–PHAT) method is proved to be robust, but itExpand
A note on the modified and mean-based steered-response power functionals for source localization in noisy and reverberant environments
  • M. Cobos
  • Engineering, Computer Science
  • 2014 6th International Symposium on Communications, Control and Signal Processing (ISCCSP)
  • 2014
TLDR
The modified SRP (M-SRP) was proposed to tackle the localization of one or multiple speakers in reverberant and/or noisy scenarios, improving the performance of the original method when coarser spatial resolutions are considered. Expand
Exploiting a Geometrically Sampled Grid in the SRP-PHAT for Localization Improvement and Power Response Sensitivity Analysis
TLDR
A spatial grid design procedure, called the geometrically sampled grid (GSG), which aims at computing the spatial grid by taking into account the discrete sampling of time difference of arrival (TDOA) functions and the desired spatial resolution. Expand
A Steered-Response Power Algorithm Employing Hierarchical Search for Acoustic Source Localization Using Microphone Arrays
TLDR
This paper proposes a new method for sound source localization (called H-SRP), which applies the SRP approach to space regions instead of grid points, and attains high performance with manageable complexity. Expand
Steered sample algorithm for acoustic source localization
TLDR
A steered sample algorithm (SSA) based on the reciprocity of wave propagation for acoustic source localization is proposed, similar to the hyperbolic Radon transform, which is theoretically analyzed and is the most essential difference form the SRP/SRP-PHAT. Expand
Proposal and validation of an analytical generative model of SRP-PHAT power maps in reverberant scenarios
TLDR
A novel parametric analytical model to predict SRP-PHAT power maps is formulated and shows high accuracy for very different acoustical and geometrical conditions, becoming an excellent tool to be exploited for the improvement of real world relevant applications related to acoustic localization. Expand
Real-Time Sound Source Localization in Videoconferencing Environments
Sound Source Localization (SSL) mechanisms have been extensively studied. Many applications like teleconferencing or speech enhancement systems require the localization of one or more acousticExpand
A two-stage approach for passive sound source localization based on the SRP-PHAT algorithm
TLDR
The aim of this paper is to develop a computationally viable approach to find the coordinate location of a sound source with acceptable accuracy and the proposed approach was successful in obtaining good results compared to the conventional SRP-PHAT algorithm. Expand
An Iterative Approach to Source Counting and Localization Using Two Distant Microphones
TLDR
A time difference of arrival (TDOA) estimation framework based on time-frequency inter-channel phase difference (IPD) and an iterative contribution removal (ICR) algorithm that successfully solves the GCC peak ambiguities between multiple sources and multiple reverberant paths. Expand
Sensitivity-based region selection in the steered response power algorithm
TLDR
A sensitivity-based region selection SRP (R-SRP) algorithm that exploits the nonuniform TDOA information accumulation on the search grid and improves the localization performance in comparison to the state-of-the-art. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 21 REFERENCES
A Fast Microphone Array SRP-PHAT Source Location Implementation using Coarse-To-Fine Region Contraction(CFRC)
  • Hoang Do, H. Silverman
  • Engineering
  • 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
  • 2007
Most real microphone-array applications require sound sources to be localized in a noisy, reverberant environment. In such conditions, the steered response power using the phase transform (SRP-PHAT)Expand
A Real-Time SRP-PHAT Source Location Implementation using Stochastic Region Contraction(SRC) on a Large-Aperture Microphone Array
  • Hoang Do, H. Silverman, Ying Yu
  • Computer Science
  • 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07
  • 2007
TLDR
This paper discusses one important SRP method, computing it from the phase transform (SRP-PHAT), reviews SRC, and shows the computational saving, showing that SRC saves computation by more than two orders of magnitude with almost no loss in accuracy. Expand
Performance of real-time source-location estimators for a large-aperture microphone array
TLDR
It is found that many small features are required to make a useful location estimating algorithm work and work well in real-time, and the current LEMSalg is being used successfully in a representative environment where microphone SNRs are below 0 dB. Expand
Estimation of direction of arrival using information theory
TLDR
Gaussianity of the source signal is assumed and an information-theoretical measure is used, often met in blind source separation algorithms, to derive a robust DOA estimator, even under significant reverberant conditions, to indicate that the examined framework can accurately track the DOA of a typical acoustic source. Expand
The Fusion of Distributed Microphone Arrays for Sound Localization
  • P. Aarabi
  • Computer Science
  • EURASIP J. Adv. Signal Process.
  • 2003
TLDR
The recently proposed sound localization technique, known as SRP-PHAT, is shown to be a special case of the more general microphone array integration mechanism presented here, which utilizes spatial likelihood functions produced by each microphone array and integrates them using a weighted addition of the individual SLFs. Expand
Optimal microphone placement for source localization using time delay estimation
TLDR
It is shown that discrete TDOAs result in a non-uniformly distributed spatial resolution across the room furthermore it is shown how the sample frequency and the microphone distance effect the spatial resolution. Expand
Time Delay Estimation in Room Acoustic Environments: An Overview
TLDR
A systematic overview of the state-of-the-art of time-delay-estimation algorithms ranging from the simple cross-correlation method to the advanced blind channel identification based techniques is presented. Expand
Robust Localization in Reverberant Rooms
TLDR
This chapter summarizes the current field and comments on the general merits and shortcomings of each genre, and presents a new localization method that is significantly more robust to acoustical conditions, particularly reverberation effects, than the traditional localization techniques in use today. Expand
The generalized correlation method for estimation of time delay
A maximum likelihood (ML) estimator is developed for determining time delay between signals received at two spatially separated sensors in the presence of uncorrelated noise. This ML estimator can beExpand
Microphone Arrays - Signal Processing Techniques and Applications
TLDR
This paper presents a meta-modelling architecture for microphone Array Processing that automates the very labor-intensive and therefore time-heavy and expensive process of manually shaping Microphone Arrays for Speech Input in Automobiles. Expand
...
1
2
3
...