Blind adaptive filtering for time delay of arrival (TDOA) estimation is a very powerful method for acoustic source localization in reverberant environments with broadband signals like speech. Based on a recently presented generic framework for blind signal processing for convolutive mixtures, called TRINICON, we present a TDOA estimation method for ...
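To make the notion of a TDOA concrete, here is a minimal sketch in Python. It does not implement the blind adaptive filtering or the TRINICON framework named in the abstract; it swaps in plain cross-correlation peak picking, which is only reliable in weakly reverberant conditions. The function name `estimate_tdoa` and the synthetic test signal are assumptions made for illustration.

```python
import numpy as np

def estimate_tdoa(x1, x2, fs):
    """Estimate the delay of x2 relative to x1 in seconds.

    Stand-in technique: peak of the plain cross-correlation.
    A positive result means x2 lags behind x1.
    """
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    # cc[k] = sum_n x2[n + k] * x1[n], for k = -(len(x1) - 1) .. len(x2) - 1
    cc = np.correlate(x2, x1, mode="full")
    lags = np.arange(-(len(x1) - 1), len(x2))
    return lags[np.argmax(cc)] / fs

if __name__ == "__main__":
    # Hypothetical two-microphone scenario: white noise delayed by 7 samples.
    rng = np.random.default_rng(0)
    fs = 16000
    s = rng.standard_normal(1024)
    delayed = np.concatenate([np.zeros(7), s[:-7]])
    print(estimate_tdoa(s, delayed, fs) * fs, "samples")
```

In reverberant rooms the cross-correlation peak is easily displaced by reflections, which is the problem that motivates the adaptive-filtering approach described above.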
The available technologies for the presentation of audiovisual scenes to large audiences show different degrees of maturity. While high-quality, physics-based rendering of 3D scenes is found in many visual applications, the presentation of the accompanying audio content is based on much simpler technologies. Multi-channel cinema sound systems are only ...
This paper discusses novel methods for detecting and localizing multiple wideband acoustic sources using spherical apertures. In contrast to traditional methods, the techniques presented here are not based on processing the output of individual microphones directly. Instead, the microphone signals are used to decompose the wave field into its spherical ...
In this paper, a real-time system for immersive audio applications is presented. Sound sources are recorded using a microphone array whose beam is steered according to the output of an acoustic source localization and tracking system. The output of the beamformer (BF), along with the source position updates, is continuously transmitted to a wave field ...
Spherical microphone array eigenbeam (EB)-ESPRIT gives an elegant closed-form solution for 3D broadband source localization based on the spherical harmonics (eigenbeam) framework. However, several issues in practical implementations have not yet been rigorously studied, e.g. how to avoid the ill-conditioning of an EB-ESPRIT matrix, solve the ...
This paper is concerned with the problem of localizing multiple wideband acoustic sources. In contrast to existing techniques, this method takes the physics of wave propagation into account. 2D wave fields are decomposed using cylindrical harmonics as basis functions by a circular array mounted in a rigid cylindrical baffle. The obtained wave field ...
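The angular part of such a decomposition can be sketched as follows. This is an illustrative simplification, not the paper's method: it assumes M microphones spaced uniformly on the circle and computes only the angular Fourier coefficients, leaving out the frequency-dependent radial term that a rigid cylindrical baffle introduces. The function name `circular_harmonics` is hypothetical.

```python
import numpy as np

def circular_harmonics(p, order):
    """Angular harmonic coefficients of pressure samples on a uniform circular array.

    p     : complex pressure at M microphones located at angles 2*pi*m/M
    order : highest harmonic order N; coefficients are returned for n = -N..N

    Assumption: the baffle-dependent radial weighting is not compensated here.
    """
    p = np.asarray(p, dtype=complex)
    num_mics = p.shape[0]
    if 2 * order + 1 > num_mics:
        raise ValueError("need at least 2*order + 1 microphones to avoid aliasing")
    phi = 2 * np.pi * np.arange(num_mics) / num_mics
    n = np.arange(-order, order + 1)
    # C_n = (1/M) * sum_m p(phi_m) * exp(-j * n * phi_m)
    basis = np.exp(-1j * np.outer(n, phi))
    return n, basis @ p / num_mics

if __name__ == "__main__":
    # Synthetic field made of two harmonics, sampled by 8 microphones.
    phi = 2 * np.pi * np.arange(8) / 8
    field = np.exp(2j * phi) + 0.5 * np.exp(-1j * phi)
    orders, coeffs = circular_harmonics(field, order=3)
    for n, c in zip(orders, coeffs):
        print(n, np.round(c, 6))
```

Working on harmonic coefficients instead of raw microphone signals is what allows the localization step to operate on a description of the wave field itself, independent of the individual sensor positions.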
This paper presents an audiovisual database that can be used as a reference database for testing and evaluation of video, audio or joint audiovisual person tracking algorithms, as well as speaker localization methods. Additional possible uses include the testing of face detection and pose estimation algorithms. A number of different scenes are included in ...