Statistical modeling of binaural signal and its application to binaural source separation

Abstract

This paper presents a new statistical model of binaural signals and applies it to efficient binaural source separation. Binaural source separation must retain the spatial cues of the separated sound, such as the head-related transfer function (HRTF). However, using an HRTF directly is not realistic because this information is normally not known in advance. To cope with this problem, we first focus on the difference between the signal probability density functions at the two ears, which can be blindly estimated using our previous work on higher-order statistics. Next, we derive a sound-localization-preserving generalized minimum mean-square error short-time spectral amplitude estimator. Objective and subjective experiments show the efficacy of the proposed method in terms of spatial quality.

DOI: 10.1109/ICASSP.2015.7178018

Cite this paper

@article{Murota2015StatisticalMO,
  title={Statistical modeling of binaural signal and its application to binaural source separation},
  author={Yuki Murota and Daichi Kitamura and Shoichi Koyama and Hiroshi Saruwatari and Satoshi Nakamura},
  journal={2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year={2015},
  pages={494-498}
}