Statistical modeling of binaural signal and its application to binaural source separation


This paper addresses a new statistical model of binaural signals and its application to efficient binaural source separation. Binaural source separation is always required to retain a spatial cue of the separated sound, such as a head-related transfer function (HRTF). However, the direct use of an HRTF is not realistic because this information is normally not known in advance. To cope with this problem, first, we focus on the difference between signal probability density functions at both ears, which can be blindly estimated by using our previous work on higher-order statistics. Next, we derive a sound-localization-preserved generalized minimum mean-square error short-time spectral amplitude estimator. Objective and subjective experiments show the efficacy of the proposed method in terms of spatial quality.

DOI: 10.1109/ICASSP.2015.7178018

4 Figures and Tables

Showing 1-10 of 22 references

MPEG spatial audio object coding-the ISO/MPEG standard for efficient coding of interactive audio scenes

  • J Herre, H Purnhagen, +11 authors H.-O Oh
  • 2012

Binaural hearing aid using sound-localization-preserved MMSE STSA estimator with ICA-based noise estimation

  • H Saruwatari, M Go, R Okamoto, K Shikano, H Hosoi
  • 2010
1 Excerpt

Interactive controller for audio object localization based on spatial representative vector operation

  • N Kamado, H Nawata, H Saruwatari, K Shikano
  • 2010
1 Excerpt