A new linear MMSE filter for single channel speech enhancement based on Nonnegative Matrix Factorization

Abstract

In this paper, a linear MMSE filter is derived for single-channel speech enhancement which is based on Nonnegative Matrix Factorization (NMF). Assuming an additive model for the noisy observation, an estimator is obtained by minimizing the mean square error between the clean speech and the estimated speech components in the frequency domain. In addition, the noise power spectral density (PSD) is estimated using NMF and the obtained noise PSD is used in a Wiener filtering framework to enhance the noisy speech. The results of the both algorithms are compared to the result of the same Wiener filtering framework in which the noise PSD is estimated using a recently developed MMSE-based method. NMF based approaches outperform the Wiener filter with the MMSE-based noise PSD tracker for different measures. Compared to the NMF-based Wiener filtering approach, Source to Distortion Ratio (SDR) is improved for the evaluated noise types for different input SNRs using the proposed linear MMSE filter.

DOI: 10.1109/ASPAA.2011.6082303

2 Figures and Tables

Showing 1-10 of 16 references

Perceptual evaluation of speech quality (PESQ), and objective method for end-to-end speech quality assesment of narrowband telephone networks and speech codecs

  • I.-T P
  • 2000
Highly Influential
14 Excerpts

An evaluation of noise power spectral density estimation algorithms in adverse acoustic environments

  • J Taghia, N Mohammadiha, J Sang, V Bouse, R Martin
  • 2011
1 Excerpt

Digital Speech Transmission: Enhancement, Coding and Error Concealment

  • P Vary, R Martin
  • 2006
Showing 1-10 of 24 extracted citations