Supervised speech dereverberation in noisy environments using exemplar-based sparse representations

Abstract

Exemplar-based techniques, in which noisy speech is decomposed as a linear combination of speech and noise exemplars stored in a dictionary, have been successfully used for speech enhancement in noisy environments. This paper extends the approach to speech dereverberation in noisy environments by means of a nonnegative approximation of the noisy reverberant speech in the frequency domain. We propose a novel approach that estimates the room impulse response (RIR) together with the speech and noise estimates using a nonnegative matrix deconvolution (NMD)-based technique. In addition, we extend to noisy scenarios an existing technique based on nonnegative matrix factorisation (NMF) that performs speech dereverberation in noise-free environments. New estimators for jointly obtaining the RIR and exemplar weights are presented for both the NMD- and NMF-based formulations. The proposed techniques are evaluated on noise-free and noisy reverberant speech from the CHiME-2 WSJ0 database and are shown to yield better speech enhancement in terms of signal-to-distortion ratio (SDR), perceptual evaluation of speech quality (PESQ) and cepstral distance (CD) measures.
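The exemplar-based decomposition that the abstract builds on can be illustrated with a minimal sketch. It covers only the baseline idea (noisy speech approximated as a nonnegative combination of fixed speech and noise exemplars) and not the RIR estimation or the NMD formulation proposed in the paper. The choice of a Kullback-Leibler cost with multiplicative updates, the soft-mask reconstruction, and all names (`estimate_activations`, `enhance`, `sparsity`) are assumptions made for illustration, not the authors' estimators.

```python
import numpy as np


def estimate_activations(Y, A, n_iter=200, sparsity=0.0, eps=1e-12):
    """Estimate nonnegative weights X such that Y is approximately A @ X.

    Y: (n_freq, n_frames) nonnegative magnitude spectrogram.
    A: (n_freq, n_exemplars) fixed nonnegative dictionary.
    Assumption: KL divergence cost with an L1 sparsity penalty, minimised
    by standard multiplicative updates. The dictionary A is never updated.
    """
    rng = np.random.default_rng(0)
    X = rng.random((A.shape[1], Y.shape[1])) + eps
    col_sums = A.sum(axis=0)[:, None]  # equals A.T @ ones, one value per exemplar
    for _ in range(n_iter):
        ratio = Y / (A @ X + eps)
        X *= (A.T @ ratio) / (col_sums + sparsity + eps)
    return X


def enhance(Y, A_speech, A_noise, **kwargs):
    """Return a speech magnitude estimate from the noisy magnitude Y."""
    A = np.hstack([A_speech, A_noise])  # joint speech + noise dictionary
    X = estimate_activations(Y, A, **kwargs)
    k = A_speech.shape[1]
    S_hat = A_speech @ X[:k]  # part explained by speech exemplars
    N_hat = A_noise @ X[k:]   # part explained by noise exemplars
    mask = S_hat / (S_hat + N_hat + 1e-12)  # soft mask with values in [0, 1)
    return mask * Y
```

In this sketch the dictionary stays fixed and only the weights are estimated, which is what makes the method supervised. Handling reverberation would additionally require a model of the RIR acting on the speech term, which is the part the paper contributes and which is not attempted here.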

DOI: 10.1109/ICASSP.2016.7471656
