A Hybrid Approach for Speech Enhancement Using MoG Model and Neural Network Phoneme Classifier

@article{Chazan2016AHA,
  title={A Hybrid Approach for Speech Enhancement Using MoG Model and Neural Network Phoneme Classifier},
  author={Shlomo E. Chazan and Jacob Goldberger and Sharon Gannot},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  year={2016},
  volume={24},
  pages={2516-2530}
}
In this paper, we present a single-microphone speech enhancement algorithm. A hybrid approach is proposed merging the generative mixture of Gaussians MoG model and the discriminative deep neural network DNN. The proposed algorithm is executed in two phases, the training phase, which does not recur, and the test phase. First, the noise-free speech log-power spectral density is modeled as an MoG, representing the phoneme-based diversity in the speech signal. A DNN is then trained with phoneme… CONTINUE READING