Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks

@article{Prabhavalkar2015AutomaticGC,
  title={Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks},
  author={Rohit Prabhavalkar and Raziel Alvarez and Carolina Parada and Preetum Nakkiran and Tara N. Sainath},
  journal={2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year={2015},
  pages={4704-4708}
}
We explore techniques to improve the robustness of small-footprint keyword spotting models based on deep neural networks (DNNs) in the presence of background noise and in far-field conditions. We find that system performance can be improved significantly, with relative improvements up to 75% in far-field conditions, by employing a combination of multi-style training and a proposed novel formulation of automatic gain control (AGC) that estimates the levels of both speech and background noise… CONTINUE READING
Highly Cited
This paper has 21 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.

Citations

Publications citing this paper.
Showing 1-10 of 16 extracted citations

Generalized End-to-End Loss for Speaker Verification

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2018
View 5 Excerpts
Highly Influenced

Attention-Based Models for Text-Dependent Speaker Verification

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2018
View 2 Excerpts

Speaker Diarization with LSTM

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2018
View 1 Excerpt

Supervised Noise Reduction for Multichannel Keyword Spotting

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2018

Automatic gain control with integrated signal enhancement for specified target and background-noise levels

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2017
View 1 Excerpt

CNN-based bottleneck feature for noise robust query-by-example spoken term detection

2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) • 2017
View 1 Excerpt

References

Publications referenced by this paper.
Showing 1-10 of 19 references

Investigation of Speech Separation as a Front-End for Noise Robust Speech Recognition

IEEE/ACM Transactions on Audio, Speech, and Language Processing • 2014
View 1 Excerpt

Small-footprint keyword spotting using deep neural networks

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2014
View 19 Excerpts

Developing speech recognition systems for corpus indexing under the IARPA Babel program

2013 IEEE International Conference on Acoustics, Speech and Signal Processing • 2013

On rectified linear units for speech processing

2013 IEEE International Conference on Acoustics, Speech and Signal Processing • 2013
View 1 Excerpt

Similar Papers

Loading similar papers…