Integrated adaptation with multi-factor joint-learning for far-field speech recognition

@article{Qian2016IntegratedAW,
  title={Integrated adaptation with multi-factor joint-learning for far-field speech recognition},
  author={Yanmin Qian and Tian Tan and Dong Yu and Yu Zhang},
  journal={2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year={2016},
  pages={5770-5774}
}
Although great progress has been made in automatic speech recognition (ASR), significant performance degradation still exists in distant talking scenarios due to significantly lower signal power. In this paper, a novel adaptation framework, named integrated adaptation with multi-factor joint-learning, is proposed to improve the recognition accuracy for distant speech recognition. We explore and extract speaker, phone and environment factor representations using deep neural networks (DNNs… CONTINUE READING

Citations

Publications citing this paper.
Showing 1-10 of 10 extracted citations

Online environmental adaptation of CNN-based acoustic models using spatial diffuseness features

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2017
View 2 Excerpts
Highly Influenced

Deep Feature Engineering for Noise Robust Spoofing Detection

IEEE/ACM Transactions on Audio, Speech, and Language Processing • 2017

Neural Network Based Multi-Factor Aware Joint Training for Robust Speech Recognition

IEEE/ACM Transactions on Audio, Speech, and Language Processing • 2016
View 1 Excerpt

Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition

IEEE/ACM Transactions on Audio, Speech, and Language Processing • 2016

References

Publications referenced by this paper.
Showing 1-10 of 30 references

Front-End Factor Analysis For Speaker Verification

2018 International Conference on Communications (COMM) • 2018
View 1 Excerpt

An investigation into speaker informed DNN front-end for LVCSR

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2015
View 2 Excerpts

An investigation of augmenting speaker representations to improve speaker normalisation for DNN-based speech recognition

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2015
View 2 Excerpts

Deep feature for text-dependent speaker verification

Speech Communication • 2015
View 1 Excerpt

Improving speech recognition in reverberation using a room-aware deep neural network and multi-task learning

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2015
View 3 Excerpts

Similar Papers

Loading similar papers…