Towards Real-Time Mispronunciation Detection in Kids' Speech

  title={Towards Real-Time Mispronunciation Detection in Kids' Speech},
  author={P. Plantinga and E. Fosler-Lussier},
  journal={2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)},
  • P. Plantinga, E. Fosler-Lussier
  • Published 2019
  • Computer Science, Engineering
  • 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
Modern mispronunciation detection and diagnosis systems have seen significant gains in accuracy due to the introduction of deep learning. However, these systems have not been evaluated for the ability to be run in real-time, an important factor in applications that provide rapid feedback. In particular, the state-of-the-art uses bi-directional recurrent networks, where a uni-directional network may be more appropriate. Teacher-student learning is a natural approach to use to improve a uni… Expand


Mispronunciation Detection and Diagnosis in L2 English Speech Using Multidistribution Deep Neural Networks
  • 58
  • PDF
A comparison-based approach to mispronunciation detection
  • 39
  • Highly Influential
  • PDF
Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams
  • 35
  • PDF
An Investigation of a Knowledge Distillation Method for CTC Acoustic Models
  • 25
  • PDF
CNN-RNN-CTC Based End-to-end Mispronunciation Detection and Diagnosis
  • Wai-Kim Leung, X. Liu, H. Meng
  • Computer Science
  • ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2019
  • 11
  • PDF
EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding
  • 551
  • PDF
Acoustic modelling with CD-CTC-SMBR LSTM RNNS
  • 106
  • PDF
Improved Knowledge Distillation from Bi-Directional to Uni-Directional LSTM CTC for End-to-End Speech Recognition
  • 20
  • Highly Influential
Sequence-Level Knowledge Distillation
  • 298
  • PDF