A general artificial neural network extension for HTK

@inproceedings{Zhang2015AGA,
  title={A general artificial neural network extension for HTK},
  author={Chao Zhang and Philip C. Woodland},
  booktitle={INTERSPEECH},
  year={2015}
}
This paper describes the recently developed artificial neural network (ANN) modules in HTK hidden Markov model toolkit, which enables ANN models with very general feed-forward architectures to be used for either acoustic modelling or feature extraction. The HTK ANN extension includes many recent ANN-based speech processing techniques, such as sequence training, model stacking, speaker adaptation, and parameterised activation functions. The implementation allows efficient training by supporting… CONTINUE READING
Highly Cited
This paper has 22 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 18 extracted citations

Estimating Speech Recognition Accuracy Based on Error Type Classification

IEEE/ACM Transactions on Audio, Speech, and Language Processing • 2016
View 4 Excerpts
Highly Influenced

High Order Recurrent Neural Networks for Acoustic Modelling

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2018

Improved Tdnns Using Deep Kernels and Frequency Dependent Grid-RNNS

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2018

Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2017

Sequence training of DNN acoustic models with natural gradient

2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) • 2017
View 1 Excerpt

References

Publications referenced by this paper.
Showing 1-10 of 39 references

Quicknet

D. Johnson
http://www1.icsi.berkeley. edu/speech/qn.html.
View 5 Excerpts
Highly Influenced

Asynchronous stochastic optimization for sequence training of deep neural networks

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) • 2014
View 9 Excerpts
Highly Influenced

Pattern Recognition and Machine Learning

J. Electronic Imaging • 2007
View 9 Excerpts
Highly Influenced

BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation

F. Flego, L.-L. Wang, C. Zhang, M. J. F. Gales, P. C. Woodland
Proc . Interspeech ’ • 2014

Similar Papers

Loading similar papers…