Output-Gate Projected Gated Recurrent Unit for Speech Recognition

@inproceedings{Cheng2018OutputGatePG,
  title={Output-Gate Projected Gated Recurrent Unit for Speech Recognition},
  author={Gaofeng Cheng and Daniel Povey and L. Huang and J. Xu and S. Khudanpur and Y. Yan},
  booktitle={INTERSPEECH},
  year={2018}
}
In this paper, we describe the work on accelerating decoding speed while improving the decoding accuracy. Firstly, we propose an architecture which we call Projected Gated Recurrent Unit (PGRU) for automatic speech recognition (ASR) tasks, and show that the PGRU could outperform the standard GRU consistently. Secondly, in order to improve the PGRU’s generalization, especially for large-scale ASR task, the Output-gate PGRU (OPGRU) is proposed. Finally, time delay neural network (TDNN) and… Expand
12 Citations
An Exploration of Recurrent Units for Automatic Speech Recognition with RNN based Acoustic Model
  • H. Zhang
  • Computer Science
  • 2019 2nd International Conference on Information Systems and Computer Aided Education (ICISCAE)
  • 2019
Projected Minimal Gated Recurrent Unit for Speech Recognition
  • 1
  • Highly Influenced
  • PDF
Simplified LSTMS for Speech Recognition
Acoustic model training using self-attention for low-resource speech recognition
  • Highly Influenced
  • PDF
The LeVoice Far-Field Speech Recognition System for VOiCES from a Distance Challenge 2019
  • 1
  • PDF
Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
  • 4
  • PDF
A Comparison of Lattice-free Discriminative Training Criteria for Purely Sequence-trained Neural Network Acoustic Models
  • Chao Weng, Dong Yu
  • Computer Science, Mathematics
  • ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2019
  • 4
  • PDF
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation
  • PDF
Voiceai Systems to NIST Sre19 Evaluation: Robust Speaker Recognition on Conversational Telephone Speech
  • Highly Influenced
Non-autoregressive Deliberation-Attention based End-to-End ASR
...
1
2
...

References

SHOWING 1-10 OF 26 REFERENCES
Improving Speech Recognition by Revising Gated Recurrent Units
  • 38
  • PDF
Training Deep Bidirectional LSTM Acoustic Model for LVCSR by a Context-Sensitive-Chunk BPTT Approach
  • K. Chen, Qiang Huo
  • Computer Science
  • IEEE/ACM Transactions on Audio, Speech, and Language Processing
  • 2016
  • 66
  • PDF
Light Gated Recurrent Units for Speech Recognition
  • 106
  • PDF
Low Latency Acoustic Modeling Using Temporal Convolution and LSTMs
  • 114
  • PDF
Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI
  • 575
  • PDF
Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition
  • 543
  • Highly Influential
  • PDF
Achieving Human Parity in Conversational Speech Recognition
  • 433
  • PDF
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
  • 5,591
  • Highly Influential
  • PDF
English Conversational Telephone Speech Recognition by Humans and Machines
  • 275
  • PDF
Speaker adaptation of neural network acoustic models using i-vectors
  • 545
  • PDF
...
1
2
3
...