Corpus ID: 3264579

Multi-Channel Speech Recognition : LSTMs All the Way Through

  title={Multi-Channel Speech Recognition : LSTMs All the Way Through},
  author={Hakan Erdogan and Tomoki Hayashi and J. Hershey and T. Hori and Chiori Hori and Wei-Ning Hsu and Suyoun Kim and Jonathan Le Roux and Zhong Meng and Shinji Watanabe},
Long Short-Term Memory recurrent neural networks (LSTMs) have demonstrable advantages on a variety of sequential learning tasks. In this paper we demonstrate an LSTM “triple threat” system for speech recognition, where LSTMs drive the three main subsystems: microphone array processing, acoustic modeling, and language modeling. This LSTM trifecta is applied to the CHiME-4 distant recognition challenge. Our previous state-of-the-art ASR systems for the previous CHiME challenge employed LSTM mask… Expand
Speaker Adaptation for Multichannel End-to-End Speech Recognition
Densenet Blstm for Acoustic Modeling in Robust ASR
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Joint Training of Complex Ratio Mask Based Beamformer and Acoustic Model for Noise Robust Asr
  • Y. Xu, Chao Weng, +4 authors Dong Yu
  • Computer Science
  • ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2019
L2RS: A Learning-to-Rescore Mechanism for Automatic Speech Recognition
Character-Aware Attention-Based End-to-End Speech Recognition
Non-Uniform MCE Training of Deep Long Short-Term Memory Recurrent Neural Networks for Keyword Spotting
Unified Architecture for Multichannel End-to-End Speech Recognition With Neural Beamforming
Deep Learning for Environmentally Robust Speech Recognition


The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices
Recurrent deep neural networks for robust speech recognition
Deep beamforming networks for multi-channel speech recognition
  • X. Xiao, Shinji Watanabe, +7 authors Dong Yu
  • Computer Science, Engineering
  • 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2016
Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition
End-to-end attention-based large vocabulary speech recognition
KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition
Joint CTC-attention based end-to-end speech recognition using multi-task learning
Towards End-To-End Speech Recognition with Recurrent Neural Networks