Improving large vocabulary continuous speech recognition by combining GMM-based and reservoir-based acoustic modeling

Abstract

In earlier work we have shown that good phoneme recognition is possible with a so-called reservoir, a special type of recurrent neural network. In this paper, different architectures based on Reservoir Computing (RC) for large vocabulary continuous speech recognition are investigated. Besides experiments with HMM hybrids, it is shown that a RC-HMM tandem can achieve the same recognition accuracy as a classical HMM, which is a promising result for such a fairly new paradigm. It is also demonstrated that a state-level combination of the scores of the tandem and the baseline HMM leads to a significant improvement over the baseline. A word error rate reduction of the order of 20% relative is possible.

DOI: 10.1109/SLT.2012.6424206

3 Figures and Tables

Cite this paper

@inproceedings{Triefenbach2012ImprovingLV, title={Improving large vocabulary continuous speech recognition by combining GMM-based and reservoir-based acoustic modeling}, author={Fabian Triefenbach and Kris Demuynck and Jean-Pierre Martens}, booktitle={SLT}, year={2012} }