Exploiting the potential of auditory preprocessing for robust speech recognition by locally recurrent neural networks

Abstract

In this paper we present a robust speaker independent speech recognition system consisting of a feature extraction based on a model of the auditory periphery, and a Locally Recurrent Neural Network for scoring of the derived feature vectors. A number of recognition experiments were carried out to investigate the robustness of this combination against di erent types of noise in the test data. The proposed method is compared with Cepstral, RASTA, and JAH-RASTA processing for feature extraction and Hidden Markov Models for scoring. The presented results show that the information in features from the auditory model can be best exploited by Locally Recurrent Neural Networks. The robustness achieved by this combination is comparable to that of JAH-RASTA in combination with HMM but without any requirement for an explicit adaptation to the noise in speech pauses.

DOI: 10.1109/ICASSP.1997.596165

Extracted Key Phrases

3 Figures and Tables

Cite this paper

@inproceedings{Kasper1997ExploitingTP, title={Exploiting the potential of auditory preprocessing for robust speech recognition by locally recurrent neural networks}, author={Klaus Kasper and Herbert Reininger and Dietrich Wolf}, booktitle={ICASSP}, year={1997} }