A system for audio-visual speech recognition


In this work, a system of audio visual speech recognition will be presented. A new hybrid visual feature combination, which is suitable for audio -visual speech recognition was implemented. The features comprise both the shape and the appearance of lips, the dimensional reduction is applied using discrete cosine transform (DCT). A large visual speech database of the German language has been assembled, the German Audio -Visual Database (GAVD). The conducted experiments using only visual features resulted in a high recognition accuracy and improved the audio-visual speech recognition drastically.

Extracted Key Phrases

3 Figures and Tables

Cite this paper

@inproceedings{Shdaifat2005ASF, title={A system for audio-visual speech recognition}, author={Islam Shdaifat and Rolf-Rainer Grigat}, booktitle={INTERSPEECH}, year={2005} }