A real-time prototype for small-vocabulary audio-visual ASR

We present a prototype for the automatic recognition of audio-visual speech, developed to augment the IBM ViaVoice™ speech recognition system. Frontal-face, full-frame video is captured over a USB 2.0 interface with an inexpensive PC camera and processed to obtain appearance-based visual features. Subsequently, these are combined with audio…