A robust audio-visual speech recognition using audio-visual voice activity detection

@inproceedings{Tamura2010ARA,
  title={A robust audio-visual speech recognition using audio-visual voice activity detection},
  author={Satoshi Tamura and Masato Ishikawa and Takashi Hashiba and Shin'ichi Takeuchi and Satoru Hayamizu},
  booktitle={INTERSPEECH},
  year={2010}
}
This paper proposes a novel speech recognition method combining Audio-Visual Voice Activity Detection (AVVAD) and Audio-Visual Automatic Speech Recognition (AVASR). AVASR has been developed to enhance the robustness of ASR in noisy environments, using visual information in addition to acoustic features. Similarly, AVVAD increases the precision of VAD in noisy conditions, which detects presence of speech from an audio signal. In our approach, AVVAD is conducted as a preprocessing followed by an… CONTINUE READING