An Asynchronous Dbn for Audio-visual Speech Recognition

@article{Saenko2006AnAD,
  title={An Asynchronous Dbn for Audio-visual Speech Recognition},
  author={Kate Saenko and Karen Livescu},
  journal={2006 IEEE Spoken Language Technology Workshop},
  year={2006},
  pages={154-157}
}
We investigate an asynchronous two-stream dynamic Bayesian network-based model for audio-visual speech recognition. The model allows the audio and visual streams to de-synchronize within the boundaries of each word. The probability of de-synchronization by a given number of states is learned during training. This type of asynchrony has been previously used for pronunciation modeling and for visual speech recognition (lipreading); however, this is its first application to audiovisual speech… CONTINUE READING
Highly Cited
This paper has 19 citations. REVIEW CITATIONS
13 Extracted Citations
15 Extracted References
Similar Papers

Citing Papers

Publications influenced by this paper.
Showing 1-10 of 13 extracted citations

Referenced Papers

Publications referenced by this paper.
Showing 1-10 of 15 references

Similar Papers

Loading similar papers…