Look Who's Talking: Speaker Detection using Video and Audio Correlation

  title={Look Who's Talking: Speaker Detection using Video and Audio Correlation},
  author={Ross Cutler and Larry S. Davis},
  booktitle={IEEE International Conference on Multimedia and Expo},
The visual motion of the mouth and the corresponding audio data generated when a person speaks are highly correlated. This fact has been exploited for lip/speechreading a nd for improving speech recognition. We describe a method of automatically detecting a talking person (both spatiall y and temporally) using video and audio data from a single microphone. The audio-visual correlation is learned using a TDNN, which is then used to perform a spatio-temporal search for a speaking person… CONTINUE READING
Highly Cited
This paper has 132 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.


Publications citing this paper.
Showing 1-10 of 90 extracted citations

Moving Humans Detection Based on Multi-Modal Sensor Fusion

2004 Conference on Computer Vision and Pattern Recognition Workshop • 2004
View 4 Excerpts
Highly Influenced

132 Citations

Citations per Year
Semantic Scholar estimates that this publication has 132 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 20 references

Neural Networks for Pattern Recognition

C. Bishop
Oxford University Press • 1995
View 12 Excerpts
Highly Influenced

"Eigenlips" for robust speech recognition

View 3 Excerpts
Highly Influenced

Morgan.Speech and audio signal processing

N. B. Gold
View 1 Excerpt

Voice puppetry

M. Brand
View 1 Excerpt

Neural Network-Based Face Detection

IEEE Trans. Pattern Anal. Mach. Intell. • 1998
View 1 Excerpt

X Vision: A Portable Substrate for Real-Time Vision Applications

Computer Vision and Image Understanding • 1998
View 2 Excerpts

Recurrence plots revisited

M. Casdagli
Physica D, 108:12– 44 • 1997
View 1 Excerpt

Similar Papers

Loading similar papers…