Visual Speech Synthesis by Morphing Visemes

  title={Visual Speech Synthesis by Morphing Visemes},
  author={Tony Ezzat and Tomaso A. Poggio},
  journal={International Journal of Computer Vision},
We present MikeTalk, a text-to-audiovisual speech synthesizer which converts input text into an audiovisual speech stream. MikeTalk is built using visemes, which are a small set of images spanning a large range of mouth shapes. The visemes are acquired from a recorded visual corpus of a human subject which is specifically designed to elicit one instantiation of each viseme. Using optical flow methods, correspondence from every viseme to every other viseme is computed automatically. By morphing… CONTINUE READING
Highly Influential
This paper has highly influenced 11 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 161 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 100 extracted citations

161 Citations

Citations per Year
Semantic Scholar estimates that this publication has 161 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 34 references

The Festival Speech Synthesis System

  • A. Black, P. Taylor
  • University of Edinburgh
  • 1997
Highly Influential
3 Excerpts

An advanced morphing algorithm for interpolating phoneme images to simulate speech

  • S. H. Watson, J. R. Wright, K. C. Scott, D. S. Kagels, D. Freda, K. J. Hussey
  • Jet Propulsion Laboratory, California Institute…
  • 1997
1 Excerpt

Similar Papers

Loading similar papers…