Triphone based unit selection for concatenative visual speech synthesis

@article{Huang2002TriphoneBU,
  title={Triphone based unit selection for concatenative visual speech synthesis},
  author={Fu Jie Huang and Eric Cosatto and Hans Peter Graf},
  journal={2002 IEEE International Conference on Acoustics, Speech, and Signal Processing},
  year={2002},
  volume={2},
  pages={II-2037-II-2040}
}
Concatenative visual speech synthesis selects frames from a large recorded video database of mouth shapes to generate photo-realistic talking head sequences. The synthesized sequence must exhibit precise lip-sound synchronization and smooth articulation. The selection process for finding the best lip shapes has been computationally expensive [1], limiting the speed of the synthesis to far less than real time. In this paper, we propose a rapid unit selection approach based on triphone units… CONTINUE READING
Highly Cited
This paper has 39 citations. REVIEW CITATIONS

References

Publications referenced by this paper.

Similar Papers

Loading similar papers…