The fusion of visual lip movements and mixed speech signals for robust speech separation

@article{Aarabi2004TheFO,
  title={The fusion of visual lip movements and mixed speech signals for robust speech separation},
  author={Parham Aarabi and Bobji Mungamuru},
  journal={Information Fusion},
  year={2004},
  volume={5},
  pages={103-117}
}
A technique for the early fusion of visual lip movements and a vector of mixed speech signals is proposed. This technique involves the initial recreation of speech signals entirely from the visual lip motions of each speaker. By using geometric parameters of the lips obtained from the Tulips1 database and the Audio-Visual Speech Processing dataset, a virtual speech signal is recreated by using audiovisual training segments as a basis for the recreation. It is shown that the visually created… CONTINUE READING

References

Publications referenced by this paper.
Showing 1-10 of 32 references

Robust Speech Separation Using Visually Constructed Speech Signals. To appear in Proceedings of Sensor Fusion: Architectures, Algorithms, and Applications VI (AeroSense’01)

  • P. Aarabi, N. H. Khameneh
  • 2002
2 Excerpts

Multi-modal Sound Localization Using Audiovisual Information Fusion, Information Fusion, Volume 3, Issue

  • P. Aarabi, S Zaky
  • 2001
3 Excerpts

Integrated Vision and Sound Localization

  • P. Aarabi, S. Zaky
  • In Proceedings of the 3 International Conference…
  • 2000
3 Excerpts

Similar Papers

Loading similar papers…