Asynchronous integration of visual information in an automatic speech recognition system

@article{Alissali1996AsynchronousIO,
  title={Asynchronous integration of visual information in an automatic speech recognition system},
  author={Mamoun Alissali and P. Del{\'e}glise and A. Rogozan},
  journal={Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96},
  year={1996},
  volume={1},
  pages={34-37 vol.1}
}
  • Mamoun Alissali, P. Deléglise, A. Rogozan
  • Published 1996
  • Computer Science
  • Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96
  • Deals with that integration of visual data in automatic speech recognition systems. We first describe the framework of our research; the development of advanced multi-user multi-modal interfaces. Then we present audio-visual speech recognition problems in general, and the ones we are interested in, in particular. After a very brief discussion of existing systems, we present the architecture of our audio-only reference and baseline systems and describe our audio-visual systems. The major part of… CONTINUE READING
    17 Citations
    Recent advances in the automatic recognition of audiovisual speech
    • 698
    • PDF
    Audio-Visual Automatic Speech Recognition: An Overview
    • 339
    Fusion of Audio-Visual Information for Integrated Speech Processing
    • 25
    CHAPTER 10 Audio-Visual Automatic Speech Recognition : An Overview
    • 5
    • PDF
    Continuous visual speech recognition using geometric lip-shape models and neural networks
    • 5
    • PDF
    Using likelihood L-statistics to measure confidence in audio-visual speech recognition
    • 4
    Adaptive Fusion of Speech and Lip Information for Robust Speaker Identification
    • 60
    Speech recognition in adverse environments using lip information
    • D. Thambiratnam, T. Wark, S. Sridharan, V. Chandran
    • Computer Science, Engineering
    • TENCON '97 Brisbane - Australia. Proceedings of IEEE TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications (Cat. No.97CH36162)
    • 1997
    • 2
    • PDF

    References

    SHOWING 1-10 OF 18 REFERENCES
    Improving connected letter recognition by lipreading
    • 164
    • PDF
    Effects of phonetic context on audio-visual intelligibility of French.
    • 137
    The forward-backward search algorithm
    • 103
    T
    • 146,878
    • PDF
    Scalar- and planar-valued curve fitting using splines under tension
    • A. Cline
    • Mathematics, Computer Science
    • CACM
    • 1974
    • 233
    Applications multimodales pour interfaces et bornes evolu ees
    • Ecole Th ematique \Fondements et Perspectives en Traitement Automatique de la Parole,
    • 1995
    Applications multimodales pour interfaces et bornes evolu ees
    • H. M eloni, editor, Ecole Th ematique \Fondements et Perspectives en Traitement Automatique de la Parole
    • 1995
    Integrating visual and acoustic information in speech recognition system based on
    • HMM. ICPhS,
    • 1995