Lip Reading Sentences in the Wild

  title={Lip Reading Sentences in the Wild},
  author={Joon Son Chung and A. Senior and Oriol Vinyals and Andrew Zisserman},
  journal={2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  • Joon Son Chung, A. Senior, +1 author Andrew Zisserman
  • Published 2017
  • Computer Science
  • 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
  • The goal of this work is to recognise phrases and sentences being spoken by a talking face, with or without the audio. Unlike previous works that have focussed on recognising a limited number of words or phrases, we tackle lip reading as an open-world problem – unconstrained natural language sentences, and in the wild videos. Our key contributions are: (1) a Watch, Listen, Attend and Spell (WLAS) network that learns to transcribe videos of mouth motion to characters, (2) a curriculum… CONTINUE READING
    342 Citations
    Deep Audio-Visual Speech Recognition
    • 149
    • PDF
    Learning to lip read words by watching videos
    • 35
    Lip Reading Sentences Using Deep Learning With Only Visual Cues
    • PDF
    Experimenting with lipreading for large vocabulary continuous speech recognition
    • K. Palecek
    • Computer Science
    • Journal on Multimodal User Interfaces
    • 2018
    • 1
    Large-Scale Visual Speech Recognition
    • 43
    • PDF
    A Lip Reading Model Using CNN with Batch Normalization
    • 4
    Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
    • 6
    • Highly Influenced
    • PDF
    Word Spotting in Silent Lip Videos
    • 12
    • Highly Influenced
    • PDF
    Deep Learning for Lip Reading using Audio-Visual Information for Urdu Language
    • 5
    • Highly Influenced
    • PDF


    LipNet: Sentence-level Lipreading
    • 104
    • PDF
    Listen, Attend and Spell
    • 288
    • PDF
    Comparing visual features for lipreading
    • 64
    • PDF
    A review of recent advances in visual speech decoding
    • 121
    Lipreading with long short-term memory
    • 131
    • PDF
    Attention-Based Models for Speech Recognition
    • 1,442
    • Highly Influential
    • PDF
    Lipreading using convolutional neural network
    • 79
    • PDF
    Deep multimodal learning for Audio-Visual Speech Recognition
    • 148
    • PDF
    Lip Reading in the Wild
    • 243
    • PDF