Visual speech recognition for isolated digits using discrete cosine transform and local binary pattern features

@article{Jain2017VisualSR,
  title={Visual speech recognition for isolated digits using discrete cosine transform and local binary pattern features},
  author={Abhilash Jain and G. N. Rathna},
  journal={2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP)},
  year={2017},
  pages={368-372}
}
Visual Speech Recognition (VSR) deals with the task of extracting speech information from visual cues from a person's face while speaking. Accurate lip segmentation and modeling are essential in any VSR algorithm for good feature extraction. However, lip modeling is a complicated task and is not very robust in natural conditions. This paper describes a novel technique for limited vocabulary visual-only speech recognition that does not use lip modeling. For visual feature extraction, Discrete… CONTINUE READING

Similar Papers

Citations

Publications citing this paper.

References

Publications referenced by this paper.
SHOWING 1-10 OF 14 REFERENCES

Multimodal Deep Learning

VIEW 13 EXCERPTS
HIGHLY INFLUENTIAL

An audio-visual corpus for speech perception and automatic speech recognition.

  • The Journal of the Acoustical Society of America
  • 2006
VIEW 6 EXCERPTS
HIGHLY INFLUENTIAL

Research on Visual Speech Feature Extraction

  • 2009 International Conference on Computer Engineering and Technology
  • 2009
VIEW 2 EXCERPTS

A PCA Based Visual DCT Feature Extraction Method for Lip-Reading

  • 2006 International Conference on Intelligent Information Hiding and Multimedia
  • 2006
VIEW 1 EXCERPT

Face Description with Local Binary Patterns: Application to Face Recognition

  • IEEE Transactions on Pattern Analysis and Machine Intelligence
  • 2006
VIEW 1 EXCERPT