Audio-visual speech recognition

Known as: AVSR, Audio visual speech recognition 
Audio visual speech recognition (AVSR) is a technique that uses image processing capabilities in lip reading to aid speech recognition systems in… (More)
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2014
Highly Cited
2014
Audio-visual speech recognition (AVSR) system is thought to be one of the most promising solutions for reliable speech… (More)
  • figure 1
  • figure 2
  • table 1
  • table 2
  • table 3
Is this relevant?
Highly Cited
2010
Highly Cited
2010
In audio-visual automatic speech recognition (AVASR) both acoustic and visual modalities of speech are used to identify what a… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
2009
2009
The problem of feature selection has been thoroughly analyzed in the context of pattern classification, with the purpose of… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2002
Highly Cited
2002
The use of visual features in audio-visual speech recognition (AVSR) is justified by both the speech generation mechanism, which… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2002
Highly Cited
2002
In recent years several speech recognition systems that use visual together with audio information showed significant increase in… (More)
  • table 1
  • figure 1
  • figure 3
  • figure 2
  • table 3
Is this relevant?
Highly Cited
2002
Highly Cited
2002
It has been shown that integration of acoustic and visual information especially in noisy conditions yields improved speech… (More)
  • figure 1
  • figure 2
  • figure 3
  • table 1
  • figure 4
Is this relevant?
Highly Cited
2001
Highly Cited
2001
This paper addresses the problem of audio-visual information fusion to provide highly robust speech recognition. We investigate… (More)
  • figure 1
  • figure 2
  • figure 3
  • table 1
Is this relevant?
1998
1998
We address the problem of robust lip tracking visual speech feature extraction and sensor integration for audio visual speech… (More)
Is this relevant?
Highly Cited
1998
Highly Cited
1998
We propose the use of discriminative training by means of the generalized probabilistic descent (GPD) algorithm to estimate… (More)
Is this relevant?
Highly Cited
1996
Highly Cited
1996
Developments in dynamic contour tracking permit sparse representation of the outlines of moving contours. Given the increasing… (More)
  • figure 3
Is this relevant?