Extraction of Audio Features Specific to Speech Production for Multimodal Speaker Detection

@article{Besson2008ExtractionOA,
  title={Extraction of Audio Features Specific to Speech Production for Multimodal Speaker Detection},
  author={Patricia Besson and Vlad Popovici and Jean-Marc Vesin and Jean-Philippe Thiran and Murat Kunt},
  journal={IEEE Transactions on Multimedia},
  year={2008},
  volume={10},
  pages={63-73}
}
A method that exploits an information theoretic framework to extract optimized audio features using video information is presented. A simple measure of mutual information (MI) between the resulting audio and video features allows the detection of the active speaker among different candidates. This method involves the optimization of an Mi-based objective function. No approximation is needed to solve this optimization problem, neither for the estimation of the probability density functions (pdfs… CONTINUE READING
Highly Cited
This paper has 63 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 29 extracted citations

Towards real-time audiovisual speaker localization

2011 19th European Signal Processing Conference • 2011
View 5 Excerpts
Highly Influenced

Audio-visual speaker localization via weighted clustering

2014 IEEE International Workshop on Machine Learning for Signal Processing (MLSP) • 2014
View 1 Excerpt

Cognitive workload and affective state: A computational study using Bayesian networks

2012 6th IEEE International Conference Intelligent Systems • 2012
View 1 Excerpt

63 Citations

01020'09'12'15'18
Citations per Year
Semantic Scholar estimates that this publication has 63 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
Showing 1-10 of 35 references

Speaker association with signal-level audiovisual fusion

IEEE Transactions on Multimedia • 2004
View 9 Excerpts
Highly Influenced

New ideas in optimization

View 5 Excerpts
Highly Influenced

Analysis of multimodal signals using redundant representations

IEEE International Conference on Image Processing 2005 • 2005
View 1 Excerpt

Similar Papers

Loading similar papers…