Audiovisual voice activity detection using off-the-shelf cameras


This paper presents a new audiovisual voice activity detection (VAD) method for off-the-shelf cameras equipped with a color sensor and two microphones. The motion of particles in the mouth region of each face detected by the camera serves as the video cue, while the Generalized Cross Correlation with the PHase Transform (GCC-PHAT) serves as the audio cue. We then…
DOI: 10.1109/ICIP.2015.7351533
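
For readers unfamiliar with the audio cue named in the abstract, the following is a minimal sketch of textbook GCC-PHAT for a two-microphone pair. It is not the authors' implementation: the function name gcc_phat, its parameters (fs, max_tau), and the use of NumPy are assumptions made for illustration. The sketch whitens the cross-power spectrum so that only phase remains, then picks the lag with the strongest correlation peak as the time delay between the two channels.

```python
import numpy as np

def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the delay (in seconds) of `sig` relative to `ref` with GCC-PHAT.

    Hypothetical helper for illustration only.
    Returns (tau, cc), where cc holds the correlation over lags
    -max_shift..+max_shift in samples.
    """
    # Zero-pad so the circular correlation behaves like a linear one.
    n = len(sig) + len(ref)
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)

    # Cross-power spectrum, normalized to unit magnitude (the PHAT weighting).
    R = SIG * np.conj(REF)
    R /= np.maximum(np.abs(R), 1e-12)  # guard against division by zero

    cc = np.fft.irfft(R, n=n)

    # Optionally restrict the search to physically plausible delays.
    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(round(fs * max_tau)), max_shift))

    # Reorder so that negative lags come first, then lag 0, then positive lags.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))

    shift = int(np.argmax(np.abs(cc))) - max_shift
    return shift / float(fs), cc


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal(1024)
    # Second channel is the first one delayed by 5 samples.
    mic_a = np.concatenate((x, np.zeros(5)))
    mic_b = np.concatenate((np.zeros(5), x))
    tau, _ = gcc_phat(mic_b, mic_a, fs=16000)
    print("estimated delay in samples:", round(tau * 16000))
```

With two microphones at a known spacing, an estimated delay of this kind is commonly converted to a direction of arrival, which can then be compared against the position of a detected face. How the paper combines the audio and video cues is not stated in the truncated abstract, so that step is left out here.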


Cite this paper

@article{Montazzolli2015AudiovisualVA, title={Audiovisual voice activity detection using off-the-shelf cameras}, author={S. Montazzolli and C. R. Jung and Dan Gelb}, journal={2015 IEEE International Conference on Image Processing (ICIP)}, year={2015}, pages={3886-3890} }