Integration of Audio and Visual Information for Content-based Video Segmentation


A video sequence usually consists of separate scenes, and each scene includes many shots. For video understanding purposes, it is most important to detect scene breaks. To analyze the content of each scene, detection of shot breaks is also required. Usually, a scene break is associated with a simultaneous change of image, motion, and audio characteristics, while a shot break is only accompanied with changes in image or motion or both. We propose to use audio information along with image and motion information to accomplish segmentation at different levels. Promising results have been obtained with videos digitized from TV programs.

Extracted Key Phrases

10 Figures and Tables


Citations per Year

98 Citations

Semantic Scholar estimates that this publication has 98 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@inproceedings{Huang1998IntegrationOA, title={Integration of Audio and Visual Information for Content-based Video Segmentation}, author={Jincheng Huang and Zhu Liu and Yao Wang}, booktitle={ICIP}, year={1998} }