Audio content analysis for online audiovisual data segmentation and classification

@article{Zhang2001AudioCA,
  title={Audio content analysis for online audiovisual data segmentation and classification},
  author={Tong Zhang and C.-C. Jay Kuo},
  journal={IEEE Trans. Speech and Audio Processing},
  year={2001},
  volume={9},
  pages={441-457}
}
While current approaches for audiovisual data segmentation and classification are mostly focused on visual cues, audio signals may actually play a more important role in content parsing for many applications. An approach to automatic segmentation and classification of audiovisual data based on audio content analysis is proposed. The audio signal from movies or TV programs is segmented and classified into basic types such as speech, music, song, environmental sound, speech with music background… CONTINUE READING
Highly Influential
This paper has highly influenced 27 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 443 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 270 extracted citations

444 Citations

0204060'01'04'08'12'16
Citations per Year
Semantic Scholar estimates that this publication has 444 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
Showing 1-10 of 33 references

Bregman,Auditory Scene Analysis: The Perceptual Organization of Sound

  • A S.
  • 1990
Highly Influential
4 Excerpts

Hamadaet al., “Video handling with music and speech detection,

  • K. Minami, A. Akutsu
  • IEEE Multimedia,
  • 1998
1 Excerpt

Similar Papers

Loading similar papers…