A hierarchical system for audio classification and retrieval based on audio content analysis is presented in this paper. The system consists of three stages. The first stage is called the coarse-level audio classification and segmentation, where audio recordings are classified and segmented into speech, music, several types of environmental sounds, and silence, ...
A real-time audio segmentation and indexing scheme is presented in this paper. Audio recordings are segmented and classified into basic audio types such as silence, speech, music, song, environmental sound, speech with music background, environmental sound with music background, etc. Simple audio features such as the energy function, the average ...
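As a rough illustration of the kind of simple feature the abstract above mentions, the following is a minimal sketch of a short-time energy function used to flag silent frames. It is not the paper's implementation: the frame length, hop size, fixed threshold, and the function names `short_time_energy` and `label_silence` are all assumptions chosen for the example, and the input is a synthetic signal.

```python
import math

def short_time_energy(samples, frame_len=256, hop=128):
    """Mean squared amplitude of each frame (one common 'energy function')."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame) / frame_len)
    return energies

def label_silence(energies, threshold=1e-4):
    """Label frames below a hypothetical fixed energy threshold as silence."""
    return ["silence" if e < threshold else "non-silence" for e in energies]

# Synthetic test signal (assumed): 512 zero samples, then 512 samples
# of a 440 Hz sine sampled at 8 kHz with amplitude 0.5.
rate = 8000
signal = [0.0] * 512 + [
    0.5 * math.sin(2 * math.pi * 440 * n / rate) for n in range(512)
]

energies = short_time_energy(signal)
labels = label_silence(energies)
```

A real system would likely adapt the threshold to the recording and combine energy with other features before deciding on a segment boundary; the fixed threshold here only keeps the sketch short.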
We present a method for the classification of sound effects which exploits time-frequency analysis of audio signals and uses the hidden Markov model as the classifier. The proposed approach can be used to retrieve audio/video segments in studios, audiovisual libraries, and family entertainment applications. For example, video scenes of a gunfight can be ...
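To make the idea of using a hidden Markov model as a classifier concrete, here is a minimal sketch under stated assumptions: one discrete HMM per sound class, with a sequence assigned to the class whose model gives it the highest likelihood under the forward algorithm. The two toy models, their parameters, the class names `low_band` and `high_band`, and the quantized observation symbols are hypothetical and are not taken from the paper.

```python
def forward_likelihood(obs, start, trans, emit):
    """P(obs | model) for a discrete HMM, computed with the forward algorithm."""
    n_states = len(start)
    # Initialization: start probability times emission of the first symbol.
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    # Induction: propagate through the transitions, then emit the next symbol.
    for o in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][o]
            for s in range(n_states)
        ]
    return sum(alpha)

def classify(obs, models):
    """Return the name of the model with the highest likelihood for obs."""
    return max(models, key=lambda name: forward_likelihood(obs, *models[name]))

# Hypothetical two-state models over two symbols (0 and 1), which could
# stand for quantized time-frequency features. Each entry is
# (start probabilities, transition matrix, emission matrix).
models = {
    "low_band": (
        [0.5, 0.5],
        [[0.9, 0.1], [0.1, 0.9]],
        [[0.9, 0.1], [0.8, 0.2]],
    ),
    "high_band": (
        [0.5, 0.5],
        [[0.9, 0.1], [0.1, 0.9]],
        [[0.1, 0.9], [0.2, 0.8]],
    ),
}

first = classify([0, 0, 1, 0, 0], models)
second = classify([1, 1, 1, 0, 1], models)
```

For long observation sequences the raw probabilities underflow, so a practical version would work in the log domain or rescale `alpha` at each step.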
Information about the singer is essential in organizing, browsing, and retrieving music collections. In this technical report, a system for automatic singer identification is developed which recognizes the singer of a song by analyzing the music signal. Meanwhile, songs which are similar in terms of the singer's voice are clustered. The proposed scheme follows the ...
While current approaches for audiovisual data segmentation and classification are mostly focused on visual cues, audio signals may actually play a more important role in content parsing for many applications. An approach to automatic segmentation and classification of audiovisual data based on audio content analysis is proposed. The audio signal from ...
An online audio classification and segmentation system is presented in this research, where audio recordings are classified and segmented into speech, music, several types of environmental sounds, and silence based on audio content analysis. This is the first step of our continuing work towards a general content-based audio classification and retrieval system. ...