Content-Based Classification, Search, and Retrieval of Audio

  title={Content-Based Classification, Search, and Retrieval of Audio},
  author={Erling Henry Wold and Thom Blum and Douglas Keislar and James Wheaton},
  journal={IEEE Multim.},
Many audio and multimedia applications would benefit from the ability to classify and search for audio based on its characteristics. The audio analysis, search, and classification engine described here reduces sounds to perceptual and acoustical features. This lets users search or retrieve sounds by any one feature or a combination of them, by specifying previously learned classes based on these features, or by selecting or entering reference sounds and asking the engine to retrieve similar or… Expand

Figures and Topics from this paper

Content-based retrieval of music and audio
  • J. Foote
  • Computer Science, Engineering
  • Other Conferences
  • 1997
A system to retrieve audio documents y acoustic similarity based on statistics derived from a supervised vector quantizer, rather than matching simple pitch or spectral characteristics, which may be applicable to image retrieval as well. Expand
Applying neural network on the content-based audio classification
  • Xi Shao, Changsheng Xu, M. Kankanhalli
  • Computer Science
  • Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint
  • 2003
This paper describes a novel content-based audio classification approach based on neural network and genetic algorithm that achieves a good performance of the classification. Expand
Features for Content-Based Audio Retrieval
The goal of this chapter is to review latest research in the context of audio feature extraction and to give an application-independent overview of the most important existing techniques, and to propose a novel taxonomy for the organization of audio features. Expand
Intelligent Content-Based Audio Classiflcation and Retrieval for Web Applications
This chapter discusses the issues involved in the content-based audio classification and retrieval, including spoken document retrieval and music information retrieval and concludes that the emerging audio ontology can be applied in fast growing Internet, digital libraries, and other multimedia systems. Expand
Boosting for content-based audio classification and retrieval: an evaluation
A recently proposed algorithm in machine learning called AdaBoost for content-based audio classification and retrieval is evaluated, which is a kind of large margin classifiers and is efficient for on-line learning. Expand
Content-based Retrieval for Digital Audio and Music
This chapter covers the research aspects of audio feature extraction, generic audio classification and retrieval, music content analysis, and content-based music retrieval, providing an overview of current research in the area. Expand
Indexing and Retrieval of Audio: A Survey
  • Goujun Lu
  • Computer Science
  • Multimedia Tools and Applications
  • 2004
This paper provides a comprehensive survey of audio indexing and retrieval techniques and describes main audio characteristics and features and discusses techniques for classifying audio into speech and music based on these features. Expand
SVM-Based Audio Classification for Content- Based Multimedia Retrieval
The experimental results show that the proposed SVM approach to classify audio signals into six classes not only improves classification accuracy, but also performs better than the other classification systems using the decision tree (DT), K Nearest Neighbor (K-NN) and Neural Network. Expand
Audio classification based on adaptive partitioning
Improvements in the accuracy of audio classification are largely due to the partitioning of the input audio file into homogeneous segments while the incorporation of new class detection offers greater flexibility of use. Expand
Audio classification using acoustic images for retrieval from multimedia databases
  • I. Paraskevas, E. Chilton
  • Computer Science
  • Proceedings EC-VIP-MC 2003. 4th EURASIP Conference focused on Video/Image Processing and Multimedia Communications (IEEE Cat. No.03EX667)
  • 2003
A novel method for the automatic recognition of acoustic utterances is presented using acoustic images as the basis for the feature extraction that effectively employs the spectrogram, the Wigner-Ville distribution and co-occurrence matrices. Expand


Automatic indexing of a sound database using self-organizing neural nets
One of the main problems in sound synthesis is that the composer's idea or concept of a sound does not necessarily correspond directly to the physical parameters of synthesis algorithms. In regard toExpand
Toward an Intelligent Editor of Digital Audio: Signal Processing Methods
Signal processing methods that have been developed for use in an automatic music analysis system are described and sample results of some promising strategies for accomplishing these goals are presented. Expand
Video and Image Processing in Multimedia Systems
This chapter discusses image and Video Indexing and Retrieval techniques for Multimedia Compression, and some of the techniques used in this chapter were developed in the second part of this book. Expand
Aspects of tone sensation : a psychophysical study
One of the books you can enjoy now is aspects of tone sensation a psychophysical study here. Expand
Feiten and S . Gunzel , “ Automatic Indexing of a Sound Database Using Self - Organizing Neural Nets , ” Computer Music ]
  • “ Audio Databases with Content - Based Retrieval , “ workshop on Intelligent Multimedia Information Retrieval , 1995 Int ’ l Joint Conf . on Artificial Intelligence Video and Image Processing in Multimedia Systems
  • 1995
Noniinear Parameter Estimation ofAcoustic Models
  • Noniinear Parameter Estimation ofAcoustic Models
  • 1987
Towards an Intelligent Editor of Digital Audio: Signal Processing Methods
  • Computer Musicj
  • 1982
he cofounded the Computer Music Association and served for roughly 10 years as an associate editor of
  • Computer Music Journal
  • 1978