An overview of audio information retrieval

@article{Foote1999AnOO,
  title={An overview of audio information retrieval},
  author={Jonathan Foote},
  journal={Multimedia Systems},
  year={1999},
  volume={7},
  pages={2-10}
}
  • J. Foote
  • Published 1999
  • Computer Science
  • Multimedia Systems
Abstract. The problem of audio information retrieval is familiar to anyone who has returned from vacation to find an answering machine full of messages. While there is not yet an “AltaVista” for the audio data type, many workers are finding ways to automatically locate, index, and browse audio using recent advances in speech recognition and machine listening. This paper reviews the state of the art in audio information retrieval, and presents recent advances in automatic speech recognition… 
A method for direct audio search with applications to indexing and retrieval
  • Sue E. Johnson, P. Woodland
  • Computer Science
    2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)
  • 2000
TLDR
A technique for searching audio data to find an exact match for a given piece of cue-audio, which uses a cepstral parameterisation of the audio and a covariance-based distance metric to quickly locate direct repeats.
Direct access in content-based Audio Information Retrieval: A state of the art and challenges
This paper surveys Audio Information Retrieval (AIR) using a literature review and classification of articles from 1994 to 2010 with a keyword index and article abstract in order to explore how AIR
Information Retrieval from Unsegmented Broadcast News Audio
TLDR
A novel method of automatically detecting and removing commercials is presented and shown to increase the performance of the system while also reducing the computational effort required, and a sophisticated large vocabulary speech recogniser which produces high-quality transcriptions of the audio and a window-based retrieval system with post-retrieval merging are described.
A Survey on Audio Retrieval System forClassification
TLDR
It is said that content-based retrieval system permits more tolerance towards erroneous queries, as in these systems queries contain more errors; so for such search keys similarity search based on approximate matching produce batter results compare to exact matching.
Concept framework for audio information retrieval: ARF
TLDR
A concept framework (ARF) for content-based audio retrieval is proposed from systematic perspectives, which describes audio content model, audio retrieval architecture and audio query schemes.
Spoken Content Retrieval: A Survey of Techniques and Technologies
TLDR
This survey provides an overview of the field ofSCR encompassing component technologies, the relationship of SCR to text IR and automatic speech recognition and user interaction issues, and is aimed at researchers with backgrounds in speech technology or IR who are seeking deeper insight on how these fields are integrated to support research and development.
Semantic-based Audio Recognition and Retrieval
TLDR
This study demonstrates automated annotation of a sound with a string of semantically appropriate words and also retrieval of sounds most relevant to a given textual query by constructing acoustic and semantic spaces from a database of sound and description pairs and using statistical models to learn similarity in each space.
Content-Based Audio Indexing and Retrieval : an overview
TLDR
Some limitations and constrains of existing Query By Example (QBE) and Query By Humming) CBAIR systems are explored and hypothetical solutions for the existing problems in this area are proposed to improve the performance.
AN OVERVIEW OF AUDIO CLASSIFIER METHODS AND TECHNIQUES
TLDR
Some limitations and constrains of existing Query by Example and Query by Humming CBAIR systems are explored and hypothetical solutions for the existing problems in this area are proposed to improve the performance.
...
...

References

SHOWING 1-10 OF 83 REFERENCES
Content-based retrieval of music and audio
  • J. Foote
  • Computer Science
    Other Conferences
  • 1997
TLDR
A system to retrieve audio documents y acoustic similarity based on statistics derived from a supervised vector quantizer, rather than matching simple pitch or spectral characteristics, which may be applicable to image retrieval as well.
Informedia: news-on-demand multimedia information acquisition and retrieval
TLDR
The News-on-Demand application created within the InformediaTM Digital Video Library project is described and how speech recognition is used for transcript creation from video, time alignment of closed-captioned transcripts, a speech query interface, and audio paragraph segmentation is discussed.
Speech Retrieval Based on Automatic Indexing
TLDR
A system that retrieves audio recordings containing spoken text in response to a given textual query is presented, and indexing methods that automatically describe the content of the recordings are described.
Speaker identification based text to audio alignment for an audio retrieval system
  • D. Roy, C. Malamud
  • Computer Science
    1997 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1997
TLDR
An audio retrieval system which lets Internet users efficiently access a large audio database containing recordings of the proceedings of the United States House of Representatives using a novel method based on speaker identification that has been successfully integrated into a World Wide Web based search and browse system.
Techniques for Information Retrieval from Speech Messages
  • Re.
  • Computer Science
  • 1991
TLDR
The first demonstration of a complete system that accepts speech messages as input and produces an estimated message class as output is presented, and the promising results obtained in information retrieval on conversational speech messages demonstrate the feasibility of the technology.
Acoustic indexing for multimedia retrieval and browsing
TLDR
The VMR project has shown, both experimentally and through the construction of a working prototype, that speech recognition can be combined with information retrieval methods to locate multimedia documents by content.
Open-vocabulary speech indexing for voice and video mail retrieval
TLDR
Novel techniques that allow extremely rapid audio indexing, at rates approaching several thousand times real time are presented, which allow rapid and efficient retrieval and browsing of audio and video documents.
Speech for multimedia information retrieval
TLDR
The fully-automatic News-on-Demand system monitors TV news and allows selective retrieval of news items based on spoken queries, and runs on a Pentium PC using MPEG-I video compression and the Sphinx-II continuous speech recognition system.
Query by humming: musical information retrieval in an audio database
TLDR
A system for querying an audio database by humming is described along with a scheme for representing the melodic information in a song as relative pitch changes, and the performance results of system indicating its effectiveness are presented.
...
...