Improving Audio Retrieval through Loudness Profile Categorization

@article{Parekh2016ImprovingAR,
  title={Improving Audio Retrieval through Loudness Profile Categorization},
  author={Sanjeel Parekh and Frederic Font and Xavier Serra},
  journal={2016 IEEE International Symposium on Multimedia (ISM)},
  year={2016},
  pages={565-568}
}
The increasing popularity of audio content sharing in online platforms requires the development of techniques to better organize and retrieve this data. In this paper we look at how to improve similarity search through content categorization in the context of Freesound, a popular online sound sharing site. We focus on organization based on morphological description. In particular, we propose to improve search results by incorporating information about query sound's loudness profile. This is… 
1 Citations

Figures and Tables from this paper

Pattern analysis based acoustic signal processing: a survey of the state-of-art
TLDR
The aim of this state-of-art paper is to produce a summary and guidelines for using the broadly used methods, to identify the challenges as well as future research directions of acoustic signal processing.

References

SHOWING 1-10 OF 10 REFERENCES
Sound Effect Taxonomy Management in Production Environments
TLDR
A classification scheme for sound effects management inspired on the MPEG-7 standard on top of an existing lexical network, WordNet, showing the usefulness of the approach easing the task for the librarian and providing higher control on the search and retrieval for the user.
Morphological Sound Description: Computational Model and Usability Evaluation
TLDR
A description framework focusing on intrinsic perceptual sound qualities is proposed, based on Schaeffer's research on sound objects, which could be used to represent and retrieve abstract sounds and to refine traditional search by source for non-abstract sounds.
Using morphological description for generic sound retrieval
TLDR
This work proposes a description framework, based on Schaeffer’s research on a generalised solfeggio which could be applied to any type of sounds, and describes more specifically experiments on automatic extraction of morphological descriptors.
Freesound technical demo
TLDR
This demo wants to introduce Freesound to the multimedia community and show its potential as a research resource.
Sound Indexing Using Morphological Description
TLDR
This paper considers three morphological descriptions: dynamic profiles (ascending, descending, ascending/descending, stable, impulsive), melodic profiles (up, down,stable, up/down, down/up) and complex-iterative sound description (non-iteratives, iterative, grain, repetition).
Essentia: An Audio Analysis Library for Music Information Retrieval
Comunicacio presentada a la 14th International Society for Music Information Retrieval Conference, celebrada a Curitiba (Brasil) els dies 4 a 8 de novembre de 2013.
ESSENTIA: an open-source library for sound and music analysis
TLDR
Essentia 2.0, an open-source C++ library for audio analysis and audio-based music information retrieval, is presented, which contains an extensive collection of reusable algorithms which implement audio input/output functionality, standard digital signal processing blocks, statistical characterization of data, and a large set of spectral, temporal, tonal and high-level music descriptors.
Traité des objets musicaux
TLDR
This dissertation aims to provide a history of music criticism in the post-modern period from 1989 to 2002, a period chosen in order to explore its roots as well as specific cases up to and including the year in which Pierre Schaeffer died.
An examination and interpretation of itu-r bs. 1387: Perceptual evaluation of audio quality
  • TSP Lab Technical Report, Dept. Electrical & Computer Engineering, McGill University, pp. 1–89, 2002.
  • 2002
Guide to sound objects. pierre schaeffer and musical research
  • Trans. John Dack and Christine North), http://www. ears. dmu. ac. uk, 1983.
  • 1983