Shape-based spectral contrast descriptor
@inproceedings{Akkermans2009ShapebasedSC, title={Shape-based spectral contrast descriptor}, author={Vincent Akkermans and Joan Serr{\`a} and Perfecto Herrera}, year={2009} }
Mel-frequency cepstral coefficients are used as an abstract representation of the spectral envelope of a given signal. Although they have been shown to be a powerful descriptor for speech and music signals, more accurate and easily interpretable options can be devised. In this study, we present and evaluate the shape-based spectral contrast descriptor, which is build up from the previously proposed octave-based spectral contrast descriptor. We compare the three aforementioned descriptors with…
16 Citations
Retrieval of Drum Samples by High-Level Descriptors
- Computer Science
- 2017
A drum samples retrieval tool could be created using classification techniques, by the application of learning algorithms and audio feature selections on two datasets, achieving good model and prediction accuracies.
Predictability of Music Descriptor Time Series and its Application to Cover Song Detection
- Computer ScienceIEEE Transactions on Audio, Speech, and Language Processing
- 2012
It is demonstrated that music predictability at relatively long time intervals can be exploited in a real-world application, namely the automatic identification of cover songs, and yields a parameter-free approach for cover song identification that is substantially faster, allows for reduced computational storage and still maintains highly competitive accuracies when compared to state-of-the-art systems.
Detection of genre-specific musical instruments: The case of the mellotron
- Computer Science
- 2011
This document addresses the problem of detection of musical instruments in polyphonic audio, exemplifying this specific task by analyzing the mellotron, a vintage sampler used in popular music, and shows that the methodology used proves to be effective for solving the task.
Identification of versions of the same musical composition by processing audio descriptions
- Computer Science
- 2011
This work proposes a system for version identification that is robust to the main musical changes between versions, including timbre, tempo, key and structure changes, and builds and study a complex network of versions and applies clustering and community detection algorithms.
Stem audio mixing as a content-basec transformation of audio features
- Computer Science2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)
- 2017
The underlying characteristics of manipulating raw recordings into individual stems can be described by this selected set of features, which are used to train various classification models and which lead to a better classification accuracy.
Cross-Version Singing Voice Detection in Classical Opera Recordings
- Computer ScienceISMIR
- 2015
A novel audio feature is proposed that extends a state-of-theart feature set that has previously been applied to singing voice detection in popular music recordings and a simple bootstrapping procedure is described that helps to improve the results in the case that the test data is not reflected well by the training data.
Analysis and Prediction of the Audio Feature Space when Mixing Raw Recordings into Individual Stems
- Computer Science
- 2017
This work investigates stem processing as a content-based transformation, where the inherent content of raw recordings leads us to predict the change of feature values that occurred within the transformation.
Improving voice activity detection in movies
- Computer ScienceINTERSPEECH
- 2015
A set of lightweight features that are specifically designed to perform under such conditions, while at the same time preventing confusions of singing voice with speech are proposed.
Intonation Analysis of Rāgas in Carnatic Music
- Computer Science
- 2014
This paper first assesses rga intonation qualitatively by analysing varṇaṁs, a particular form of Carnatic music compositions, and proposes two approaches based on the parametrization of pitch-value distributions: performance pitch histograms, and context-based svara distributions obtained by categorizing pitch contoursbased on the melodic context.
Intonation analysis of rāgas in Carnatic music 1
- Computer Science
- 2014
This paper first assesses rāga intonation qualitatively by analyzing varṇaṁs, a particular form of Carnatic music compositions, and proposes two approaches based on the parametrization of pitch-value distributions: performance pitch histograms, and context-based svara distributions obtained by categorizing pitch contoursbased on the melodic context.
References
SHOWING 1-10 OF 17 REFERENCES
Music type classification by spectral contrast feature
- PhysicsProceedings. IEEE International Conference on Multimedia and Expo
- 2002
The octave-based spectral contrast feature is proposed to represent the spectral characteristics of a music clip and represented the relative spectral distribution instead of average spectral envelope.
On the robustness of audio features for musical instrument classification
- Computer Science2008 16th European Signal Processing Conference
- 2008
The robustness of several audio features applied exemplarily to musical instrument classification are examined and comparison of rankings obtained by feature selection techniques, qualitative evaluation of changes in statistical parameters, and classification experiments using Gaussian Mixture Models are performed.
Quantitative Analysis of a Common Audio Similarity Measure
- Computer ScienceIEEE Transactions on Audio, Speech, and Language Processing
- 2009
A nearest neighbor classifier using the Kullback-Leibler divergence between Gaussian mixture models of songs' melfrequency cepstral coefficients is analyzed analytically and experimentally by the use of synthesized MIDI files, and it is found that it is highly sensitive to different instrument realizations.
Features and classifiers for the automatic classification of musical audio signals
- Computer ScienceISMIR
- 2004
The classification results show that the topology of the new classifier gives it a significant advantage over other classifiers, by allowing the classifier to model much more complex distributions within the data than Gaussian schemes do.
Musical genre classification of audio signals
- Computer ScienceIEEE Trans. Speech Audio Process.
- 2002
The automatic classification of audio signals into an hierarchy of musical genres is explored and three feature sets for representing timbral texture, rhythmic content and pitch content are proposed.
Computational Models of Music Similarity and their Application in Music Information Retrieval
- Computer Science
- 2006
The combination of different approaches is optimized and the largest evaluation of music similarity measures published to date is presented, which confirms that genrebased evaluations are suitable to efficiently evaluate large parameter spaces.
Representing Musical Genre: A State of the Art
- Art
- 2003
This article discusses the various approaches in representing musical genre, and proposes to classify these approaches in three main categories: manual, prescriptive and emergent approaches.
Content-Based Music Information Retrieval: Current Directions and Future Challenges
- ArtProceedings of the IEEE
- 2008
The problems of content-based music information retrieval are outlined and the state-of-the-art methods using audio cues and other cues are explored, and some of the major challenges for the coming years are identified.
Music Retrieval: A Tutorial and Review
- Computer ScienceFound. Trends Inf. Retr.
- 2006
An overview of the techniques for music processing, which are commonly exploited in many approaches, is presented and a description of the initial efforts and evaluation campaigns for MIR is provided.
Automatic genre classification of music content: a survey
- Computer ScienceIEEE Signal Processing Magazine
- 2006
The state-of-the-art in automatic genre classification of music collections through three main paradigms: expert systems, unsupervised classification, and supervised classification is reviewed.