• Corpus ID: 17373503

Shape-based spectral contrast descriptor

@inproceedings{Akkermans2009ShapebasedSC,
  title={Shape-based spectral contrast descriptor},
  author={Vincent Akkermans and Joan Serr{\`a} and Perfecto Herrera},
  year={2009}
}
Mel-frequency cepstral coefficients are used as an abstract representation of the spectral envelope of a given signal. Although they have been shown to be a powerful descriptor for speech and music signals, more accurate and easily interpretable options can be devised. In this study, we present and evaluate the shape-based spectral contrast descriptor, which is build up from the previously proposed octave-based spectral contrast descriptor. We compare the three aforementioned descriptors with… 

Figures and Tables from this paper

Retrieval of Drum Samples by High-Level Descriptors
  • Computer Science
  • 2017
TLDR
A drum samples retrieval tool could be created using classification techniques, by the application of learning algorithms and audio feature selections on two datasets, achieving good model and prediction accuracies.
Predictability of Music Descriptor Time Series and its Application to Cover Song Detection
TLDR
It is demonstrated that music predictability at relatively long time intervals can be exploited in a real-world application, namely the automatic identification of cover songs, and yields a parameter-free approach for cover song identification that is substantially faster, allows for reduced computational storage and still maintains highly competitive accuracies when compared to state-of-the-art systems.
Detection of genre-specific musical instruments: The case of the mellotron
TLDR
This document addresses the problem of detection of musical instruments in polyphonic audio, exemplifying this specific task by analyzing the mellotron, a vintage sampler used in popular music, and shows that the methodology used proves to be effective for solving the task.
Identification of versions of the same musical composition by processing audio descriptions
TLDR
This work proposes a system for version identification that is robust to the main musical changes between versions, including timbre, tempo, key and structure changes, and builds and study a complex network of versions and applies clustering and community detection algorithms.
Stem audio mixing as a content-basec transformation of audio features
  • M. M. Ramírez, J. Reiss
  • Computer Science
    2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)
  • 2017
TLDR
The underlying characteristics of manipulating raw recordings into individual stems can be described by this selected set of features, which are used to train various classification models and which lead to a better classification accuracy.
Cross-Version Singing Voice Detection in Classical Opera Recordings
TLDR
A novel audio feature is proposed that extends a state-of-theart feature set that has previously been applied to singing voice detection in popular music recordings and a simple bootstrapping procedure is described that helps to improve the results in the case that the test data is not reflected well by the training data.
Analysis and Prediction of the Audio Feature Space when Mixing Raw Recordings into Individual Stems
TLDR
This work investigates stem processing as a content-based transformation, where the inherent content of raw recordings leads us to predict the change of feature values that occurred within the transformation.
Improving voice activity detection in movies
TLDR
A set of lightweight features that are specifically designed to perform under such conditions, while at the same time preventing confusions of singing voice with speech are proposed.
Intonation Analysis of Rāgas in Carnatic Music
TLDR
This paper first assesses rga intonation qualitatively by analysing varṇaṁs, a particular form of Carnatic music compositions, and proposes two approaches based on the parametrization of pitch-value distributions: performance pitch histograms, and context-based svara distributions obtained by categorizing pitch contoursbased on the melodic context.
Intonation analysis of rāgas in Carnatic music 1
TLDR
This paper first assesses rāga intonation qualitatively by analyzing varṇaṁs, a particular form of Carnatic music compositions, and proposes two approaches based on the parametrization of pitch-value distributions: performance pitch histograms, and context-based svara distributions obtained by categorizing pitch contoursbased on the melodic context.
...
...

References

SHOWING 1-10 OF 17 REFERENCES
Music type classification by spectral contrast feature
TLDR
The octave-based spectral contrast feature is proposed to represent the spectral characteristics of a music clip and represented the relative spectral distribution instead of average spectral envelope.
On the robustness of audio features for musical instrument classification
TLDR
The robustness of several audio features applied exemplarily to musical instrument classification are examined and comparison of rankings obtained by feature selection techniques, qualitative evaluation of changes in statistical parameters, and classification experiments using Gaussian Mixture Models are performed.
Quantitative Analysis of a Common Audio Similarity Measure
TLDR
A nearest neighbor classifier using the Kullback-Leibler divergence between Gaussian mixture models of songs' melfrequency cepstral coefficients is analyzed analytically and experimentally by the use of synthesized MIDI files, and it is found that it is highly sensitive to different instrument realizations.
Features and classifiers for the automatic classification of musical audio signals
TLDR
The classification results show that the topology of the new classifier gives it a significant advantage over other classifiers, by allowing the classifier to model much more complex distributions within the data than Gaussian schemes do.
Musical genre classification of audio signals
TLDR
The automatic classification of audio signals into an hierarchy of musical genres is explored and three feature sets for representing timbral texture, rhythmic content and pitch content are proposed.
Computational Models of Music Similarity and their Application in Music Information Retrieval
TLDR
The combination of different approaches is optimized and the largest evaluation of music similarity measures published to date is presented, which confirms that genrebased evaluations are suitable to efficiently evaluate large parameter spaces.
Representing Musical Genre: A State of the Art
TLDR
This article discusses the various approaches in representing musical genre, and proposes to classify these approaches in three main categories: manual, prescriptive and emergent approaches.
Content-Based Music Information Retrieval: Current Directions and Future Challenges
TLDR
The problems of content-based music information retrieval are outlined and the state-of-the-art methods using audio cues and other cues are explored, and some of the major challenges for the coming years are identified.
Music Retrieval: A Tutorial and Review
  • N. Orio
  • Computer Science
    Found. Trends Inf. Retr.
  • 2006
TLDR
An overview of the techniques for music processing, which are commonly exploited in many approaches, is presented and a description of the initial efforts and evaluation campaigns for MIR is provided.
Automatic genre classification of music content: a survey
TLDR
The state-of-the-art in automatic genre classification of music collections through three main paradigms: expert systems, unsupervised classification, and supervised classification is reviewed.
...
...