Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification


We present a new technique for audio signal comparison based on tonal subsequence alignment and its application to detect cover versions (i.e., different performances of the same underlying musical piece). Cover song identification is a task whose popularity has increased in the music information retrieval (MIR) community along in the past, as it provides a direct and objective way to evaluate music similarity algorithms. This paper first presents a series of experiments carried out with two state-of-the-art methods for cover song identification. We have studied several components of these (such as chroma resolution and similarity, transposition, beat tracking or dynamic time warping constraints), in order to discover which characteristics would be desirable for a competitive cover song identifier. After analyzing many cross-validated results, the importance of these characteristics is discussed, and the best performing ones are finally applied to the newly proposed method. Multiple evaluations of this one confirm a large increase in identification accuracy when comparing it with alternative state-of-the-art approaches.

DOI: 10.1109/TASL.2008.924595

Extracted Key Phrases

18 Figures and Tables

Citations per Year

173 Citations

Semantic Scholar estimates that this publication has 173 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Serr2008ChromaBS, title={Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification}, author={Joan Serr{\`a} and Emilia G{\'o}mez and Perfecto Herrera and Xavier Serra}, journal={IEEE Transactions on Audio, Speech, and Language Processing}, year={2008}, volume={16}, pages={1138-1151} }