Corpus ID: 52016793

Improved Chord Recognition by Combining Duration and Harmonic Language Models

@article{Korzeniowski2018ImprovedCR,
  title={Improved Chord Recognition by Combining Duration and Harmonic Language Models},
  author={Filip Korzeniowski and Gerhard Widmer},
  journal={ArXiv},
  year={2018},
  volume={abs/1808.05335}
}
Chord recognition systems typically comprise an acoustic model that predicts chords for each audio frame, and a temporal model that casts these predictions into labelled chord segments. However, temporal models have been shown to only smooth predictions, without being able to incorporate musical information about chord progressions. Recent research discovered that it might be the low hierarchical level such models have been applied to (directly on audio frames) which prevents learning musical… Expand
Towards CNN-based Acoustic Modeling of Seventh Chords for Automatic Chord Recognition
In this paper, we build upon a recently proposed deep convolutional neural network architecture for automatic chord recognition (ACR). We focus on extending the commonly used major/minor vocabularyExpand
Large-vocabulary Chord Transcription Via Chord Structure Decomposition
TLDR
A new model for audio chord recognition under a huge chord vocabulary, to decompose any chord label into a set of musically meaningful components, each with a much smaller vocabulary compared to the size of the overall chord vocabulary is proposed. Expand
Harmony Transformer: Incorporating Chord Segmentation into Harmony Recognition
TLDR
The Harmony Transformer is proposed, a multi-task music harmony analysis model aiming to improve chord recognition through incorporating chord segmentation into the recognition process using end-to-end sequence learning. Expand
A Bi-Directional Transformer for Musical Chord Recognition
TLDR
It turns out that the proposed bi-directional Transformer for chord recognition was able to divide segments of chords by utilizing adaptive receptive field of the attention mechanism, and it was observed that the model was ability to effectively capture long-term dependencies, making use of essential information regardless of distance. Expand
Multi-Step Chord Sequence Prediction Based On Aggregated Multi-Scale Encoder-Decoder Networks
TLDR
This paper studies the prediction of chord progressions for jazz music by relying on machine learning models and proposes new architectures based on an iterative temporal aggregation of input labels, which outperforms existing methods in terms of accuracy and perplexity, while requiring relatively few parameters. Expand
Attend to Chords: Improving Harmonic Analysis of Symbolic Music Using Transformer-Based Models
TLDR
Experiments on conventional ACR and advanced functional harmony recognition indicate that the HT has the potential to surpass the BTC, especially in terms of chord segmentation quality, and an improved model is proposed. Expand
Blending Acoustic and Language Model Predictions for Automatic Music Transcription
TLDR
This paper introduces a method for converting an input probabilistic piano roll (the output of a typical multipitch detection model) into a binary piano roll, and reports a statistically significant improvement over HMM decoding in terms of notewise F-measure with both timesteps. Expand
A MODULAR SYSTEM FOR THE HARMONIC ANALYSIS OF MUSICAL SCORES USING A LARGE VOCABULARY
The harmonic analysis of a musical composition is a fundamental step towards understanding its structure. Central to this analysis is the labeling of segments of a piece with chord symbols and localExpand
Tatum-Level Drum Transcription Based on a Convolutional Recurrent Neural Network with Language Model-Based Regularized Training
TLDR
A neural drum transcription method that detects from music signals the onset times of drums at the tatum level, where tatum times are assumed to be estimated in advance and a regularized training method for a frame-to-tatum DNN is proposed. Expand
Chord Jazzification: Learning Jazz Interpretations of Chord Symbols
TLDR
This work decomposes chord jazzification into coloring and voicing, and shows that the twostage process outperforms an end-to-end generation approach in modeling chord configurations, and attentionbased models are better at capturing the structure of chord sequences in comparison with recurrent neural networks. Expand
...
1
2
...

References

SHOWING 1-10 OF 34 REFERENCES
On the Futility of Learning Complex Frame-Level Language Models for Chord Recognition
TLDR
This paper argues that learning complex temporal models at the level of audio frames is futile on principle, and that non-Markovian models do not perform better than their first-order counterparts, and supports this argument through three experiments on the McGill Billboard dataset. Expand
Automatic Chord Recognition with Higher-Order Harmonic Language Modelling
TLDR
The results show that model overconfidence plays only a minor role (but target smoothing still improves the acoustic model), and that stronger chord language models do improve recognition results, however their effects are small compared to other domains. Expand
Simultaneous Estimation of Chords and Musical Context From Audio
TLDR
This work devise a fully automatic method to simultaneously estimate from an audio waveform the chord sequence including bass notes, the metric positions of chords, and the key, and introduces a measure of segmentation quality and shows that bass and meter modeling are especially beneficial for obtaining the correct level of granularity. Expand
Audio Chord Recognition with a Hybrid Recurrent Neural Network
TLDR
The ability of feed forward deep neural networks (DNNs) to learn discriminative features directly from a time-frequency representation of the acoustic signal, eliminating the need for a complex feature extraction stage. Expand
A Large-Scale Study of Language Models for Chord Prediction
TLDR
In contrast to static $N$-gram models, certain RNN configurations adapt to the songs at test time: this finding constitutes a further step towards the development of chord recognition systems that are more aware of local musical context than what was previously possible. Expand
Chord Recognition Using Duration-explicit Hidden Markov Models
TLDR
It is found that the state-of-the-art recognition result can be improved upon by using several duration distributions, which are found automatically by clustering song-level duration histograms. Expand
Structured Training for Large-Vocabulary Chord Recognition
TLDR
A deep convolutional-recurrent model for automatic chord recognition over a vocabulary of 170 classes is developed, trained to produce both the time-varying chord label sequence as well as binary encodings of chord roots and qualities. Expand
Audio Chord Recognition with Recurrent Neural Networks
TLDR
An efficient algorithm to search for the global mode of the output distribution while taking long-term dependencies into account is devised and the resulting method is competitive with state-of-the-art approaches on the MIREX dataset in the major/minor prediction task. Expand
An Expert Ground Truth Set for Audio Chord Recognition and Music Analysis
TLDR
Working with a team of trained jazz musicians, time-aligned transcriptions of the harmony in more than a thousand songs selected randomly from the Billboard “Hot 100” chart in the United States between 1958 and 1991 are collected. Expand
Feature Learning for Chord Recognition: The Deep Chroma Extractor
TLDR
The argument that chroma vectors potentially hold enough information to model harmonic content of audio for chord recognition, but that standard chroma extractors compute too noisy features is presented, and a learned chroma feature extractor based on artificial neural networks is proposed. Expand
...
1
2
3
4
...