• Corpus ID: 8232502

ENHANCED BEAT TRACKING WITH CONTEXT-AWARE NEURAL NETWORKS

@inproceedings{Bck2011ENHANCEDBT,
  title={ENHANCED BEAT TRACKING WITH CONTEXT-AWARE NEURAL NETWORKS},
  author={Sebastian B{\"o}ck and Markus Schedl},
  year={2011}
}
We present two new beat tracking algorithms based on the autocorrelation analysis, which showed state-of-the-art performance in the MIREX 2010 beat tracking contest. Unlike the traditional approach of processing a list of onsets, we propose to use a bidirectional Long Short-Term Memory recurrent neural network to perform a frame by frame beat classification of the signal. As inputs to the network the spectral features of the audio signal and their relative differences are used. The network… 

Figures and Tables from this paper

Style-Specific Beat Tracking with Deep Neural Networks
TLDR
The proposed beat tracking system is based on temporal convolutional networks which capture the sequential structure of audio input and is evaluated on four datasets of various musical genres and styles and achieves state-of-the-art performance.
A Multi-model Approach to Beat Tracking Considering Heterogeneous Music Styles
TLDR
A new beat tracking algorithm which extends an existing state-of-the-art system with a multi-model approach to represent different music styles and is able to match even human tapping performance.
Joint Beat and Downbeat Tracking with Recurrent Neural Networks
TLDR
A recurrent neural network operating directly on magnitude spectrograms is used to model the metrical structure of the audio signals at multiple levels and provides an output feature that clearly distinguishes between beats and downbeats.
Music Signals Beat Tracking Based on Bidirectional Long-Short Term Memory
TLDR
A music signals beat tracking based on Bidirectional Long-Short Term Memory (BLSTM) method which shows that for the training data reach 86.3% f-measure accuracy, Ballroom data 73.4%, and GTZAN 65%.
Downbeat Tracking Using Beat Synchronous Features with Recurrent Neural Networks
TLDR
A system that extracts the downbeat times from a beat-synchronous audio feature stream of a music piece and shows on seven commonly used datasets of Western music that the system is able to achieve state-of-the-art results.
Beat Tracking with a Cepstroid Invariant Neural Network
TLDR
A novel rhythm tracking architecture that learns how to track tempo and beats through layered learning, and uses a neural network that is invariant with regards to the cepstroid length.
Predictive Real-Time Beat Tracking from Music for Embedded Application
TLDR
A real-time beat tracking algorithm, fast enough to be implemented on an embedded system, that can satisfactorily estimate beat positions from a music signal in real- time with an average continuity score of 0.67.
On Musical Onset Detection via the S-Transform
  • N. Silva, C. Weeraddana
  • Computer Science
    2018 52nd Asilomar Conference on Signals, Systems, and Computers
  • 2018
TLDR
This paper proposes a method to localize onset components in music by using the S-transform, and thus, the method is purely based on temporal/spectral data.
Temporal convolutional networks for musical audio beat tracking
TLDR
Three highly promising attributes of TCNs for music analysis are demonstrated, namely: they achieve state-of-the-art performance on a wide range of existing beat tracking datasets, they are well suited to parallelisation and thus can be trained efficiently even on very large training data, and they require a small number of weights.
Convolving Gaussian Kernels for RNN-Based Beat Tracking
TLDR
This work proposes to convolve the original ground truth with a Gaussian kernel as the target output of the network for a more robust training and shows that it can train a better or at least competitive model in a shorter time by using the convolved groundtruth with a proper Gaussian kernels.
...
...

References

SHOWING 1-10 OF 13 REFERENCES
Context-Dependent Beat Tracking of Musical Audio
TLDR
A simple and efficient method for beat tracking of musical audio using a two state model that replicates the human ability of tapping in time to music at significantly reduced computational cost.
A tutorial on onset detection in music signals
TLDR
Methods based on the use of explicitly predefined signal features: the signal's amplitude envelope, spectral magnitudes and phases, time-frequency representations, and methods based on probabilistic signal models are discussed.
Evaluation of the Audio Beat Tracking System BeatRoot
TLDR
Improvements to the original BeatRoot system and a large-scale evaluation and analysis of the system's performance are described, with results compared with other evaluations such as the MIREX 2006 Audio Beat Tracking Evaluation.
Robust on-line beat tracking with kalman filtering and probabilistic data association (KF-PDA)
TLDR
A Kalman filtering approach to on-line musical beat tracking with probabilistic data association (PDA) is investigated and superior performance of the proposed beat tracking algorithm is demonstrated with simulation results on the Music Information Retrieval Evaluation Exchange (MIREX) 2006 beat tracking practice dataset and the Billboard Top-10 database.
Particle Filtering Applied to Musical Tempo Tracking
TLDR
This paper explores the use of particle filters for beat tracking in musical audio examples with two alternative algorithms, one which performs Rao-Blackwellisation to produce an almost deterministic formulation while the second is a formulation which models tempo as a Brownian motion process.
Acoustic Cues to Beat Induction: A Machine Learning Perspective
TLDR
It is hypothesize that modeling beat induction may involve many different, complementary acoustic features and that the process of selecting relevant features should partly depend on acoustic properties of the very signal under consideration.
Long Short-Term Memory
TLDR
A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
Supervised Sequence Labelling with Recurrent Neural Networks
  • A. Graves
  • Computer Science
    Studies in Computational Intelligence
  • 2008
TLDR
A new type of output layer that allows recurrent networks to be trained directly for sequence labelling tasks where the alignment between the inputs and the labels is unknown, and an extension of the long short-term memory network architecture to multidimensional data, such as images and video sequences.
A Review of Automatic Rhythm Description Systems
This research was supported by the EU project FP6-507142 (SIMAC) and the national project Y99-INF sponsored by the Austrian Federal Ministryu of Education, Science and Culture in the form of a Start
Universal Onset Detection with Bidirectional Long Short-Term Memory Neural Networks
TLDR
This paper presents a new onset detector with superior performance and temporal precision for all kinds of music, including complex music mixes, based on auditory spectral features and relative spectral differences processed by a bidirectional Long Short-Term Memory recurrent neural network, which acts as reduction function.
...
...