Streamlined Tempo Estimation Based on Autocorrelation and Cross-correlation With Pulses

  title={Streamlined Tempo Estimation Based on Autocorrelation and Cross-correlation With Pulses},
  author={Graham Percival and George Tzanetakis},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  • G. Percival, G. Tzanetakis
  • Published 1 December 2014
  • Computer Science
  • IEEE/ACM Transactions on Audio, Speech, and Language Processing
Algorithms for musical tempo estimation have become increasingly complicated in recent years. These algorithms typically utilize two fundamental properties of musical rhythm: some features of the audio signal are self-similar at periods related to the underlying rhythmic structure, and rhythmic events tend to be spaced regularly in time. We present a streamlined tempo estimation method ( stem) that distills ideas from previous work by reducing the number of steps, parameters, and modeling… 

Figures and Tables from this paper

Music Tempo Estimation Using Sub-Band Synchrony
A new tempo estimation method that utilizes coherent energy changes across multiple frequency sub-bands to identify the onsets and experimental results indicate a reasonable improvement in performance when compared to conventional methods of tempo estimation.
Tempo Octave Correction Using Multiclass Support Vector Machine
  • O. Dutta
  • Computer Science
    2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT)
  • 2018
A method base+octv to correct wrong estimate of tempo estimates by using an octave classifier and five features extracted from the audio are used to predict the octave error class which leads to corrected tempo.
Accurate Tempo Estimation Based on Recurrent Neural Networks and Resonating Comb Filters
A new tempo estimation algorithm which uses a bank of resonating comb filters to determine the dominant periodicity of a musical excerpt by simply reporting the highest resonator’s histogram peak.
Musical Tempo Estimation Using a Multi-scale Network
A multi-scale structure is introduced as the overall network architecture where information from different scales is aggregated to strengthen contextual feature learning and a Grouped Attention Module is proposed as the key component of the network.
Energy-Weighted Multi-Band Novelty Functions for Onset Detection in Piano Music
The use of energy-based weighting of multi-band onset detection functions and the use of a new criterion for adapting the final peak-picking threshold are shown to improve the detection of soft onsets in the vicinity of loud notes.
DAW-Integrated Beat Tracking for Music Production
Rhythm analysis is a well researched area in music information retrieval that has many useful applications in music production. In particular, it can be used to synchronize the tempo of audio
A Single-Step Approach to Musical Tempo Estimation Using a Convolutional Neural Network
A single-step musical tempo estimation system based solely on a convolutional neural network that can be used to identify and visualize tempo drift in musical performances and performs as well as or better than other state-of-the-art algorithms.
Phase-Aware Joint Beat and Downbeat Estimation Based on Periodicity of Metrical Structure
A phase-aware joint beat and downbeat estimation method mainly intended for popular music with a periodic metrical structure and steady tempo that trains a DNN that estimates the beat phase at each frame whose period is equal to the beat interval.
Extending Deep Rhythm for Tempo and Genre Estimation Using Complex Convolutions, Multitask Learning and Multi-input Network
This paper focuses on the Deep Rhythm system based on a harmonic representation of rhythm used as an input to a convolutional neural network, and to consider the relationships between frequency bands, processes complex-valued inputs through complex-convolutions.
OBTAIN: Real-Time Beat Tracking in Audio Signals
In simulations, it can be seen that the proposed algorithm, Online Beat TrAckINg (OBTAIN), outperforms state-of-art results in terms of prediction accuracy while maintaining comparable and practical computational complexity.


An effective, simple tempo estimation method based on self-similarity and regularity
  • G. Tzanetakis, G. Percival
  • Computer Science
    2013 IEEE International Conference on Acoustics, Speech and Signal Processing
  • 2013
This work believes that it has conducted the most comprehensive evaluation to date of tempo induction algorithms in terms of number of datasets and tracks and has similar performance to the best-performing ones.
Cyclic tempogram—A mid-level tempo representation for musicsignals
This paper presents a robust mid-level representation that encodes local tempo information and introduces the concept of cyclic tempograms, where tempi differing by a power of two are identified.
Extracting Predominant Local Pulse Information From Music Recordings
A novel mid-level representation that captures musically meaningful local pulse information even for the case of complex music and can be used as a flexible tool for enhancing tempo estimation and beat tracking.
Music tempo estimation and beat tracking by applying source separation and metrical relations
T tempo estimation and beat tracking algorithms are presented by utilizing percussive/harmonic separation of the audio signal, in order to extract filterbank energies and chroma features from the respective components.
Context-Dependent Beat Tracking of Musical Audio
A simple and efficient method for beat tracking of musical audio using a two state model that replicates the human ability of tapping in time to music at significantly reduced computational cost.
Identifying Metrical and Temporal Structure With an Autocorrelation Phase Matrix
It is argued that the Autocorrelation Phase Matrix (APM) is an effective data structure for tempo prediction and related applications, such as real-time beat induction and music analysis.
Rhythmic Pattern Modeling for Beat and Downbeat Tracking in Musical Audio
The results showed that explicitly modeling rhythmic patterns of dance styles drastically reduces octave errors (detection of half or double tempo) and substantially improves downbeat tracking.
Automatic Extraction of Tempo and Beat From Expressive Performances
It is shown that estimating the perceptual salience of rhythmic events significantly improves the results of a computer program which is able to estimate the tempo and the times of musical beats in expressively performed music.
In the work presented in this paper, a user study exploring the perception of Beat Strength was conducted and the results were used to calibrate and explore automatic Beat Strength measures based on the calculation of Beat Histograms.
Onset Detection Revisited
Various methods have been proposed for detecting the onset times of musical notes in audio signals. We examine recent work on onset detection using spectral features such as the magnitude, phase and