Adaptive whitening for Improved Real-Time audio onset Detection

Abstract

We describe a new method for preprocessing STFT phase-vocoder frames for improved performance in real-time onset detection, which we term " adaptive whitening ". The procedure involves normalising the magnitude of each bin according to a recent maximum value for that bin, with the aim of allowing each bin to achieve a similar dynamic range over time, which helps to mitigate against the influence of spectral roll-off and strongly-varying dynamics. Adaptive whitening requires no training, is relatively lightweight to compute, and can run in real-time. Yet it can improve onset detector performance by more than ten percentage points (peak F-measure) in some cases, and improves the performance of most of the onset detectors tested. We present results demonstrating that adaptive whitening can significantly improve the performance of various STFT-based onset detection functions, including functions based on the power, spectral flux, phase deviation, and complex deviation measures. Our results find the process to be especially beneficial for certain types of audio signal (e.g. complex mixtures such as pop music).

Extracted Key Phrases

7 Figures and Tables

Showing 1-10 of 21 references

Vic Firth Education Team and The Percussive Arts Society. 40 essential rudiments: The flam. www.vicfirth.com/education/ rudiments/20flam.html, retrieved 30th

  • 2007
1 Excerpt

Audio Onset Detection Results. www.music-ir.org/mirex2006/index. php/Audio Onset Detection Results, retrieved 30th

  • 2006
1 Excerpt

Automatic Annotation of Musical Audio for Interactive Applications

  • M Brossier
  • 2006
Showing 1-10 of 23 extracted citations