Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model

  • Anssi Klapuri
  • 2008


A method is described for estimating the fundamental frequencies of several concurrent sounds in polyphonic music and multiple-speaker speech signals. The method consists of a computational model of the human auditory periphery, followed by a periodicity analysis mechanism where fundamental frequencies are iteratively detected and canceled from the mixture signal. The auditory model needs to be computed only once, and a computationally efficient strategy is proposed for implementing it. Simulation experiments were made using mixtures of musical sounds and mixed speech utterances. The proposed method outperformed two reference methods in the evaluations and showed a high level of robustness in processing signals where important parts of the audible spectrum were deleted to simulate bandlimited interference. Different system configurations were studied to identify the conditions where pitch analysis using an auditory model is advantageous over conventional time or frequency domain approaches.

DOI: 10.1109/TASL.2007.908129

Extracted Key Phrases

14 Figures and Tables

Showing 1-10 of 58 references

Design and evaluation of doubletransform pitch determination algorithms with nonlinear distortion in the frequency domain-preliminary results

  • H Indefrey, W Hess, G Seeser
  • 1985
Highly Influential
13 Excerpts

Polyphonic pitch extraction from musical signals

  • P Lepain
  • 1999
Highly Influential
1 Excerpt

Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model for auditory processing

  • A De Cheveigné
  • 1993
Highly Influential
4 Excerpts
Showing 1-10 of 71 extracted citations


Citations per Year

112 Citations

Semantic Scholar estimates that this publication has received between 78 and 165 citations based on the available data.

See our FAQ for additional information.