Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model

  • Anssi Klapuri
  • Published 2008 in
    IEEE Transactions on Audio, Speech, and Language…

Abstract

A method is described for estimating the fundamental frequencies of several concurrent sounds in polyphonic music and multiple-speaker speech signals. The method consists of a computational model of the human auditory periphery, followed by a periodicity analysis mechanism where fundamental frequencies are iteratively detected and canceled from the mixture signal. The auditory model needs to be computed only once, and a computationally efficient strategy is proposed for implementing it. Simulation experiments were made using mixtures of musical sounds and mixed speech utterances. The proposed method outperformed two reference methods in the evaluations and showed a high level of robustness in processing signals where important parts of the audible spectrum were deleted to simulate bandlimited interference. Different system configurations were studied to identify the conditions where pitch analysis using an auditory model is advantageous over conventional time or frequency domain approaches.

DOI: 10.1109/TASL.2007.908129

Extracted Key Phrases

13 Figures and Tables

Statistics

01020302008200920102011201220132014201520162017
Citations per Year

118 Citations

Semantic Scholar estimates that this publication has 118 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Klapuri2008MultipitchAO, title={Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model}, author={Anssi Klapuri}, journal={IEEE Transactions on Audio, Speech, and Language Processing}, year={2008}, volume={16}, pages={255-266} }