PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise

@article{Gonzalez2014PEFACA,
  title={PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise},
  author={Sira Gonzalez and Mike Brookes},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  year={2014},
  volume={22},
  pages={518--530}
}
  • S. Gonzalez, M. Brookes
  • Published 1 February 2014
  • Computer Science
  • IEEE/ACM Transactions on Audio, Speech, and Language Processing
We present PEFAC, a fundamental frequency estimation algorithm for speech that is able to identify voiced frames and estimate pitch reliably even at negative signal-to-noise ratios. The algorithm combines a normalization stage, to remove channel dependency and to attenuate strong noise components, with a harmonic summing filter applied in the log-frequency power spectral domain, the impulse response of which is chosen to sum the energy of the fundamental frequency harmonics while attenuating… 
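The idea of summing harmonics in the log-frequency domain can be illustrated with a minimal sketch. It rests on one observation: on a log-frequency axis, harmonic k lies log2(k) octaves above the fundamental, so the harmonic spacing is the same for every candidate pitch and a single fixed filter can sum the harmonic energy. The code below is a simplified illustration, not the authors' implementation: it uses plain unit impulses as the filter, omits the normalization stage and the voiced/unvoiced decision, and the function name `harmonic_sum_f0` and all parameter defaults are assumptions chosen for the example.

```python
import numpy as np

def harmonic_sum_f0(frame, fs, f_min=60.0, f_max=400.0, n_harm=10, bins_per_octave=48):
    """Rough f0 estimate (Hz) for one frame via log-frequency harmonic summation.

    Illustrative only: unit-impulse filter, no normalization, no voicing decision.
    """
    # Power spectrum of the Hann-windowed, zero-padded frame.
    n_fft = 4 * len(frame)
    spec = np.abs(np.fft.rfft(frame * np.hanning(len(frame)), n_fft)) ** 2
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)

    # Keep only harmonics that stay below Nyquist for every candidate f0.
    n_harm = max(1, min(n_harm, int((fs / 2) // f_max)))

    # Harmonic k sits log2(k) octaves above f0, so the bin offsets
    # are identical for all candidates on the log-frequency grid.
    offsets = np.round(np.log2(np.arange(1, n_harm + 1)) * bins_per_octave).astype(int)

    # Log-spaced grid: candidate bins first, then room for the highest harmonic.
    n_cand = int(np.log2(f_max / f_min) * bins_per_octave) + 1
    n_bins = n_cand + offsets[-1]
    log_f = f_min * 2.0 ** (np.arange(n_bins) / bins_per_octave)
    log_spec = np.interp(log_f, freqs, spec, right=0.0)

    # Sum the power found at each harmonic offset for every candidate.
    score = np.zeros(n_cand)
    for off in offsets:
        score += log_spec[off:off + n_cand]
    return log_f[np.argmax(score)]

# Synthetic voiced frame: eight equal-amplitude harmonics of 200 Hz, 90 ms long.
fs = 16000
t = np.arange(int(0.09 * fs)) / fs
frame = sum(np.sin(2 * np.pi * 200.0 * k * t) for k in range(1, 9))
f0 = harmonic_sum_f0(frame, fs)
print(f"estimated f0: {f0:.1f} Hz")
```

The estimate is quantized to the log-frequency grid, so it lands on the grid point nearest the true pitch rather than on the exact value. A plain impulse filter like this one is also prone to octave errors when few harmonics are present, which is one reason a practical filter needs a more careful impulse response.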

Voiced/Unvoiced Classification and Pitch Estimation Based on Amplitude Compression Filter

The simulation results show that the proposed method efficiently reduces voiced/unvoiced classification and pitch estimation errors, and that it outperforms some state-of-the-art methods in real environments.

A Pitch Estimation Method Robust to High Levels of Noise

Simulation experiments show that the proposed pitch detection method is more accurate and has lower algorithmic complexity than traditional methods at both high and low SNR.

Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering

A new algorithm is presented that integrates three subtasks (the classification of voiced speech, the estimation of the fundamental frequency, and the tracking of pitch values over time) into a single procedure, and that compares favorably with current state-of-the-art pitch detection algorithms.

Fundamental Frequency Informed Speech Enhancement in a Flexible Statistical Framework

A statistical estimator is derived that explicitly takes into account information about the characteristic structure of voiced speech by means of a harmonic signal model and outperforms several reference algorithms in terms of speech quality and intelligibility as predicted by instrumental measures.

Combining Zero Replacement Speech Enhancement with Lag Window Method for Pitch Detection

An anti-noise pitch detection method is proposed that combines a speech enhancement algorithm with a spectral flattening algorithm; it achieves the lowest gross pitch error (GPE) rate among all the compared methods on male speech with added white noise.

Estimation of speaker individual spectral envelope for pitch tracking improvement

This paper proposes to overcome some limitations of the PEFAC algorithm by employing an alternative enhancement procedure, which uses an estimation of the individual spectral envelope instead of using a universal function.

Pitch Estimation Algorithm for Narrowband Speech Signal using Phase Differences between Harmonics

  • Yuya Hosoda, A. Kawamura, Y. Iiguni
  • Engineering
    2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  • 2021
This paper proposes a pitch estimation algorithm for a narrowband speech signal using phase differences between harmonics. A narrowband speech signal has an incomplete harmonic structure due to

Robust Estimation of Fundamental Frequency Using Single Frequency Filtering Approach

A new method for robust estimation of fundamental frequency (F0) from speech signal is proposed in this paper. The method exploits the high SNR regions of speech in time and frequency domains in the

A Pitch-Synchronous Simultaneous Detection-Estimation Framework for Speech Enhancement

A pitch-synchronous stochastic-deterministic estimator outperforms several benchmark methods in terms of speech intelligibility and perceived quality predicted by instrumental measures for various noise types and different signal-to-noise ratios.

References


A Pitch Estimation Filter robust to high levels of noise (PEFAC)

PEFAC, a fundamental frequency estimation algorithm, is presented; it identifies the pitch of voiced frames reliably even at negative signal-to-noise ratios and performs exceptionally well in both high and low levels of additive noise.

Maximum a-posteriori probability pitch tracking in noisy environments using harmonic model

This paper presents an optimal estimation procedure for sound signals (such as speech) that are modeled by harmonic sources, and achieves more robust and accurate estimation of voiced speech parameters.

YIN, a fundamental frequency estimator for speech and music.

An algorithm is presented for the estimation of the fundamental frequency (F0) of speech or musical sounds. It is based on the well-known autocorrelation method with a number of modifications that

Evaluation of pitch estimation in noisy speech for application in non-intrusive speech quality assessment

This paper evaluates the performance of four established state-of-the-art algorithms for pitch estimation in additive noise and reverberation and shows how accurate estimation of the pitch of a speech signal can influence objective speech quality measurement algorithms.

HMM-Based Multipitch Tracking for Noisy and Reverberant Speech

  • Z. Jin, Deliang Wang
  • Computer Science, Engineering
    IEEE Transactions on Audio, Speech, and Language Processing
  • 2011
This paper proposes a robust algorithm for multipitch tracking in the presence of both background noise and room reverberation, which can reliably detect single and double pitch contours in noisy and reverberant conditions.

Multi-Pitch Estimation

In this book, an introduction to pitch estimation is given and a number of statistical methods for pitch estimation are presented, which include both single- and multi-pitch estimators based on statistical approaches, like maximum likelihood and maximum a posteriori methods.

Super resolution pitch determination of speech signals

Based on a new similarity model for the voice excitation process, a novel pitch determination procedure is derived that has infinite (super) resolution, better accuracy than the difference limen for F0, robustness to noise, reliability, and modest computational complexity.

Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model

  • A. Klapuri
  • Computer Science
    IEEE Transactions on Audio, Speech, and Language Processing
  • 2008
The proposed method outperformed two reference methods in the evaluations and showed a high level of robustness in processing signals where important parts of the audible spectrum were deleted to simulate bandlimited interference.

Joint High-Resolution Fundamental Frequency and Order Estimation

The presented method for joint estimation of the fundamental frequency and order of a set of harmonically related sinusoids based on the multiple signal classification (MUSIC) estimation criterion is shown to have an efficient implementation using fast Fourier transforms.

Dynamic programming algorithm for optimal estimation of speech parameter contours

  • H. Ney
  • Computer Science
    IEEE Transactions on Systems, Man, and Cybernetics
  • 1983
A method for incorporating the requirement of smoothness into the estimation procedure for speech parameters is described, and a recursive algorithm is obtained which does without statistical assumptions and is purely deterministic.