PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise

@article{Gonzalez2014PEFACA,
  title={PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise},
  author={Sira Gonzalez and Mike Brookes},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  year={2014},
  volume={22},
  pages={518-530}
}
  • S. Gonzalez, M. Brookes
  • Published 1 February 2014
  • Computer Science
  • IEEE/ACM Transactions on Audio, Speech, and Language Processing
We present PEFAC, a fundamental frequency estimation algorithm for speech that is able to identify voiced frames and estimate pitch reliably even at negative signal-to-noise ratios. The algorithm combines a normalization stage, to remove channel dependency and to attenuate strong noise components, with a harmonic summing filter applied in the log-frequency power spectral domain, the impulse response of which is chosen to sum the energy of the fundamental frequency harmonics while attenuating… 
Voiced/Unvoiced Classification and Pitch Estimation Based on Amplitude Compression Filter
TLDR
The simulated results show that the proposed method efficiently reduces voiced/unvoiced and pitch estimation error, and it is superior to some of the state-of-the-art method in the real environment.
A Pitch Estimation Method Robust to High Levels of Noise
TLDR
By simulation experiments, it is shown that the proposed pitch detection method has more accurate and more low algorithm complexity than the traditional methods at both high and low SNR.
Active speech level estimation in noisy signals with quadrature noise suppression
TLDR
A noise-robust algorithm for estimating the active level of speech, which is the average speech power during intervals of speech activity, is presented, which gives consistently lower errors than previous methods and than the ITU-T P.56 algorithm.
Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering
TLDR
A new algorithm is presented that integrates the three subtasks of voiced speech, the estimation of the fundamental frequency and the tracking of pitch values over time into a single procedure and compares favorably with current, state-of-the-art pitch detection algorithms.
Fundamental Frequency Informed Speech Enhancement in a Flexible Statistical Framework
TLDR
A statistical estimator is derived that explicitly takes into account information about the characteristic structure of voiced speech by means of a harmonic signal model and outperforms several reference algorithms in terms of speech quality and intelligibility as predicted by instrumental measures.
Combining Zero Replacement Speech Enhancement with Lag Window Method for Pitch Detection
TLDR
An anti- noise pitch detection method that combines a speech enhancement algorithm with a spectral flattening algorithm is proposed that has the lowest gross pitch error (GPE) rate among all the methods when dealing with white-noise added male speeches.
Estimation of speaker individual spectral envelope for pitch tracking improvement
TLDR
This paper proposes to overcome some limitations of the PEFAC algorithm by employing an alternative enhancement procedure, which uses an estimation of the individual spectral envelope instead of using a universal function.
Robust Estimation of Fundamental Frequency Using Single Frequency Filtering Approach
A new method for robust estimation of fundamental frequency (F0) from speech signal is proposed in this paper. The method exploits the high SNR regions of speech in time and frequency domains in the
A Pitch-Synchronous Simultaneous Detection-Estimation Framework for Speech Enhancement
TLDR
A pitch-synchronous stochastic-deterministic estimator outperforms several benchmark methods in terms of speech intelligibility and perceived quality predicted by instrumental measures for various noise types and different signal-to-noise ratios.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 55 REFERENCES
A Pitch Estimation Filter robust to high levels of noise (PEFAC)
TLDR
PEFAC is presented, a fundamental frequency estimation algorithm that is able to identify the pitch of voiced frames reliably even at negative signal to noise ratios, and performs exceptionally well in both high and low levels of additive noise.
Maximum a-posteriori probability pitch tracking in noisy environments using harmonic model
TLDR
This paper presents an optimal estimation procedure for sound signals (such as speech) that are modeled by harmonic sources, and achieves more robust and accurate estimation of voiced speech parameters.
YIN, a fundamental frequency estimator for speech and music.
An algorithm is presented for the estimation of the fundamental frequency (F0) of speech or musical sounds. It is based on the well-known autocorrelation method with a number of modifications that
Evaluation of pitch estimation in noisy speech for application in non-intrusive speech quality assessment
TLDR
This paper evaluates the performance of four established state-of-the-art algorithms for pitch estimation in additive noise and reverberation and shows how accurate estimation of the pitch of a speech signal can influence objective speech quality measurement algorithms.
HMM-Based Multipitch Tracking for Noisy and Reverberant Speech
  • Z. Jin, Deliang Wang
  • Computer Science, Engineering
    IEEE Transactions on Audio, Speech, and Language Processing
  • 2011
TLDR
This paper proposes a robust algorithm for multipitch tracking in the presence of both background noise and room reverberation, which can reliably detect single and double pitch contours in noisy and reverberant conditions.
Super resolution pitch determination of speech signals
TLDR
Based on a new similarity model for the voice excitation process, a novel pitch determination procedure is derived that has infinite (super) resolution, better accuracy than the difference limen for F/sub 0/, robustness to noise, reliability, and modest computational complexity.
Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model
  • A. Klapuri
  • Computer Science
    IEEE Transactions on Audio, Speech, and Language Processing
  • 2008
TLDR
The proposed method outperformed two reference methods in the evaluations and showed a high level of robustness in processing signals where important parts of the audible spectrum were deleted to simulate bandlimited interference.
Joint High-Resolution Fundamental Frequency and Order Estimation
TLDR
The presented method for joint estimation of the fundamental frequency and order of a set of harmonically related sinusoids based on the multiple signal classification (MUSIC) estimation criterion is shown to have an efficient implementation using fast Fourier transforms.
...
1
2
3
4
5
...