Noise power spectral density estimation based on optimal smoothing and minimum statistics

@article{Martin2001NoisePS,
  title={Noise power spectral density estimation based on optimal smoothing and minimum statistics},
  author={Rainer Martin},
  journal={IEEE Trans. Speech Audio Process.},
  year={2001},
  volume={9},
  pages={504-512}
}
  • Rainer Martin
  • Published 2001
  • Computer Science, Mathematics
  • IEEE Trans. Speech Audio Process.
We describe a method to estimate the power spectral density of nonstationary noise when a noisy speech signal is given. The method can be combined with any speech enhancement algorithm which requires a noise power spectral density estimate. In contrast to other methods, our approach does not use a voice activity detector. Instead it tracks spectral minima in each frequency band without any distinction between speech activity and speech pause. By minimizing a conditional mean square estimation… Expand
Kalman filtering based noise power spectral density estimation for speech enhancement
TLDR
Objective and subjective performance evaluations show that the proposed scheme for estimating the power spectral density of nonstationary noise when a noisy speech signal is given exhibits a good noise tracking performance and that it achieves improvement in the quality of the enhanced speech as compared to the case where noise PSD estimate remains invariant across time. Expand
A novel fast nonstationary noise tracking approach based on MMSE spectral power estimator
TLDR
A novel noise PSD estimation algorithm based on minimum mean-square error (MMSE) is proposed, which exhibits more excellent noise tracking capability under various nonstationary noise environments and SNR conditions. Expand
Fast Nonstationary Noise Tracking Based on Log-Spectral Power MMSE Estimator and Temporal Recursive Averaging
TLDR
A novel noise PSD tracking algorithm, which employs a log-spectral power minimum mean square error (MMSE) estimator, which exhibits more excellent noise tracking capability under various nonstationary noise environments and SNR levels. Expand
Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging
  • I. Cohen
  • Mathematics, Computer Science
  • IEEE Trans. Speech Audio Process.
  • 2003
TLDR
It is shown that in nonstationary noise environments and under low SNR conditions, the IMCRA approach is very effective, compared to a competitive method, it obtains a lower estimation error, and when integrated into a speech enhancement system achieves improved speech quality and lower residual noise. Expand
Non-stationary noise power spectral density estimation based on regional statistics
TLDR
The proposed regional statistics consist of four features representing the statistics of the past and present periodograms in a short-time period that are efficient in characterizing the statistical difference between noise PSD and noisy speech PSD. Expand
Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay
TLDR
It is shown that the proposed speech presence probability (SPP) approach maintains the quick noise tracking performance of the bias compensated minimum mean-square error (MMSE)-based approach while exhibiting less overestimation of the spectral noise power and an even lower computational complexity. Expand
Online Noise Estimation Using Stochastic-Gain HMM for Speech Enhancement
TLDR
A noise estimation algorithm for single-channel noise suppression in dynamic noisy environments and an estimator of the noise power spectral density (PSD) based on recursive averaging of estimated noise sample spectra are developed. Expand
Noise Power Spectral Density Estimation based on Maximum a Posteriori and Generalized Gamma Distribution
Noise power spectral density (PSD) estimation is a crucial part of speech enhancement system due to its contributory effect on the quality of the noise reduced speech. A novel estimation method forExpand
Speech enhancement based on minimum mean-square error estimation and supergaussian priors
  • Rainer Martin
  • Mathematics, Computer Science
  • IEEE Transactions on Speech and Audio Processing
  • 2005
TLDR
Compared to algorithms based on the Gaussian assumption, such as the Wiener filter or the Ephraim and Malah (1984) MMSE short-time spectral amplitude estimator, the estimators based on these supergaussian densities deliver an improved signal-to-noise ratio. Expand
Noise power spectrum estimation using constrained variance spectral smoothing and minima tracking
TLDR
The heart of the proposed algorithm is a constrained variance smoothing (CVS) filter, which smoothes the noisy STPS independently of the noise level, which exhibits superior performance over the other recently proposed noise estimation algorithms. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 23 REFERENCES
Spectral Subtraction Based on Minimum Statistics
TLDR
An unbiased noise power estimator based on minimum statistics is derived and its statistical properties and its performance in the context of spectral subtraction are discussed. Expand
Speech Enhancement Using a-Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator
TLDR
This paper derives a minimum mean-square error STSA estimator, based on modeling speech and noise spectral components as statistically independent Gaussian random variables, which results in a significant reduction of the noise, and provides enhanced speech with colorless residual noise. Expand
Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments
  • D. Malah, R. Cox, A. Accardi
  • Computer Science
  • 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258)
  • 1999
TLDR
The estimated q's are used to control both the gain and the update of the estimated noise spectrum during speech presence in a modified MMSE log-spectral amplitude estimator, which resulted in higher scores than for the IS-127 standard enhancement algorithm, when pre-processing noisy speech for a coding application. Expand
Enhancement of speech corrupted by acoustic noise
TLDR
This paper describes a method for enhancing speech corrupted by broadband noise based on the spectral noise subtraction method, which can automatically adapt to a wide range of signal-to-noise ratios, as long as a reasonable estimate of the noise spectrum can be obtained. Expand
Computationally efficient speech enhancement by spectral minima tracking in subbands
TLDR
An eecient algorithm for the enhancement of speech signals which are heavily corrupted by short-time stationary, acoustically or electrically added disturbances is presented, based on spectral amplitude estimation using an overlap-add FFT lter bank system. Expand
Speech enhancement using a soft-decision noise suppression filter
One way of enhancing speech in an additive acoustic noise environment is to perform a spectral decomposition of a frame of noisy speech and to attenuate a particular spectral line depending on howExpand
A voice activity detector employing soft decision based noise spectrum adaptation
  • Jongseo Sohn, Wonyong Sung
  • Computer Science
  • Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)
  • 1998
TLDR
A novel noise spectrum adaptation algorithm using the soft decision information of the proposed decision rule is developed, which is robust, especially for the time-varying noise such as babble noise. Expand
Noise estimation techniques for robust speech recognition
TLDR
Two new techniques are presented to estimate the noise spectra or the noise characteristics for noisy speech signals and can be combined with a nonlinear spectral subtraction scheme to enhance noisy speech and to improve the performance of speech recognition systems. Expand
Kammeyer \Comparison of one-and two-channel noise-estimation techniques
TLDR
Several one-channel and two-channel noise+stimation techniques are compared on their estimation features in a non-stationary noisy environment while a speech signal is present, as this is one of the unsolved problems for spectral subtraction algorithms. Expand
A 2.4 kbit/s MELP coder candidate for the new U.S. Federal Standard
TLDR
The enhanced MELP speech coder is described, which is a candidate for the new U.S. Federal Standard at 2.4 kbits/s and has been optimized for performance in acoustic background noise and in channel errors, as well as for efficient real-time implementation. Expand
...
1
2
3
...