Ashish Panda

Learn More
We investigate the problem of speaker verification in noisy conditions in this paper. Our work is motivated by the fact that environmental noise severely degrades the performance of speaker verification systems. We present a model compensation scheme based on the psychoacoustic principles that adapts the model parameters in order to reduce the training and(More)
We use similarities with people we know already as a means to enhance the speaker verification accuracy. Motivated by this, we use cosine distance similarities with a set of reference speakers, cosine distance features (CDF), to improve the performance of speaker verification systems for clean and additive noise test conditions. We used mel frequency(More)
  • Ashish Panda
  • The 9th International Symposium on Chinese Spoken…
  • 2014
This paper addresses the problem of speaker verification in the presence of additive noise for resource deficient languages. Psychoacoustic model compensation (Psy-Comp) has been shown to impart noise robustness to Gaussian Mixture Model (GMM) based speaker verification systems using Mel Frequency Cepstral Coefficients (MFCCs). This work extends the idea of(More)
This paper addresses the problem of speaker verification in the presence of additive noise. We propose a fast implementation of Psychoacoustic Model Compensation (Psy-Comp) scheme for static features along with model domain mean and variance normalization for robust speaker recognition in noisy conditions. The proposed algorithms are validated through(More)
Speaker verification (SV) systems need to be robust to mimicked voices of target speakers as non-target trials to make them usable in critical applications. However, the performance of SV systems for mimicked voice test conditions has not been extensively explored. In an earlier work, we used Amrita SRE database to evaluate the performance of different(More)
This paper addresses the problem of speech recognition in the presence of additive noise. It focuses on Psychoacoustic Model Compensation (Psy-Comp) scheme, which has been shown to be a powerful technique for noise robustness. It has further implemented model domain mean and variance normalization along with Psy-Comp to alleviate channel noise for robust(More)
In this paper, we address the problem of speech recognition in the presence of additive noise. We investigate the applicability and efficacy of auditory masking in devising a robust front end for noisy features. This is achieved by introducing a masking factor into the Vector Taylor Series (VTS) equations. The resultant first order VTS approximation is used(More)
Conversational speech not only contains several variants of neutral speech but is also prominently interlaced with several speaker generated non-speech sounds such as laughter and breath. A robust speaker recognition system should be capable of recognizing a speaker irrespective of these variations in his speech. An understanding of whether the(More)