Corpus ID: 14770832

SPEAKER RECOGNITION USING GMM

@inproceedings{Kumar2010SPEAKERRU,
  title={SPEAKER RECOGNITION USING GMM},
  author={Gaurav Kumar and K. A. Prasad Raju and P. Satheesh},
  year={2010}
}
1) Sr. Assistant Professor , CSE Department, MVGR College of Engineering , Vizainagaram. emailgsk@gmail.com 2) Student, M.Tech (SE), Avanthi Institute of Engineering & Technology, Makavarapalem, Visakhapatnam. Kaprasad_raju@yahoo.com 3) Professor, Avanthi Institute of Engineering & Technology, Makavarapalem, Visakhapatnam, 4) Associate Professor, CSE Department ,MVGR College of Engineering, Vizainagaram. patchikolla@yahoo.com 

Figures from this paper

Investigation of distance effect on Gaussian Mixture Models in Speaker Identification
TLDR
The results show that MFCC has the best performance in feature extraction, and GMM has better recognition than MLP as total recognition in GMM is 93.15% and recognition in MLP is 88.06%, and the recognition rate decreases as the distance is increased from 1m to 3m. Expand
ARABIC SPEAKER RECOGNITION SYSTEM USING GAUSSIAN MIXTURE MODEL AND EM ALGORITHM
TLDR
An Arabic speaker recognition system has been developed for introducing conversion of the uttered Arabic speaker instantly after the utterance and it reported in providing 95.5% of recognition rate. Expand
TESPAR feature based isolated word speaker recognition system
  • M. Sher, N. Ahmad
  • Computer Science
  • 18th International Conference on Automation and Computing (ICAC)
  • 2012
TLDR
The results show that the speaker recognition systems with TESPAR features gives better performance with a high recognition rate and low computational complexity as compared with MFCC and LPC based features. Expand
A Survey of Speaker Recognition: Fundamental Theories, Recognition Methods and Opportunities
TLDR
This literature survey gives a concise introduction to ASR and provides an overview of the general architectures dealing with speaker recognition technologies, and upholds the past, present, and future research trends in this area. Expand
Speaker Identification Using a Novel Combination of Sparse Representation and Gaussian Mixture Models
TLDR
A novel combination of sparse representation and traditional Gaussian mixture models is presented which results in different accuracy for different sparsity and dictionary size. Expand
Hybrid Feature Selection Method Based on Harmony Search and Naked Mole-Rat Algorithms for Spoken Language Identification From Audio Signals
TLDR
A new hybrid Feature Selection (FS) algorithm have been developed using the versatile Harmony Search (HS) algorithm and a new nature-inspired algorithm called Naked Mole-Rat (NMR) algorithm to select the best subset of features and reduce the model complexity to help it train faster. Expand
Replay attack detection with auditory filter-based relative phase features
TLDR
This paper improves the discriminating ability of the RP feature by proposing two new auditory filter-based RP features for replay attack detection and a combination of the scores of these proposed RP features and a standard magnitude-based feature, that is, the constant Q transform cepstral coefficient (CQCC), is also applied to further improve the reliable detection decision. Expand
Replay attack detection using variable-frequency resolution phase and magnitude features
TLDR
A novel feature extraction method that leverages both the phase-based and magnitude-based features and fully utilizes the subband information and the complementary information from the phase and magnitude spectra is proposed. Expand
SOME NEW RESULTS ON AUTOMATIC IDENTIFICATION OF VIETNAMESE FOLK SONGS CHEO AND QUANHO
Vietnamese folk songs are very rich in genre and content. Identifying Vietnamese folk tunes will contribute to the storage and search for information about these tunes automatically. The paper willExpand
Dialogue Transcription using Gaussian Mixture Model in Speaker Diarization
Finding speaker turns and identifying the speakers is known as speaker diarization. In this study, the researchers integrate speaker diarization process with a speech-to-text task where trainingExpand
...
1
2
...

References

SHOWING 1-6 OF 6 REFERENCES
A mixture of Gaussians front end for speech recognition
TLDR
This paper describes a feature extraction technique based on fitting a Gaussian mixture model (GMM) to the speech spectral envelope and results using the Resource Management corpus, a medium vocabulary task are presented. Expand
Robust text-independent speaker identification using Gaussian mixture speaker models
TLDR
The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for modeling speaker identity and is shown to outperform the other speaker modeling techniques on an identical 16 speaker telephone speech task. Expand
Fundamentals of speech recognition [4]Chap
  • Fundamentals of speech recognition [4]Chap
  • 2003
On the application of the Bayesian approach to real forensic conditions with GMM - based systems , ” in 2001 : A Speaker Odyssey — The Speaker Recognition
  • H , Fundamentals of speech recognition
  • 2003
Speaker Recognition [4]: A Tutorial
  • Proceedings of The IEEE
  • 1997