Text-independent speaker identification using vocal tract length normalization for building universal background model

@inproceedings{Sarkar2009TextindependentSI,
  title={Text-independent speaker identification using vocal tract length normalization for building universal background model},
  author={Achintya Kumar Sarkar and Srinivasan Umesh and Shakti Prasad Rath},
  booktitle={INTERSPEECH},
  year={2009}
}
In this paper, we propose to use Vocal Tract Length Normalization (VTLN) to build the Universal Background Model (UBM) for a closed-set speaker identification system. Vocal Tract Length (VTL) differences among speakers are a major source of variability in the speech signal. Since the UBM is trained using data from many speakers, it statistically captures this inherent variation in the speech signal, which results in a “coarse” model in the acoustic space. This may cause the adapted speaker…
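
The excerpt above does not say which warping function the authors use. As a rough illustration of what VTLN does to the frequency axis, the sketch below implements a piecewise-linear warp, which is one common choice and is an assumption here, not the paper's stated method. The function name `piecewise_linear_warp`, the 8 kHz Nyquist frequency, and the `knee` fraction are all hypothetical values chosen for the example.

```python
import numpy as np

def piecewise_linear_warp(freqs_hz, alpha, f_nyquist=8000.0, knee=0.85):
    """Warp a frequency axis by factor `alpha` (illustrative sketch only).

    Below a breakpoint the axis is scaled by `alpha`; above it, a second
    linear segment maps the Nyquist frequency onto itself so that the
    warped axis stays inside the original band.
    """
    freqs_hz = np.asarray(freqs_hz, dtype=float)
    # Breakpoint chosen (assumption) so the scaled segment never passes Nyquist.
    f0 = knee * f_nyquist * min(1.0, 1.0 / alpha)
    # Segment 1: plain scaling by alpha.
    low = alpha * freqs_hz
    # Segment 2: straight line from (f0, alpha * f0) to (f_nyquist, f_nyquist).
    slope = (f_nyquist - alpha * f0) / (f_nyquist - f0)
    high = alpha * f0 + slope * (freqs_hz - f0)
    return np.where(freqs_hz <= f0, low, high)

# Example: warp a few frequencies with a hypothetical factor of 1.1.
warped = piecewise_linear_warp([500.0, 1000.0, 4000.0, 8000.0], alpha=1.1)
```

With `alpha = 1.0` the warp is the identity. In a VTLN setup the warp factor would typically be estimated per speaker and applied to the filterbank before features are extracted for UBM training, though the details of that estimation are not given in the excerpt.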

Citations

Publications citing this paper.

Data-driven tree structure based UBM reconstruction for speaker verification

  • The 9th International Symposium on Chinese Spoken Language Processing
  • 2014

VOCAL TRACT NORMALISATION IN COMPUTER GAMES

Mariusz Ziolko, Mariusz Mąsior, Bartosz Ziolko, Magdalena Igras
  • 2013

A Study on Universal Background Model Training in Speaker Verification

  • IEEE Transactions on Audio, Speech, and Language Processing
  • 2011

References

Publications referenced by this paper.

Constrained MLLR for Speaker Recognition

  • 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07
  • 2007

The MIT Mobile Device Speaker Verification Corpus: Data Collection and Preliminary Experiments

  • 2006 IEEE Odyssey - The Speaker and Language Recognition Workshop
  • 2006

NIST’04 Speaker Recognition Evaluation Campaign: New LIA Speaker Detection Platform based on ALIZE Toolkit

J. F. Bonastre, N. Scheffer, C. Fredouille, D. Matrouf
  • NIST SRE’04 Workshop, Toledo, Spain, Jun. 2004
  • 2004

HTK Book

S. Young, D. Kershaw, J. Odell, V. Valtchev, P. Woodland
  • Copyright 2001-2006 CUED.
  • 2001

A frequency warping approach to speaker normalization

  • IEEE Trans. Speech and Audio Processing
  • 1998