Joint Factor Analysis Versus Eigenchannels in Speaker Recognition

Abstract

We compare two approaches to the problem of session variability in Gaussian mixture model (GMM)-based speaker verification, eigenchannels, and joint factor analysis, on the National Institute of Standards and Technology (NIST) 2005 speaker recognition evaluation data. We show how the two approaches can be implemented using essentially the same software at all stages except for the enrollment of target speakers. We demonstrate the effectiveness of zt-norm score normalization and a new decision criterion for speaker recognition which can handle large numbers of t-norm speakers and large numbers of speaker factors at little computational cost. We found that factor analysis was far more effective than eigenchannel modeling. The best result we obtained was a detection cost of 0.016 on the core condition (all trials) of the evaluation

DOI: 10.1109/TASL.2006.881693

Extracted Key Phrases

10 Figures and Tables

050'05'06'07'08'09'10'11'12'13'14'15'16'17
Citations per Year

548 Citations

Semantic Scholar estimates that this publication has 548 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Kenny2007JointFA, title={Joint Factor Analysis Versus Eigenchannels in Speaker Recognition}, author={Patrick Kenny and Gilles Boulianne and Pierre Ouellet and Pierre Dumouchel}, journal={IEEE Transactions on Audio, Speech, and Language Processing}, year={2007}, volume={15}, pages={1435-1447} }