Duration and pronunciation conditioned lexical modeling for speaker verification


We propose a method to improve speaker recognition lexical model performance using acoustic-prosodic information. More specifically, the lexical model is trained using durationand pronunciation-conditioned word N-grams, simultaneously modeling lexical information along with their acoustic and prosodic characteristics. Support vector machines are used for… (More)


3 Figures and Tables

Slides referencing similar topics