Combining cross-stream and time dimensions in phonetic speaker recognition

Abstract

Recent studies show that phonetic sequences from multiple languages can provide effective features for speaker recognition. So far, only pronunciation dynamics in the time dimension, i.e., n-gram modeling on each of the phone sequences, have been examined. In the JHU 2002 Summer Workshop, we explored modeling the statistical pronunciation dynamics across…
DOI: 10.1109/ICASSP.2003.1202764

Statistics

[Chart: Citations per Year, 2004 to 2018]

54 Citations

Semantic Scholar estimates that this publication has 54 citations based on the available data.
