Joint Training of Expanded End-to-End DNN for Text-Dependent Speaker Verification


We propose an expanded end-to-end DNN architecture for speaker verification based on b-vectors as well as d-vectors. We embedded the components of a speaker verification system such as modeling frame-level features, extracting utterance-level features, dimensionality reduction of utterancelevel features, and trial-level scoring in an expanded end-toend DNN… (More)


4 Figures and Tables