DT-SV: A Transformer-based Time-domain Approach for Speaker Verification

Authors: Nan Zhang, Jianzong Wang, Zhenhou Hong, Chendong Zhao, Xiaoyang Qu, Jing Xiao
Published in: 2022 International Joint Conference on Neural Networks (IJCNN)

Speaker verification (SV) aims to determine whether the speaker of a test utterance is the same as the speaker of the reference speech. In the past few years, extracting speaker embeddings with deep neural networks has become the mainstream approach for SV systems. Recently, various attention mechanisms and Transformer networks have been widely explored in the SV field. However, applying the original Transformer directly to SV may waste frame-level information in the output features, which could lead to…

