• Corpus ID: 246240353

Synthetic speech detection using meta-learning with prototypical loss

@article{Pal2022SyntheticSD,
  title={Synthetic speech detection using meta-learning with prototypical loss},
  author={Monisankha Pal and Aditya Raikar and Ashish Panda and Sunil Kumar Kopparapu},
  journal={ArXiv},
  year={2022},
  volume={abs/2201.09470}
}
Recent works on speech spoofing countermeasures still lack generalization ability to unseen spoofing attacks. This is one of the key issues of ASVspoof challenges especially with the rapid development of diverse and high-quality spoofing algorithms. In this work, we address the generalizability of spoofing detection by proposing prototypical loss under the meta-learning paradigm to mimic the unseen test scenario during training. Prototypical loss with metric-learning objectives can learn the… 

Tables from this paper

References

SHOWING 1-10 OF 34 REFERENCES
Replay and Synthetic Speech Detection with Res2Net Architecture
  • Xu Li, N. Li, H. Meng
  • Computer Science
    ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2021
TLDR
The Res2Net model consistently outperforms ResNet34 and ResNet50 by a large margin in both physical access (PA) and logical access (LA) of the ASVspoof 2019 corpus and the constant-Q transform (CQT) achieves the most promising performance in both PA and LA scenarios.
Generalization of Audio Deepfake Detection
TLDR
This paper focuses on overcoming the issue of generalization ability of spoofing countermeasures by using large margin cosine loss function (LMCL) and online frequency masking augmentation to force the neural network to learn more robust feature embeddings.
CRIM's System Description for the ASVSpoof2021 Challenge
TLDR
The results show that using codec augmentation, activation ensemble and HOSP technique can help the system to be more robust against trials with adversarial conditions, and further improvement could be made by performing score-level fusion among different systems.
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
  • J. Yamagishi, Xin Wang, H. Delgado
  • Computer Science
    2021 Edition of the Automatic Speaker Verification and Spoofing Countermeasures Challenge
  • 2021
TLDR
Results for the physical access task show the difficulty in detecting attacks in real, variable physical spaces, and the introduction of channel and compression variability which compound the difficulty.
One-Class Learning Towards Synthetic Voice Spoofing Detection
TLDR
This work proposes an anti-spoofing system to detect unknown synthetic voice spoofing attacks (i.e., text-to-speech or voice conversion) using one-class learning, which achieves an equal error rate (EER) and outperforming all existing single systems.
The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection
TLDR
ASVspoof 2017, the second in the series, focused on the development of replay attack countermeasures and indicates that the quest for countermeasures which are resilient in the face of variable replay attacks remains very much alive.
Data Augmenting Contrastive Learning of Speech Representations in the Time Domain
TLDR
WavAugment is intro-duce, a time-domain data augmentation library which is adapt and optimize for the specificities of CPC (raw waveform input, contrastive loss, past versus future structure), and finds that applying augmentation only to the segments from which the CPC prediction is performed yields better results.
Data Augmentation with Signal Companding for Detection of Logical Access Attacks
TLDR
It is found that the proposed data augmentation technique based on signal companding outperforms the state-of-the-art spoofing countermeasures showing ability to handle unknown nature of attacks.
A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection
TLDR
A comparison of countermeasure models on the ASVspoof 2019 logical access scenario takes into account common strategies to deal with input trials of varied length, recently proposed marginbased training criteria, and widely used front ends.
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
TLDR
The 2019 database, protocols and challenge results are described, and major findings which demonstrate the real progress made in protecting against the threat of spoofing and fake audio are outlined.
...
...