Replay attack detection with auditory filter-based relative phase features

  title={Replay attack detection with auditory filter-based relative phase features},
  author={Zeyan Oo and Longbiao Wang and Khomdet Phapatanaburi and Meng Liu and Seiichi Nakagawa and Masahiro Iwahashi and Jianwu Dang},
  journal={EURASIP Journal on Audio, Speech, and Music Processing},
There are many studies on detecting human speech from artificially generated speech and automatic speaker verification (ASV) that aim to detect and identify whether the given speech belongs to a given speaker. Recent studies demonstrate the success of the relative phase (RP) feature in speaker recognition/verification and the detection of synthesized speech and converted speech. However, there are few studies that focus on the RP feature for replay attack detection. In this paper, we improve… Expand
Replay Attack Detection Using Linear Prediction Analysis-Based Relative Phase Features
This paper proposes two novel LPA- based relative phase features, namely, linear prediction residual-based relative phase (LPR-RP) and linear prediction analysis estimated speech-basedrelative phase ( LPAES-RP). Expand
Linear Prediction Residual-Based Constant-Q Cepstral Coefficients for Replay Attack Detection
This paper proposes a new linear prediction residual-based constant-Q cepstral coefficients (LPR-CQCC) feature for replay attack detection (RAD) that outperformed baseline CQCC on the development set and was very close to CQ CC on the evaluation set. Expand
A multi-branch ResNet with discriminative features for detection of replay speech signals
Nowadays, the security of ASV systems is increasingly gaining attention. As one of the common spoofing methods, replay attacks are easy to implement but difficult to detect. Many researchers focus onExpand
Replay attack detection using variable-frequency resolution phase and magnitude features
A novel feature extraction method that leverages both the phase-based and magnitude-based features and fully utilizes the subband information and the complementary information from the phase and magnitude spectra is proposed. Expand
Automatic speaker verification systems and spoof detection techniques: review and analysis
The paper makes best efforts to give the complete idea of ASV to new comers to this area and puts some light on some of the spoofing attacks that can be targeted during implementation of the future ASV systems. Expand
EarArray: Defending against DolphinAttack via Acoustic Attenuation
EarArray is a lightweight method that can not only detect inaudible voice commands but also identify the direction of attackers without requiring any extra hardware or hardware modification. Expand


Detecting Converted Speech and Natural Speech for anti-Spoofing Attack in Speaker Recognition
Experiments show that the performance of the features derived from phase spectrum outperform the melfrequency cepstral coefficients (MFCCs) tremendously: even without converted speech for training, the equal error rate (EER) is reduced from 20.20% of MFCCs to 2.35%. Expand
Spoofing Speech Detection Using Modified Relative Phase Information
Modified relative phase (MRP) information extracted from a Fourier spectrum is proposed for spoofing speech detection and it significantly outperforms the MFCC, modified group delay, and other phase information based features. Expand
Relative phase information for detecting human speech and spoofed speech
In this study, relative phase information extracted from a Fourier spectrum is used to detect human and spoofed speech and significantly outperforms the MFCC and modified group delay. Expand
Novel Variable Length Teager Energy Separation Based Instantaneous Frequency Features for Replay Detection
A novel replay detector based on Variable length Teager Energy OperatorEnergy Separation Algorithm-Instantaneous Frequency Cosine Coefficients (VESA-IFCC) for the ASV spoof 2017 challenge is proposed and the performance of the proposed feature set is compared with the features developed for detecting synthetic and voice converted speech. Expand
Multiple Phase Information Combination for Replay Attacks Detection
This paper applies the mel-scale relative phase feature and source-filter vocal tract feature in phase domain for replay attacks detection and achieves 55.6% relative error reduction rate than the conventional magnitude-based feature. Expand
DNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification
Simultaneous enhancement of amplitude and phase based feature was effective, and it achieved about 24% relative error reduction comparing with individual feature enhancement. Expand
Spoofing detection goes noisy: An analysis of synthetic speech detection in the presence of additive noise
A significant gap is revealed between the performance of state-of-the-art spoofing detectors between clean and noisy conditions and a study with two score fusion strategies shows that combining different feature based systems improves recognition accuracy for known and unknown attacks in both clean and noise conditions. Expand
Auditory Filterbank Learning for Temporal Modulation Features in Replay Spoof Speech Detection
A standalone replay spoof speech detection system to classify the natural vs. replay speech signals using Convolutional Restricted Boltzmann Machine (ConvRBM) with the pre-emphasized speech signals. Expand
Spoof Detection Using Source, Instantaneous Frequency and Cepstral Features
The main focus of this work is on exploiting the differences in the speech-specific nature of genuine speech signals and spoofed speech signals generated by replay attacks. Expand
Exploration of Compressed ILPR Features for Replay Attack Detection
This work explores the use of discrete cosine transform compressed integrated linear prediction residual (ILPR) features for discriminating between genuine and replayed signals in the problem of detecting replay attacks on speaker verification systems. Expand