A non-speech audio CAPTCHA based on acoustic event detection and classification

  title={A non-speech audio CAPTCHA based on acoustic event detection and classification},
  author={Hendrik Meutzner and Dorothea Kolossa},
  journal={2016 24th European Signal Processing Conference (EUSIPCO)},
  • H. Meutzner, D. Kolossa
  • Published 1 August 2016
  • Computer Science
  • 2016 24th European Signal Processing Conference (EUSIPCO)
The completely automated public Turing test to tell computers and humans apart (CAPTCHA) represents an established method to prevent automated abuse of web services. Most websites provide an audio CAPTCHA - in addition to a conventional visual scheme - to facilitate access for a wider range of users. These audio CAPTCHAs are generally based on distorted speech, rendering the task difficult for untrained or non-native listeners, while still being vulnerable against attacks that make use of… 

Figures and Tables from this paper

Audio CAPTCHA Techniques: A Review
This paper is an attempt to understand existing work and its accessibility in the arena of audioCAPTCHA and explores the obstacles in use of audio CAPTCHA.
A Novel Design of Audio CAPTCHA for Visually Impaired Users
  • Mrim Alnfiai
  • Computer Science
    Int. J. Commun. Networks Inf. Secur.
  • 2020
The preliminary user study results suggest the new form of CAPTCHA called HearAct has a lot of potential for both blind and visual users and using gestures to solve theCAPTCHA challenge is the most preferable feature in the HearAct solution.
A Survey of Research on CAPTCHA Designing and Breaking Techniques
A comprehensive survey of recent developments for each CAPTCHA type in terms of usability, robustness and their weaknesses and strengths is presented and the attack methods for each category are summarized.


The Failure of Noise-Based Non-continuous Audio Captchas
Decaptcha's performance on actual observed and synthetic CAPT CHAs indicates that such speech CAPTCHAs are inherently weak and, because of the importance of audio for various classes of users, alternative audio CAPTChAs must be developed.
Using automatic speech recognition for attacking acoustic CAPTCHAs: the trade-off between usability and security
This work presents and analyzes an alternative CAPTCHA design that exploits specific capabilities of the human auditory system, i.e., auditory streaming and tolerance to reverberation and shows a far better trade-off between usability and security than the current quasi-standard approach of reCAPTCHA.
Breaking Audio CAPTCHAs
This work analyzed the security of current audio CAPTCHAs from popular Web sites by using AdaBoost, SVM, and k-NN, and achieved correct solutions for test samples with accuracy up to 71%.
Constructing Secure Audio CAPTCHAs by Exploiting Differences between Humans and Machines
This work proposes an audio CAPTCHAs that is far more robust against automated attacks than it is reported for current CAPTCHA schemes and assesses the intelligibility by means of a large-scale listening experiment.
The comparison between the deletion-based methods and the mixing-based methods for audio CAPTCHA systems
A deletion-based method (DBM) is proposed which uses the phonemic restoration effects of CAPTCHA to control the difficulty of tasks simply by the masking ratio.
Detection and classification of acoustic scenes and events: An IEEE AASP challenge
An overview of systems submitted to the public evaluation challenge on acoustic scene classification and detection of sound events within a scene as well as a detailed evaluation of the results achieved by those systems are provided.
Cepstral modulation ratio regression (CMRARE) parameters for audio signal analysis and classification
  • Rainer Martin, A. Nagathil
  • Computer Science
    2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • 2009
A set of eight parameters are used in a speech/music/noise classification task in which they achieve a classification accuracy which compares very well with other approaches including static and dynamic MFCCs.
Evaluating existing audio CAPTCHAs and an interface optimized for non-visual use
A new interface for solving CAPTCHAs optimized for non-visual use that can be added in-place to existing audio CAPT CHAs is developed and evaluated, illustrating a broadly applicable principle of accessible design: the most usable audio interfaces are often not direct translations of existing visual interfaces.
The SoundsRight CAPTCHA: an improved approach to audio human interaction proofs for blind users
Evaluation results from three rounds of usability testing document that the task success rate was higher than 90% for blind users, and Discussion, limitations, and suggestions for future research are presented.
A reverse turing test using speech
This paper describes a Reverse Turing Test using speech and presents a test that depends on the fact that human recognition of distorted speech is far more robust than automatic speech recognition techniques.