• Corpus ID: 17167258

The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load

Björn Schuller, Stefan Steidl, Anton Batliner, Julien Epps, Florian Eyben, Fabien Ringeval, Erik Marchi, Yue Zhang
The INTERSPEECH 2014 Computational Paralinguistics Challenge provides for the first time a unified test-bed for the automatic recognition of speakers’ cognitive and physical load in speech. In this paper, we describe these two Sub-Challenges, their conditions, baseline results and experimental procedures, as well as the ComParE baseline features generated with the openSMILE toolkit and provided to the participants in the Challenge.

The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, Parkinson's & eating condition

Three sub-challenges are described: the estimation of the degree of nativeness, the neurological state of patients with Parkinson’s condition, and the eating conditions of speakers, i.

The INTERSPEECH 2016 Computational Paralinguistics Challenge: Deception, Sincerity & Native Language

The INTERSPEECH 2016 Computational Paralinguistics Challenge addresses three different problems for the first time in research competition under well-defined conditions: classification of deceptive

The INTERSPEECH 2017 Computational Paralinguistics Challenge: Addressee, Cold & Snoring

These sub-challenges, their conditions, and the baseline feature extraction and classifiers are described, which include data-learnt feature representations by end-to-end learning with convolutional and recurrent neural networks, and bag-of-audiowords for the first time in the challenge series.

Prediction of cognitive load from speech with the VOQAL voice quality toolbox for the interspeech 2014 computational paralinguistics challenge

The UCL system evaluates whether additional voice features computed by the VOQAL voice analysis toolbox improve performance over the baseline feature set, finding no benefit for the test set.

CoLoSS: Cognitive Load Corpus with Speech and Performance Data from a Symbol-Digit Dual-Task

A new corpus named CoLoSS (Cognitive Load by Speech and performance data in a Symbol-digit dual-task) is presented, containing speech under cognitive load recorded in a learning task scenario; the effects of cognitive load on prosodic and voice quality features are investigated using the corpus.


The most efficient computer-based system for detection and classification of the corresponding acoustical paralinguistic events is developed, and the architecture of this system, its main modules and methods are described.

Fisher vectors with cascaded normalization for paralinguistic analysis

This paper addresses the variability compensation issue by proposing a novel method composed of i) Fisher vector encoding of low level descriptors extracted from the signal, ii) speaker z-normalization applied after speaker clustering, and iii) non-linear normalization of features.

Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load

A novel self-supervised audio representation is designed and evaluated that combines the effectiveness of handcrafted (DSP-based) features with the complexity of data-driven DNN representations; it outperforms both extensive handcrafted feature sets and novel DNN-based audio representation learning approaches.

Classification of cognitive load from speech using an i-vector framework

The goal in this work is to automatically classify speakers’ level of cognitive load from a standard battery of reading tasks requiring varying levels of working memory using an i-vector framework that affords a systematic way to factorize the multiple sources of variability.

Convolutional Neural Networks with Data Augmentation for Classifying Speakers' Native Language

We use a feedforward Convolutional Neural Network to classify speakers’ native language for the INTERSPEECH 2016 Computational Paralinguistic Challenge Native Language SubChallenge, using no



The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism

The INTERSPEECH 2013 Computational Paralinguistics Challenge provides for the first time a unified test-bed for Social Signals such as laughter in speech. It further introduces conflict in group

Computational Paralinguistics: Emotion, Affect and Personality in Speech and Language Processing

This book presents the methods, tools and techniques that are currently being used to recognise (automatically) the affect, emotion, personality and everything else beyond linguistics

The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production

A spoken language resource is introduced for the analysis of the impact that physical exercising has on human speech production and of the feasibility of automatically estimating heart rate from the human voice, in particular from sustained vowels.

Investigation of spectral centroid features for cognitive load classification

Multimodal behavior and interaction as indicators of cognitive load

A multimodal fusion model to determine the user's cognitive load in real time is presented, which overcomes problems of intrusiveness and increases applicability in real-world environments, while adapting information selection and presentation in a dynamic computer interface with reference to load.

Instructional control of cognitive load in the training of complex cognitive tasks

Limited processing capacity constrains learning and performance in complex cognitive tasks. In traditional instruction, novices' failure to adequately learn cognitive tasks can often be attributed to

Getting started with SUSAS: a speech under simulated and actual stress database

The motivation for this paper is to familiarize the speech community with SUSAS, which was released in April 1997 on CD-ROM through NATO and is intended for the study of how speech production and recognition vary when speaking under stressed conditions.

Working memory span tasks: A methodological review and user’s guide

The genesis of these tasks and how and why they came to be so influential are reviewed, the reliability and validity of the tasks are addressed, and more technical aspects, such as optimal administration and scoring procedures, are considered.

Individual differences in working memory and reading