Learn More
The Modified Discrete Cosine Transform (MDCT) is widely used in audio signals compression, but mostly limited to representing audio signals. This is because the MDCT is a real transform: Phase information is missing and spectral power varies frame to frame even for pure sine waves. We have a key observation concerning the structure of the MDCT spectrum of a(More)
Speech emotion as anger, boredom, fear, gladness, etc. is high semantic information and its automatic analysis may have many applications such as smart human-computer interactions or multimedia indexing. Main difficulties for an efficient speech emotion classification reside in complex emotional class borders leading to necessity of appropriate audio(More)
Fuzzy kappa for the agreement measure of fuzzy classifications Weibei Dou , Yuan Ren, Qian Wu, Su Ruan, Yanping Chen, Daniel Bloyet, JeanMarc Constans Department of Electronic Engineering, Tsinghua University, 100084 Beijing, China GREYC-CNRS UMR 6072, 6 Boulevard Maréchal Juin, 14050 Caen, France cCReSTIC, 9 Rue de Qubec,10026 Troyes, France Imaging(More)
This paper deals with speech emotion analysis within the context of increasing awareness of the wide application potential of affective computing. Unlike most works in the literature which mainly rely on classical frequency and energy based features along with a single global classifier for emotion recognition, we propose in this paper some new harmonic and(More)
As bearer of high level semantics, audio signal is being more and more used in content-based multimedia retrieval. In this paper, we investigate the ball hit detection for sports games and propose a novel approach to detect ball hits. By employing Energy Peak Detection (EPD) and Mel Frequency Cepstral Coefficient-based (MFCC-based) Refinement (MBR), high(More)
A framework of fuzzy information fusion is proposed in this paper to automatically segment tumor areas of human brain from multispectral magnetic resonance imaging (MRI) such as T1-weighted, T2-weighted and proton density (PD) images. A priori knowledge about tumors described by radiology experts for different types of MRI are very helpful to guide a(More)
Music mood present the inherent emotional state of music on certain duration of music segment. However, the mood may vary in the music pieces. Thus it is important to investigate the duration of music segments which can best present the stable mood states in music. Four versions of music datasets with duration of clips from 4 seconds to 32 seconds are(More)
The purpose of this paper is to make an automatic classification of speech into seven emotional classes as anger, boredom, disgust, fear, gladness, neutral and sadness. A two-stage classification composed of several sub-classifiers is proposed. A feature set with 68 features has been computed over 286 speech samples from the Berlin database. The sequential(More)
This paper deals with speech emotion analysis within the context of increasing awareness of the wide application potential of affective computing. Unlike most works in the literature which mainly rely on classical frequency and energy based features along with a single global classifier for emotion recognition, we propose in this paper some new harmonic and(More)
Feature subset selection is an important subject when training classifiers in Machine Learning (ML) problems. Too many input features in a ML problem may lead to the so-called “curse of dimensionality”, which describes the fact that the complexity of the classifier parameters adjustment during training increases exponentially with the number of features.(More)