Learn More
—This letter proposes a channel-attentive Mel frequency cepstral coefficient (CAMFCC) method to improve the utilization of uncorrupted or more reliable frequency bands for robust speech recognition. This method obtains a channel attention matrix by reliability estimation of Mel filter bank channels, and both the input Mel frequency cepstral coefficients and(More)
Automatic Speech Recognition(ASR) systems are limited in the computational power and memory resources, especially in low-memory/low-power environments such as personal digital assistants. The parameter quantization is the one of the ways to achieve these conditions. In this work, we compare various subvector clustering procedures for the parameter(More)
This paper focuses on detection of a single emotion and verification of a specific emotion type in a test utterance. To utilize a probabilistic output of a classifier as well as to exploit various long term acoustic features, we built a prob-abilistic output SVM and applied several approximated log likelihood ratio tests for emotion verification.(More)
  • 1