Combining speech recognition and acoustic word emotion models for robust text-independent emotion recognition

@article{Schuller2008CombiningSR,
  title={Combining speech recognition and acoustic word emotion models for robust text-independent emotion recognition},
  author={Bj{\"o}rn W. Schuller and Bogdan Vlasenko and Dejan Arsic and Gerhard Rigoll and Andreas Wendemuth},
  journal={2008 IEEE International Conference on Multimedia and Expo},
  year={2008},
  pages={1333-1336}
}
Recognition of emotion in speech usually uses acoustic models that ignore the spoken content. Likewise one general model per emotion is trained independent of the phonetic structure. Given sufficient data, this approach seemingly works well enough. Yet, this paper tries to answer the question whether acoustic emotion recognition strongly depends on phonetic content, and if models tailored for the spoken unit can lead to higher accuracies. We therefore investigate phoneme-, and word-models by… CONTINUE READING
Highly Cited
This paper has 40 citations. REVIEW CITATIONS