Emotion Recognition from Variable-Length Speech Segments Using Deep Learning on Spectrograms

@inproceedings{Ma2018EmotionRF,
  title={Emotion Recognition from Variable-Length Speech Segments Using Deep Learning on Spectrograms},
  author={Xi Ma and Zhiyong Wu and Jia Jia and Mingxing Xu and Helen M. Meng and Lianhong Cai},
  booktitle={INTERSPEECH},
  year={2018}
}
In this work, an approach of emotion recognition is proposed for variable-length speech segments by applying deep neutral network to spectrograms directly. The spectrogram carries comprehensive para-lingual information that are useful for emotion recognition. We tried to extract such information from spectrograms and accomplish the emotion recognition task by combining Convolutional Neural Networks (CNNs) with Recurrent Neural Networks (RNNs). To handle the variablelength speech segments, we… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-6 OF 6 CITATIONS

Weighted Feature Fusion Based Emotional Recognition for Variable-length Speech using DNN

  • 2019 15th International Wireless Communications & Mobile Computing Conference (IWCMC)
  • 2019
VIEW 8 EXCERPTS
CITES RESULTS, METHODS & BACKGROUND
HIGHLY INFLUENCED

A Path Signature Approach for Speech Emotion Recognition

Bo Wang, Maria Liakata, +3 authors Kate E. A. Saunders
  • INTERSPEECH 2019
  • 2019
VIEW 1 EXCERPT
CITES METHODS

Speech Emotion Recognition Using Capsule Networks

  • ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2019
VIEW 2 EXCERPTS
CITES BACKGROUND

USING CAPSULE NETWORKS

VIEW 2 EXCERPTS
CITES BACKGROUND

References

Publications referenced by this paper.
SHOWING 1-10 OF 17 REFERENCES

Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks

  • 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2015
VIEW 2 EXCERPTS
HIGHLY INFLUENTIAL