AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies
@article{Thao2020AttendAffectNetSB, title={AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies}, author={Ha Thi Phuong Thao and Balamurali B.T. and Dorien Herremans and Gemma Roig}, journal={ArXiv}, year={2020}, volume={abs/2010.11188} }
In this work, we propose different variants of a self-attention based network for emotion prediction from movies, which we call AttendAffectNet. We take both audio and video into account and incorporate the relation among multiple modalities by applying the self-attention mechanism in a novel manner to the extracted features for emotion prediction. We compare it to the typical temporal integration of the self-attention based model, which, in our case, allows capturing the relation of temporal…
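To make the idea of applying self-attention across modalities concrete, here is a minimal NumPy sketch. It is not the authors' architecture: the feature sizes, the random projection matrices, the mean-pooling fusion step, and the two-value output head are all assumptions chosen for illustration. It only shows one audio and one video feature vector being treated as two "tokens" that attend to each other through scaled dot-product self-attention.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(tokens, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention.

    tokens: array of shape (n_tokens, d_model), one row per modality here.
    Returns the attended tokens and the (n_tokens, n_tokens) weight matrix.
    """
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    weights = softmax(scores, axis=-1)
    return weights @ v, weights


rng = np.random.default_rng(0)
d_model = 8  # assumed shared embedding size

# Hypothetical pre-extracted features for one movie excerpt.
audio_feat = rng.normal(size=16)  # assumed audio feature dimension
video_feat = rng.normal(size=32)  # assumed video feature dimension

# Project each modality to the shared size (random weights stand in
# for learned ones).
w_audio = rng.normal(size=(16, d_model))
w_video = rng.normal(size=(32, d_model))
tokens = np.stack([audio_feat @ w_audio, video_feat @ w_video])

# Attention over the modality axis: each modality attends to both.
w_q, w_k, w_v = (rng.normal(size=(d_model, d_model)) for _ in range(3))
attended, weights = self_attention(tokens, w_q, w_k, w_v)

# Assumed fusion and output head: average the attended tokens, then map
# them linearly to two illustrative emotion scores.
fused = attended.mean(axis=0)
w_out = rng.normal(size=(d_model, 2))
prediction = fused @ w_out
```

Each row of `weights` sums to 1 and describes how much one modality draws on the other. In the temporal variant the abstract mentions, the tokens would instead be features from successive time steps.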