Action Recognition using Visual Attention
@article{Sharma2015ActionRU, title={Action Recognition using Visual Attention}, author={Shikhar Sharma and Ryan Kiros and R. Salakhutdinov}, journal={ArXiv}, year={2015}, volume={abs/1511.04119} }
We propose a soft attention based model for the task of action recognition in videos. [...] Key Method The model essentially learns which parts in the frames are relevant for the task at hand and attaches higher importance to them. We evaluate the model on UCF-11 (YouTube Action), HMDB-51 and Hollywood2 datasets and analyze how the model focuses its attention depending on the scene and the action being performed.Expand
Supplemental Code
Github Repo
Via Papers with Code
Action recognition using soft attention based deep recurrent neural networks
Figures, Tables, and Topics from this paper
494 Citations
Improving human action recognitionby temporal attention
- Computer Science
- 2017 IEEE International Conference on Image Processing (ICIP)
- 2017
- 3
CHAM: Action recognition using convolutional hierarchical attention model
- Computer Science, Psychology
- 2017 IEEE International Conference on Image Processing (ICIP)
- 2017
- 6
- Highly Influenced
- PDF
Recurrent attention network using spatial-temporal relations for action recognition
- Computer Science
- Signal Process.
- 2018
- 15
DTA: Double LSTM with temporal-wise attention network for action recognition
- Computer Science
- 2017 3rd IEEE International Conference on Computer and Communications (ICCC)
- 2017
- 1
Hierarchical Attention Network for Action Recognition in Videos
- Computer Science
- ArXiv
- 2016
- 56
- Highly Influenced
- PDF
References
SHOWING 1-10 OF 41 REFERENCES
Modeling video evolution for action recognition
- Computer Science
- 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- 2015
- 377
- PDF
Describing Videos by Exploiting Temporal Structure
- Computer Science, Mathematics
- 2015 IEEE International Conference on Computer Vision (ICCV)
- 2015
- 762
- PDF
Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos
- Computer Science
- International Journal of Computer Vision
- 2017
- 244
- PDF
DL-SFA: Deeply-Learned Slow Feature Analysis for Action Recognition
- Computer Science
- 2014 IEEE Conference on Computer Vision and Pattern Recognition
- 2014
- 107
- PDF
Beyond short snippets: Deep networks for video classification
- Computer Science
- 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- 2015
- 1,673
- Highly Influential
- PDF
Long-term recurrent convolutional networks for visual recognition and description
- Computer Science, Medicine
- 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- 2015
- 3,523
- PDF