Two-Stream Convolutional Networks for Action Recognition in Videos

  title={Two-Stream Convolutional Networks for Action Recognition in Videos},
  author={Karen Simonyan and Andrew Zisserman},
We investigate architectures of discriminatively trained deep Convolutional Networks (ConvNets) for action recognition in video. The challenge is to capture the complementary information on appearance from still frames and motion between frames. We also aim to generalise the best performing hand-crafted features within a data-driven learning framework. Our contribution is three-fold. First, we propose a two-stream ConvNet architecture which incorporates spatial and temporal networks. Second, we… CONTINUE READING
Highly Influential
This paper has highly influenced 405 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 2,400 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.


Publications citing this paper.
Showing 1-10 of 1,532 extracted citations

Early Action Prediction by Soft Regression.

IEEE transactions on pattern analysis and machine intelligence • 2018
View 12 Excerpts
Highly Influenced

Human Action Monitoring for Healthcare Based on Deep Learning

IEEE Access • 2018
View 4 Excerpts
Highly Influenced

Multiscale Deep Alternative Neural Network for Large-Scale Video Classification

IEEE Transactions on Multimedia • 2018
View 11 Excerpts
Highly Influenced

Sequential Video VLAD: Training the Aggregation Locally and Temporally

IEEE Transactions on Image Processing • 2018
View 15 Excerpts
Highly Influenced

Temporal Attentive Network for Action Recognition

2018 IEEE International Conference on Multimedia and Expo (ICME) • 2018
View 6 Excerpts
Highly Influenced

Action recognition by saliency-based dense sampling

Neurocomputing • 2017
View 8 Excerpts
Highly Influenced

2,401 Citations

Citations per Year
Semantic Scholar estimates that this publication has 2,401 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 31 references

Large-Scale Video Classification with Convolutional Neural Networks

2014 IEEE Conference on Computer Vision and Pattern Recognition • 2014
View 10 Excerpts
Highly Influenced

On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines

Journal of Machine Learning Research • 2001
View 5 Excerpts
Highly Influenced

Action Recognition with Improved Trajectories

2013 IEEE International Conference on Computer Vision • 2013
View 9 Excerpts
Highly Influenced

HMDB: A large video database for human motion recognition

2011 International Conference on Computer Vision • 2011
View 7 Excerpts
Highly Influenced

Similar Papers

Loading similar papers…