Action Recognition using Visual Attention

Abstract

We propose a soft attention based model for the task of action recognition in videos. We use multi-layered Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) units which are deep both spatially and temporally. Our model learns to focus selectively on parts of the video frames and classifies videos after taking a few glimpses. The model… (More)

Topics

Figures and Tables

Sorry, we couldn't extract any figures or tables for this paper.

Statistics

050100201620172018
Citations per Year

153 Citations

Semantic Scholar estimates that this publication has 153 citations based on the available data.

See our FAQ for additional information.

Slides referencing similar topics