Tripping through time: Efficient Localization of Activities in Videos
@article{Hahn2020TrippingTT, title={Tripping through time: Efficient Localization of Activities in Videos}, author={Meera Hahn and Asim Kadav and James M. Rehg and H. Graf}, journal={ArXiv}, year={2020}, volume={abs/1904.09936} }
Localizing moments in untrimmed videos via language queries is a new and interesting task that requires the ability to accurately ground language into video. [...] Key Method Furthermore, TripNet uses reinforcement learning to efficiently localize relevant activity clips in long videos, by learning how to intelligently skip around the video. It extracts visual features for fewer frames to perform activity classification. In our evaluation over Charades-STA, ActivityNet Captions and the TACoS dataset, we find…Expand Abstract
Figures, Tables, and Topics from this paper
19 Citations
Fine-grained Iterative Attention Network for Temporal Language Localization in Videos
- Computer Science
- ACM Multimedia
- 2020
- 1
- PDF
A Survey of Temporal Activity Localization via Language in Untrimmed Videos
- Computer Science
- 2020 International Conference on Culture-oriented Science & Technology (ICCST)
- 2020
Local-Global Video-Text Interactions for Temporal Grounding
- Computer Science
- 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
- 2020
- 7
- PDF
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
- Computer Science
- 2020 IEEE Winter Conference on Applications of Computer Vision (WACV)
- 2020
- 15
- PDF
DORi: Discovering Object Relationship for Moment Localization of a Natural-Language Query in Video
- Computer Science
- ArXiv
- 2020
- 1
- PDF
Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language
- Computer Science
- AAAI
- 2020
- 19
- PDF
Weakly-Supervised Multi-Level Attentional Reconstruction Network for Grounding Textual Queries in Videos
- Computer Science
- ArXiv
- 2020
- 2
- PDF
References
SHOWING 1-10 OF 64 REFERENCES
TALL: Temporal Activity Localization via Language Query
- Computer Science
- 2017 IEEE International Conference on Computer Vision (ICCV)
- 2017
- 152
- Highly Influential
- PDF
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
- Computer Science
- 2020 IEEE Winter Conference on Applications of Computer Vision (WACV)
- 2020
- 15
- PDF
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs
- Computer Science
- 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- 2016
- 450
- PDF
Localizing Moments in Video with Natural Language
- Computer Science
- 2017 IEEE International Conference on Computer Vision (ICCV)
- 2017
- 176
- Highly Influential
- PDF
Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos
- Computer Science
- AAAI
- 2019
- 25
- PDF
Action Search: Spotting Actions in Videos and Its Application to Temporal Action Localization
- Computer Science
- ECCV
- 2018
- 44
- PDF
To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression
- Computer Science
- AAAI
- 2019
- 48
- PDF
MAC: Mining Activity Concepts for Language-Based Temporal Localization
- Computer Science
- 2019 IEEE Winter Conference on Applications of Computer Vision (WACV)
- 2019
- 43
- PDF
Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model
- Computer Science
- 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
- 2019
- 35
- PDF
Temporal Context Network for Activity Localization in Videos
- Computer Science
- 2017 IEEE International Conference on Computer Vision (ICCV)
- 2017
- 147
- PDF