An Analysis of Action Recognition Datasets for Language and Vision Tasks


A large amount of recent research has focused on tasks that combine language and vision, resulting in a proliferation of datasets and methods. One such task is action recognition, whose applications include image annotation, scene understanding and image retrieval. In this survey, we categorize the existing approaches based on how they conceptualize this…
DOI: 10.18653/v1/P17-2011


3 Figures and Tables
