Watch-n-patch: Unsupervised understanding of actions and relations


We focus on modeling human activities comprising multiple actions in a completely unsupervised setting. Our model learns the high-level action co-occurrence and temporal relations between the actions in the activity video. We consider the video as a sequence of short-term action clips, called action-words, and an activity is about a set of action-topics… (More)
DOI: 10.1109/CVPR.2015.7299065

10 Figures and Tables



Citations per Year

61 Citations

Semantic Scholar estimates that this publication has 61 citations based on the available data.

See our FAQ for additional information.

  • Presentations referencing similar topics