Seeing What You're Told: Sentence-Guided Activity Recognition in Video

  title={Seeing What You're Told: Sentence-Guided Activity Recognition in Video},
  author={Siddharth Narayanaswamy and Andrei Barbu and Jeffrey Mark Siskind},
  journal={2014 IEEE Conference on Computer Vision and Pattern Recognition},
We present a system that demonstrates how the compositional structure of events, in concert with the compositional structure of language, can interplay with the underlying focusing mechanisms in video action recognition, providing a medium for top-down and bottom-up integration as well as multi-modal integration between vision and language. We show how the roles played by participants (nouns), their characteristics (adjectives), the actions performed (verbs), the manner of such actions (adverbs… CONTINUE READING
Highly Cited
This paper has 24 citations. REVIEW CITATIONS
Recent Discussions
This paper has been referenced on Twitter 3 times over the past 90 days. VIEW TWEETS


Publications citing this paper.
Showing 1-10 of 19 extracted citations

Similar Papers

Loading similar papers…