Situation Recognition: Visual Semantic Role Labeling for Image Understanding

@article{Yatskar2016SituationRV,
  title={Situation Recognition: Visual Semantic Role Labeling for Image Understanding},
  author={Mark Yatskar and Luke Zettlemoyer and Ali Farhadi},
  journal={2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2016},
  pages={5534--5542}
}
  • Mark Yatskar, Luke Zettlemoyer, Ali Farhadi
  • Published 2016
  • Computer Science
  • 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
  • This paper introduces situation recognition, the problem of producing a concise summary of the situation an image depicts including: (1) the main activity (e.g., clipping), (2) the participating actors, objects, substances, and locations (e.g., man, shears, sheep, wool, and field) and most importantly (3) the roles these participants play in the activity (e.g., the man is clipping, the shears are his tool, the wool is being clipped from the sheep, and the clipping is in a field). We use…
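The three-part summary in the abstract (activity, participants, roles) can be pictured as a small record that maps role names to participants. The following is only an illustrative sketch: the `Situation` class and the role labels `agent`, `tool`, `item`, `source`, and `place` are hypothetical names chosen here to mirror the clipping example, not the paper's actual data format.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass(frozen=True)
class Situation:
    """Hypothetical container: one activity plus a role -> participant map."""
    activity: str
    roles: Dict[str, str] = field(default_factory=dict)

    def summary(self) -> str:
        # Render the situation as activity(role=participant, ...).
        filled = ", ".join(f"{role}={noun}" for role, noun in self.roles.items())
        return f"{self.activity}({filled})"


# The clipping example from the abstract; role labels are assumed for illustration.
clipping = Situation(
    activity="clipping",
    roles={
        "agent": "man",      # the man is clipping
        "tool": "shears",    # the shears are his tool
        "item": "wool",      # the wool is being clipped
        "source": "sheep",   # ... from the sheep
        "place": "field",    # the clipping is in a field
    },
)

print(clipping.summary())
```

Keeping the roles in a mapping rather than a flat list of objects is what separates this kind of summary from plain object or activity labels: the same participants with different role assignments would describe a different situation.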

    Citations

    Publications citing this paper.
    SHOWING 1-10 OF 114 CITATIONS

    Grounded Situation Recognition

    CITES METHODS & BACKGROUND

    Automatic generation of composite image descriptions

    • Chang Liu, Armin Shmilovici, Mark Last
    • Computer Science
    • 2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)
    • 2017
    CITES METHODS & BACKGROUND

    Mixture-Kernel Graph Attention Network for Situation Recognition

    CITES BACKGROUND, RESULTS & METHODS
    HIGHLY INFLUENCED

    Situation Recognition with Graph Neural Networks

    CITES METHODS & BACKGROUND
    HIGHLY INFLUENCED

    Graph neural network for situation recognition

    CITES BACKGROUND, RESULTS & METHODS
    HIGHLY INFLUENCED

    Semantic Image Retrieval via Active Grounding of Visual Situations

    CITES BACKGROUND

    Interpreting Context of Images Using Scene Graphs

    CITES BACKGROUND

    Disambiguating Visual Verbs

    CITES BACKGROUND

    MovieGraphs: Towards Understanding Human-Centric Situations from Videos

    CITATION STATISTICS

    • 19 Highly Influenced Citations

    • Averaged 28 Citations per year from 2018 through 2020

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 58 REFERENCES

    Visual Semantic Role Labeling

    Microsoft COCO: Common Objects in Context

    Show and tell: A neural image caption generator

    Actions in context

    Grouplet: A structured image representation for recognizing human and object interactions

    • Bangpeng Yao, Li Fei-Fei
    • Computer Science
    • 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition
    • 2010

    What Are You Talking About? Text-to-Image Coreference

    CIDEr: Consensus-based image description evaluation
