Commonly Uncommon: Semantic Sparsity in Situation Recognition

@article{Yatskar2016CommonlyUS,
  title={Commonly Uncommon: Semantic Sparsity in Situation Recognition},
  author={Mark Yatskar and Vicente Ordonez and Luke Zettlemoyer and Ali Farhadi},
  journal={2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2016},
  pages={6335-6344}
}
Semantic sparsity is a common challenge in structured visual classification problems, when the output space is complex, the vast majority of the possible predictions are rarely, if ever, seen in the training set. This paper studies semantic sparsity in situation recognition, the task of producing structured summaries of what is happening in images, including activities, objects and the roles objects play within the activity. For this problem, we find empirically that most substructures required… CONTINUE READING
5
Twitter Mentions

Citations

Publications citing this paper.
SHOWING 1-10 OF 16 CITATIONS

Automatic generation of composite image descriptions

VIEW 8 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Recurrent Models for Situation Recognition

VIEW 9 EXCERPTS
CITES METHODS, BACKGROUND & RESULTS
HIGHLY INFLUENCED

Situation Recognition with Graph Neural Networks

VIEW 6 EXCERPTS
CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

Graph neural network for situation recognition

  • 2019
VIEW 4 EXCERPTS
CITES METHODS, BACKGROUND & RESULTS
HIGHLY INFLUENCED

Mixture-Kernel Graph Attention Network for Situation Recognition

VIEW 4 EXCERPTS
CITES METHODS, RESULTS & BACKGROUND
HIGHLY INFLUENCED

Cross-Task Weakly Supervised Learning From Instructional Videos

VIEW 1 EXCERPT
CITES METHODS

LaSO: Label-Set Operations Networks for Multi-Label Few-Shot Learning

VIEW 1 EXCERPT
CITES BACKGROUND

References

Publications referenced by this paper.
SHOWING 1-10 OF 46 REFERENCES

Situation Recognition: Visual Semantic Role Labeling for Image Understanding

VIEW 10 EXCERPTS

Grounded Semantic Role Labeling

VIEW 1 EXCERPT

Bilinear CNN Models for Fine-Grained Visual Recognition

VIEW 1 EXCERPT

Compact Bilinear Pooling

VIEW 1 EXCERPT

Neural Module Networks

VIEW 1 EXCERPT