From Recognition to Cognition: Visual Commonsense Reasoning

@article{Zellers2018FromRT,
  title={From Recognition to Cognition: Visual Commonsense Reasoning},
  author={Rowan Zellers and Yonatan Bisk and Ali Farhadi and Yejin Choi},
  journal={CoRR},
  year={2018},
  volume={abs/1811.10830}
}
Visual understanding goes well beyond object recognition. With one glance at an image, we can effortlessly imagine the world beyond the pixels: for instance, we can infer people’s actions, goals, and mental states. While this task is easy for humans, it is tremendously difficult for today’s vision systems, requiring higher-order cognition and commonsense reasoning about the world. We formalize this task as Visual Commonsense Reasoning. Given a challenging question about an image, a machine must… CONTINUE READING
Tweets
This paper has been referenced on Twitter 170 times. VIEW TWEETS

References

Publications referenced by this paper.
SHOWING 1-10 OF 96 REFERENCES

Hadamard Product for Low-rank Bilinear Pooling

  • Jin-Hwa Kim, Kyoung Woon On, Woosang Lim, Jeonghee Kim, Jung-Woo Ha, Byoung-Tak Zhang
  • In The 5th International Conference on Learning…
  • 2017
Highly Influential
5 Excerpts

Similar Papers

Loading similar papers…