VQA: Visual Question Answering

@article{Agrawal2015VQAVQ,
  title={VQA: Visual Question Answering},
  author={Aishwarya Agrawal and Jiasen Lu and Stanislaw Antol and Margaret Mitchell and C. L. Zitnick and Devi Parikh and Dhruv Batra},
  journal={International Journal of Computer Vision},
  year={2015},
  volume={123},
  pages={4-31}
}
We propose the task of free-form and open-ended Visual Question Answering (VQA). Given an image and a natural language question about the image, the task is to provide an accurate natural language answer. Mirroring real-world scenarios, such as helping the visually impaired, both the questions and answers are open-ended. Visual questions selectively target different areas of an image, including background details and underlying context. As a result, a system that succeeds at VQA typically needs… Expand
2,179 Citations
Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
  • 40
  • PDF
Question Relevance in Visual Question Answering
  • 1
  • Highly Influenced
  • PDF
Question Relevance in Visual Question Answering
  • 1
  • Highly Influenced
  • PDF
Proposing Plausible Answers for Open-ended Visual Question Answering
  • 1
  • Highly Influenced
  • PDF
Revisiting Visual Question Answering Baselines
  • 201
  • Highly Influenced
  • PDF
FVQA: Fact-Based Visual Question Answering
  • 144
  • PDF
iVQA: Inverse Visual Question Answering
  • 16
  • Highly Influenced
  • PDF
Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool
  • 7
  • Highly Influenced
  • PDF
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
  • 55
  • PDF
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 76 REFERENCES
Yin and Yang: Balancing and Answering Binary Visual Questions
  • 198
  • PDF
Ask Your Neurons: A Neural-Based Approach to Answering Questions about Images
  • 483
  • PDF
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question
  • 364
  • PDF
MCTest: A Challenge Dataset for the Open-Domain Machine Comprehension of Text
  • 490
  • PDF
Exploring Models and Data for Image Question Answering
  • 472
  • PDF
Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks
  • 804
  • PDF
Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics (Extended Abstract)
  • 777
  • PDF
From captions to visual concepts and back
  • 1,000
  • PDF
Open question answering over curated and extracted knowledge bases
  • 339
  • PDF
Visual Madlibs: Fill in the Blank Description Generation and Question Answering
  • 102
  • PDF
...
1
2
3
4
5
...