Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension

@article{Kembhavi2017AreYS,
  title={Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension},
  author={Aniruddha Kembhavi and Min Joon Seo and Dustin Schwenk and Jong Hyun Choi and Ali Farhadi and Hannaneh Hajishirzi},
  journal={2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2017},
  pages={5376--5384}
}
We introduce the task of Multi-Modal Machine Comprehension (M3C), which aims at answering multimodal questions given a context of text, diagrams and images. We present the Textbook Question Answering (TQA) dataset that includes 1,076 lessons and 26,260 multi-modal questions, taken from middle school science curricula. Our analysis shows that a significant portion of questions require complex parsing of the text and the diagrams and reasoning, indicating that our dataset is more complex compared…


