The Promise of Premise: Harnessing Question Premises in Visual Question Answering

@inproceedings{Mahendru2017ThePO,
  title={The Promise of Premise: Harnessing Question Premises in Visual Question Answering},
  author={Aroma Mahendru and Viraj Prabhu and Akrit Mohapatra and Dhruv Batra and Stefan Lee},
  booktitle={EMNLP},
  year={2017}
}
In this paper, we make a simple observation that questions about images often contain premises – objects and relationships implied by the question – and that reasoning about premises can help Visual Question Answering (VQA) models respond more intelligently to irrelevant or previously unseen questions. When presented with a question that is irrelevant to an image, state-of-the-art VQA models will still answer purely based on learned language biases, resulting in nonsensical or even misleading… CONTINUE READING
6 Citations
27 References
Similar Papers

References

Publications referenced by this paper.
Showing 1-10 of 27 references

Similar Papers

Loading similar papers…