VQA: Visual Question Answering
We propose the task of free-form and open-ended Visual Question Answering (VQA). Given an image and a natural language question about the image, the task is to provide an accurate natural language…
  • 1,848
  • 450
VQA: Visual Question Answering
We propose the task of free-form and open-ended Visual Question Answering (VQA). Given an image and a natural language question about the image, the task is to provide an accurate natural language…
  • 145
  • 20
Zero-Shot Learning via Visual Abstraction
One of the main challenges in learning fine-grained visual categories is gathering training images. Recent work in Zero-Shot Learning (ZSL) circumvents this challenge by describing categories via…
  • 68
  • 4
We are Humor Beings: Understanding and Predicting Visual Humor
Humor is an integral part of human lives. Despite being tremendously impactful, it is perhaps surprising that we do not have a detailed understanding of humor yet. As interactions between humans and…
  • 30
  • 1
Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes
We present an approach to simultaneously perform semantic segmentation and prepositional phrase attachment resolution for captioned images. Some ambiguities in language cannot be resolved without…
  • 19
  • 1
Measuring Machine Intelligence Through Visual Question Answering
As machines have become more intelligent, there has been a renewed interest in methods for measuring their intelligence. A common approach is to propose tasks for which a human excels, but one which…
  • 26
Resolving vision and language ambiguities together: Joint segmentation & prepositional attachment resolution in captioned scenes
We present an approach to simultaneously perform semantic segmentation and prepositional phrase attachment resolution for captioned images. Some ambiguities in language cannot be resolved…
  • 2
A New Approach for Measuring Terrain Profiles
This paper presents a new approach for measuring terrain profiles. The proposed approach uses RGB-D sensors to measure terrain surface relative to the vehicle. Since the RGB-D sensor is an area…
  • 2
Grid-Based Scan-to-Map Matching for Accurate Simultaneous Localization and Mapping: Theory and Preliminary Numerical Study
This paper presents a grid-based scan-to-map matching technique for accurate simultaneous localization and mapping (SLAM). At every acquisition of a new scan, the proposed technique estimates the…
  • 2