• Publications
  • Influence
Are we ready for autonomous driving? The KITTI vision benchmark suite
TLDR
We take advantage of our autonomous driving platform to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection. Expand
  • 5,511
  • 1236
  • PDF
Vision meets robotics: The KITTI dataset
TLDR
We present a novel dataset captured from a VW station wagon for use in mobile robotics and autonomous driving research. Expand
  • 3,395
  • 708
  • PDF
Skip-Thought Vectors
TLDR
We describe an approach for unsupervised learning of a generic, distributed sentence encoder that can produce highly generic sentence representations that are robust and perform well in practice. Expand
  • 1,654
  • 244
  • PDF
Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books
TLDR
This paper aims to align books to their movie releases in order to provide rich descriptive explanations for visual content that go semantically far beyond the captions available in the current datasets. Expand
  • 897
  • 122
  • PDF
The Role of Context for Object Detection and Semantic Segmentation in the Wild
TLDR
We propose a novel deformable part-based model, which exploits both local context around each candidate detection as well as global context at the level of the scene. Expand
  • 695
  • 121
  • PDF
Efficient Large-Scale Stereo Matching
TLDR
We propose a novel approach to binocular stereo for fast matching of high-resolution images and show that state-of-the-art performance can be achieved with significant speedups. Expand
  • 706
  • 101
  • PDF
Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body Parts
TLDR
We propose a novel approach to i) handle large deformations and partial occlusions in animals (as examples of highly deformable objects), ii) describe them in terms of body parts, and iii) detect them when they are hard to detect (e.g., animals depicted at low resolution). Expand
  • 357
  • 78
  • PDF
Learning to Reweight Examples for Robust Deep Learning
TLDR
We propose a novel meta-learning algorithm that learns to assign weights to training examples based on their gradient directions. Expand
  • 396
  • 76
  • PDF
Monocular 3D Object Detection for Autonomous Driving
TLDR
The goal of this paper is to perform 3D object detection from a single monocular image in the domain of autonomous driving. Expand
  • 433
  • 71
  • PDF
Order-Embeddings of Images and Language
TLDR
We introduce a general method for learning ordered representations, and show how it can be applied to a variety of tasks involving images and language. Expand
  • 367
  • 67
  • PDF
...
1
2
3
4
5
...