• Publications
  • Influence
Unsupervised Monocular Depth Estimation with Left-Right Consistency
TLDR
This paper proposes a novel training objective that enables the convolutional neural network to learn to perform single image depth estimation, despite the absence of ground truth depth data, and produces state of the art results for monocular depth estimation on the KITTI driving dataset.
Digging Into Self-Supervised Monocular Depth Estimation
TLDR
It is shown that a surprisingly simple model, and associated design choices, lead to superior predictions, and together result in both quantitatively and qualitatively improved depth maps compared to competing self-supervised methods.
Semantic object classes in video: A high-definition ground truth database
TLDR
The Cambridge-driving Labeled Video Database (CamVid) is presented as the first collection of videos with object class semantic labels, complete with metadata, and the relevance of the database is evaluated by measuring the performance of an algorithm from each of three distinct domains: multi-class object recognition, pedestrian detection, and label propagation.
Segmentation and Recognition Using Structure from Motion Point Clouds
TLDR
This work proposes an algorithm for semantic segmentation based on 3D point clouds derived from ego-motion that works well on sparse, noisy point clouds, and unlike existing approaches, does not need appearance-based descriptors.
Harmonic Networks: Deep Translation and Rotation Equivariance
TLDR
H-Nets are presented, a CNN exhibiting equivariance to patch-wise translation and 360-rotation, and it is demonstrated that their layers are general enough to be used in conjunction with the latest architectures and techniques, such as deep supervision and batch normalization.
Patch Based Synthesis for Single Depth Image Super-Resolution
TLDR
This work presents an algorithm to synthetically increase the resolution of a solitary depth image using only a generic database of local patches, and shows how important further depth-specific processing, such as noise removal and correct patch normalization, dramatically improves results.
Structured Prediction of Unobserved Voxels from a Single Depth Image
TLDR
This work proposes an algorithm that can complete the unobserved geometry of tabletop-sized objects, based on a supervised model trained on already available volumetric elements, that maps from a local observation in a single depth image to an estimate of the surface shape in the surrounding neighborhood.
Unsupervised Bayesian Detection of Independent Motion in Crowds
TLDR
An unsupervised data driven Bayesian clustering algorithm which has detection of individual entities as its primary goal and can be augmented with subject-specific filtering, but is shown to already be effective at detecting individual entities in crowds of people, insects, and animals.
Self-Supervised Monocular Depth Hints
TLDR
This work studies the problem of ambiguous reprojections in depth-prediction from stereo-based self-supervision, and introduces Depth Hints to alleviate their effects, and produces state-of-the-art depth predictions on the KITTI benchmark.
RAPter: rebuilding man-made scenes with regular arrangements of planes
TLDR
A novel selection formulation to directly balance between data fitting and the simplicity of the resulting arrangement of extracted planes is proposed, which allows less-dominant orientations to still retain their internal regularity, and not become overwhelmed and regularized by the dominant scene orientations.
...
1
2
3
4
5
...