Sliding Shapes for 3D Object Detection in Depth Images

@inproceedings{Song2014SlidingSF,
  title={Sliding Shapes for 3D Object Detection in Depth Images},
  author={Shuran Song and J. Xiao},
  booktitle={ECCV},
  year={2014}
}
The depth information of RGB-D sensors has greatly simplified some common challenges in computer vision and enabled breakthroughs for several tasks. [...] Key Method We take a collection of 3D CAD models and render each CAD model from hundreds of viewpoints to obtain synthetic depth maps. For each depth rendering, we extract features from the 3D point cloud and train an Exemplar-SVM classifier. During testing and hard-negative mining, we slide a 3D detection window in 3D space. Experiment results show that our…Expand
Single Multi-feature detector for Amodal 3D Object Detection in RGB-D Images
TLDR
This paper proposes a single end-to-end framework based on the deep neural networks which hierarchically incorporates appearance and geometric features from 2.5D representation to 3D objects for fast and high-accuracy amodal 3D object detections in RGB-D images. Expand
2D-Driven 3D Object Detection in RGB-D Images
TLDR
The approach makes best use of the 2D information to quickly reduce the search space in 3D, benefiting from state-of-the-art 2D object detection techniques. Expand
2 D-Driven 3 D Object Detection in RGB-D Images
In this paper, we present a technique that places 3D bounding boxes around objects in an RGB-D scene. Our approach makes best use of the 2D information to quickly reduce the search space in 3D,Expand
3D Object Detection Incorporating Instance Segmentation and Image Restoration
TLDR
A 3D object detection approach based on instance segmentation and image restoration based on the Criminisi Algorithm that improves the average precision score compared with the F-PointNet method. Expand
Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images
  • Shuran Song, J. Xiao
  • Computer Science
  • 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
  • 2016
TLDR
This work proposes the first 3D Region Proposal Network (RPN) to learn objectness from geometric shapes and the first joint Object Recognition Network (ORN) to extract geometric features in 3D and color features in 2D. Expand
3D object detection: Learning 3D bounding boxes from scaled down 2D bounding boxes in RGB-D images
TLDR
An efficient 3D object detection system that can predict object location, size, and orientation is presented and outperforms the state-of-the-art detection methods by a remarkable margin with faster detection time. Expand
Geometry-Based Region Proposals for Real-Time Robot Detection of Tabletop Objects
We present a novel object detection pipeline for localization and recognition in three dimensional environments. Our approach makes use of an RGB-D sensor and combines state-of-the-art techniquesExpand
Exploiting Depth From Single Monocular Images for Object Detection and Semantic Segmentation
TLDR
This paper exploits the recent success of depth estimation from monocular images and learns a deep depth estimation model, and proposes an RGB-D semantic segmentation method, which applies a multi-task training scheme: semantic label prediction and depth value regression. Expand
Frustum PointNets for 3D Object Detection from RGB-D Data
TLDR
This work directly operates on raw point clouds by popping up RGBD scans and leverages both mature 2D object detectors and advanced 3D deep learning for object localization, achieving efficiency as well as high recall for even small objects. Expand
RGB-D image-based Object Detection: from Traditional Methods to Deep Learning Techniques
TLDR
Deep learning techniques have now revolutionized the field of computer vision, including RGB-D object detection, achieving an unprecedented level of performance. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 98 REFERENCES
RGB-(D) scene labeling: Features and algorithms
TLDR
The main objective is to empirically understand the promises and challenges of scene labeling with RGB-D and adapt the framework of kernel descriptors that converts local similarities (kernels) to patch descriptors to capture appearance (RGB) and shape (D) similarities. Expand
Convolutional-Recursive Deep Learning for 3D Object Classification
TLDR
This work introduces a model based on a combination of convolutional and recursive neural networks (CNN and RNN) for learning features and classifying RGB-D images, which obtains state of the art performance on a standardRGB-D object dataset while being more accurate and faster during training and testing than comparable architectures such as two-layer CNNs. Expand
A learned feature descriptor for object recognition in RGB-D data
TLDR
A new, learned, local feature descriptor for RGB-D images, the convolutional k-means descriptor, which automatically learns feature responses in the neighborhood of detected interest points and is able to combine all available information, such as color and depth into one, concise representation. Expand
Depth kernel descriptors for object recognition
TLDR
A set of kernel features on depth images that model size, 3D shape, and depth edges in a single framework that significantly improve the capabilities of depth and RGB-D (color+depth) recognition, achieving 10–15% improvement in accuracy over the state of the art. Expand
Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images
TLDR
This work proposes algorithms for object boundary detection and hierarchical segmentation that generalize the gPb-ucm approach of [2] by making effective use of depth information and shows how this contextual information in turn improves object recognition. Expand
A textured object recognition pipeline for color and depth image data
We present an object recognition system which leverages the additional sensing and calibration information available in a robotics setting together with large amounts of training data to build highExpand
Accurate Localization of 3D Objects from RGB-D Data Using Segmentation Hypotheses
TLDR
A novel framework is proposed that explores the compatibility between segmentation hypotheses of the object in the image and the corresponding 3D map using a generalization of the structural latent SVM formulation in 3D as well as the definition of a new loss function defined over the 3D space in training. Expand
Holistic Scene Understanding for 3D Object Detection with RGBD Cameras
TLDR
A holistic approach that exploits 2D segmentation, 3D geometry, as well as contextual relations between scenes and objects, and develops a conditional random field to integrate information from different sources to classify the cuboids is proposed. Expand
Semantic Labeling of 3D Point Clouds for Indoor Scenes
TLDR
This paper proposes a graphical model that captures various features and contextual relations, including the local visual appearance and shape cues, object co-occurence relationships and geometric relationships, and applies these algorithms successfully on a mobile robot for the task of finding objects in large cluttered rooms. Expand
Hough Transform and 3D SURF for Robust Three Dimensional Classification
TLDR
A new robust 3D shape classification method is proposed, which extends a robust 2D feature descriptor, SURF, to be used in the context of 3D shapes and shows how3D shape class recognition can be improved by probabilistic Hough transform based methods, already popular in 2D. Expand
...
1
2
3
4
5
...