• Corpus ID: 17636376

Geometry-Based Region Proposals for Real-Time Robot Detection of Tabletop Objects

  title={Geometry-Based Region Proposals for Real-Time Robot Detection of Tabletop Objects},
  author={Alexander Broad and Brenna Argall},
We present a novel object detection pipeline for localization and recognition in three dimensional environments. Our approach makes use of an RGB-D sensor and combines state-of-the-art techniques from the robotics and computer vision communities to create a robust, real-time detection system. We focus specifically on solving the object detection problem for tabletop scenes, a common environment for assistive manipulators. Our detection pipeline locates objects in a point cloud representation of… 

Figures and Tables from this paper


Sliding Shapes for 3D Object Detection in Depth Images
This paper proposes to use depth maps for object detection and design a 3D detector to overcome the major difficulties for recognition, namely the variations of texture, illumination, shape, viewpoint, clutter, occlusion, self-occlusion and sensor noises.
A textured object recognition pipeline for color and depth image data
We present an object recognition system which leverages the additional sensing and calibration information available in a robotics setting together with large amounts of training data to build high
Learning Rich Features from RGB-D Images for Object Detection and Segmentation
A new geocentric embedding is proposed for depth images that encodes height above ground and angle with gravity for each pixel in addition to the horizontal disparity to facilitate the use of perception in fields like robotics.
A large-scale hierarchical multi-view RGB-D object dataset
A large-scale, hierarchical multi-view object dataset collected using anRGB-D camera is introduced and techniques for RGB-D based object recognition and detection are introduced, demonstrating that combining color and depth information substantially improves quality of results.
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.
3D Object Proposals for Accurate Object Class Detection
This method exploits stereo imagery to place proposals in the form of 3D bounding boxes in the context of autonomous driving and outperforms all existing results on all three KITTI object classes.
Convolutional nets and watershed cuts for real-time semantic Labeling of RGBD videos
An efficient video segmentation approach that computes temporally consistent pixels in a causal manner is proposed, filling the need for causal and real-time applications.
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
This paper proposes a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012 -- achieving a mAP of 53.3%.
Monocular SLAM Supported Object Recognition
In this work, we develop a monocular SLAM-aware object recognition system that is able to achieve considerably stronger recognition performance, as compared to classical object recognition systems
Close-range scene segmentation and reconstruction of 3D point cloud maps for mobile manipulation in domestic environments
A framework for 3D geometric shape segmentation for close-range scenes used in mobile manipulation and grasping, out of sensed point cloud data is presented and a robust geometric mapping pipeline for large input datasets is proposed.