Geometry-Based Region Proposals for Real-Time Robot Detection of Tabletop Objects
@article{Broad2017GeometryBasedRP, title={Geometry-Based Region Proposals for Real-Time Robot Detection of Tabletop Objects}, author={Alexander Broad and Brenna Argall}, journal={ArXiv}, year={2017}, volume={abs/1703.04665} }
We present a novel object detection pipeline for localization and recognition in three-dimensional environments. Our approach makes use of an RGB-D sensor and combines state-of-the-art techniques from the robotics and computer vision communities to create a robust, real-time detection system. We focus specifically on solving the object detection problem for tabletop scenes, a common environment for assistive manipulators. Our detection pipeline locates objects in a point cloud representation of…
References
Sliding Shapes for 3D Object Detection in Depth Images
- Computer Science, ECCV
- 2014
This paper proposes to use depth maps for object detection and designs a 3D detector to overcome the major difficulties for recognition, namely variations in texture, illumination, shape, viewpoint, clutter, occlusion, self-occlusion and sensor noise.
A textured object recognition pipeline for color and depth image data
- Computer Science, 2012 IEEE International Conference on Robotics and Automation
- 2012
We present an object recognition system which leverages the additional sensing and calibration information available in a robotics setting together with large amounts of training data to build high…
Learning Rich Features from RGB-D Images for Object Detection and Segmentation
- Computer Science, ECCV
- 2014
A new geocentric embedding is proposed for depth images that encodes height above ground and angle with gravity for each pixel in addition to the horizontal disparity to facilitate the use of perception in fields like robotics.
A large-scale hierarchical multi-view RGB-D object dataset
- Computer Science, 2011 IEEE International Conference on Robotics and Automation
- 2011
A large-scale, hierarchical multi-view object dataset collected using an RGB-D camera is introduced, along with techniques for RGB-D based object recognition and detection, demonstrating that combining color and depth information substantially improves quality of results.
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
- Computer Science, IEEE Transactions on Pattern Analysis and Machine Intelligence
- 2015
This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals, and further merges RPN and Fast R-CNN into a single network by sharing their convolutional features.
3D Object Proposals for Accurate Object Class Detection
- Computer Science, NIPS
- 2015
This method exploits stereo imagery to place proposals in the form of 3D bounding boxes in the context of autonomous driving and outperforms all existing results on all three KITTI object classes.
Convolutional nets and watershed cuts for real-time semantic labeling of RGBD videos
- Computer Science, J. Mach. Learn. Res.
- 2014
An efficient video segmentation approach that computes temporally consistent pixels in a causal manner is proposed, filling the need for causal and real-time applications.
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
- Computer Science, 2014 IEEE Conference on Computer Vision and Pattern Recognition
- 2014
This paper proposes a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012 -- achieving a mAP of 53.3%.
Monocular SLAM Supported Object Recognition
- Computer Science, Robotics: Science and Systems
- 2015
In this work, we develop a monocular SLAM-aware object recognition system that is able to achieve considerably stronger recognition performance, as compared to classical object recognition systems…
Close-range scene segmentation and reconstruction of 3D point cloud maps for mobile manipulation in domestic environments
- Computer Science, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems
- 2009
A framework is presented for 3D geometric shape segmentation of close-range scenes from sensed point cloud data, for use in mobile manipulation and grasping, and a robust geometric mapping pipeline for large input datasets is proposed.