Explaining the Ambiguity of Object Detection and 6D Pose From Visual Data

  title={Explaining the Ambiguity of Object Detection and 6D Pose From Visual Data},
  author={Fabian Manhardt and Diego Martin Arroyo and C. Rupprecht and Benjamin Busam and Nassir Navab and Federico Tombari},
  journal={2019 IEEE/CVF International Conference on Computer Vision (ICCV)},
3D object detection and pose estimation from a single image are two inherently ambiguous problems. Oftentimes, objects appear similar from different viewpoints due to shape symmetries, occlusion and repetitive textures. This ambiguity in both detection and pose estimation means that an object instance can be perfectly described by several different poses and even classes. In this work we propose to explicitly deal with these ambiguities. For each object instance we predict multiple 6D pose… 

Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation

This work devise a network to reconstruct the three rotation axis primitive images of a target object and predict the underlying uncertainty along each primitive axis and optimize multi-object poses using visual measurements and camera poses by treating it as an object SLAM problem.

Unseen Object 6D Pose Estimation: A Benchmark and Baselines

A new task that enables and facilitates algorithms to estimate the 6D pose estimation of novel objects during testing and proposes a new metric named Infimum ADD (IADD) which is an invariant measurement for objects with different types of pose ambiguity.

On Object Symmetries and 6D Pose Estimation from Images

It is explained why symmetrical objects can be a challenge when training machine learning algorithms that aim at estimating their 6D pose from images, and an efficient and simple solution that relies on the normalization of the pose rotation is proposed.

Ki-Pode: Keypoint-based Implicit Pose Distribution Estimation of Rigid Objects

A novel method for estimating a probability density function over 6D object poses is proposed, formulated implicitly using keypoints as an intermediary object representation which ensures a high expressiveness of the distribution as well as a high level of interpretability of the estimates.

EPOS: Estimating 6D Pose of Objects With Symmetries

A new method for estimating the 6D pose of rigid objects with available 3D models from a single RGB input image using a robust and efficient variant of the PnP-RANSAC algorithm, which outperforms all RGB and most RGB-D and D methods on the T-LESS and LM-O datasets.

Learning Implicit Probability Distribution Functions for Symmetric Orientation Estimation from RGB Images Without Pose Labels

This work designs a two-stage pipeline consisting of point cloud registration and render-and-compare validation to generate multiple symmetrical pseudo-ground-truth pose labels for each image, and introduces an automatic pose labeling scheme.

Robust 6D Object Pose Estimation by Learning RGB-D Features

This work proposes a novel discrete- continuous formulation for rotation regression to resolve this local-optimum problem, and shows that the proposed method outperforms state-of-the-art approaches.

CPS: Class-level 6D Pose and Shape Estimation From Monocular Images

This paper proposes the first deep learning approach for class-wise monocular 6D pose estimation, coupled with metric shape retrieval, and proposes a new loss formulation which directly optimizes over all parameters, i.e. 3D orientation, translation, scale and shape at the same time.

Pose Ambiguity Elimination Algorithm for 3C Components Assembly Pose Estimation in Point Cloud

A pose ambiguity elimination algorithm based on PCA (Principal Component Analysis) and 2D image template matching is proposed and has higher accuracy than traditional methods and the efficiency meets the need.

Learning Orientation Distributions for Object Pose Estimation

This work proposes two learned methods for estimating a distribution over an object’s orientation that give the best performance on objects with unknown symmetries, accurately modeling both symmetric and non-symmetric objects, without any requirement of symmetry annotation.



On Evaluation of 6D Object Pose Estimation

The paper defines 6D object pose estimation problems, proposes an evaluation methodology and introduces three new pose error functions that deal with pose ambiguity, compared with functions commonly used in the literature and shown to remove certain types of non-intuitive outcomes.

Learning 6D Object Pose Estimation Using 3D Object Coordinates

This work addresses the problem of estimating the 6D Pose of specific objects from a single RGB-D image by presenting a learned, intermediate representation in form of a dense 3D object coordinate labelling paired with a dense class labelling.

Uncertainty-Driven 6D Pose Estimation of Objects and Scenes from a Single RGB Image

A regularized, auto-context regression framework is developed which iteratively reduces uncertainty in object coordinate and object label predictions and an efficient way to marginalize object coordinate distributions over depth is introduced to deal with missing depth information.

Model Based Training, Detection and Pose Estimation of Texture-Less 3D Objects in Heavily Cluttered Scenes

A framework for automatic modeling, detection, and tracking of 3D objects with a Kinect and shows how to build the templates automatically from 3D models, and how to estimate the 6 degrees-of-freedom pose accurately and in real-time.

Detection and fine 3D pose estimation of texture-less objects in RGB-D images

Experimental evaluation shows that the proposed method yields a recognition rate comparable to the state of the art, while its complexity is sub-linear in the number of templates.

Learning descriptors for object recognition and 3D pose estimation

  • Paul WohlhartV. Lepetit
  • Computer Science
    2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
  • 2015
This work introduces a simple but powerful approach to computing descriptors for object views that efficiently capture both the object identity and 3D pose, and trains a Convolutional Neural Network to compute these descriptors by enforcing simple similarity and dissimilarity constraints between the descriptors.

DeepIM: Deep Iterative Matching for 6D Pose Estimation

A novel deep neural network for 6D pose matching named DeepIM is proposed, trained to predict a relative pose transformation using a disentangled representation of 3D location and 3D orientation and an iterative training process.

PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes

This work introduces PoseCNN, a new Convolutional Neural Network for 6D object pose estimation, which is highly robust to occlusions, can handle symmetric objects, and provide accurate pose estimation using only color images as input.

Deep-6DPose: Recovering 6D Object Pose from a Single RGB Image

An end-toend deep learning framework that jointly detects, segments, and most importantly recovers 6D poses of object instances from a single RGB image, and is considerably faster than competing multi-stage methods, offers an inference speed of 10 fps that is well suited for robotic applications.

BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects without Using Depth

  • Mahdi RadV. Lepetit
  • Computer Science
    2017 IEEE International Conference on Computer Vision (ICCV)
  • 2017
A novel method for 3D object detection and pose estimation from color images only that uses segmentation to detect the objects of interest in 2D even in presence of partial occlusions and cluttered background and is the first to report results on the Occlusion dataset using color imagesonly.