Corpus ID: 208202327

Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates

  title={Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates},
  author={Diogo Carbonera Luvizon and Hedi Tabia and David Picard},
3D human pose estimation is frequently seen as the task of estimating 3D poses relative to the root body joint. Alternatively, in this paper, we propose a 3D human pose estimation method in camera coordinates, which allows effective combination of 2D annotated data and 3D poses, as well as a straightforward multi-view generalization. To that end, we cast the problem into a different perspective, where 3D poses are predicted in the image plane, in pixels, and the absolute depth is estimated in… Expand

Figures and Tables from this paper


3D Pictorial Structures for Multiple Human Pose Estimation
A novel 3D pictorial structures (3DPS) model is introduced that infers 3D human body configurations from the authors' reduced state space and is generic and applicable to both single and multiple human pose estimation. Expand
Multi-view 3D Human Pose Estimation in Complex Environment
This work introduces a framework for unconstrained 3D human upper body pose estimation from multiple camera views in complex environment and demonstrates that this approach outperforms the state-of-the-art in experiments with large and challenging real-world data from an outdoor setting. Expand
Learning Monocular 3D Human Pose Estimation from Multi-view Images
This paper trains the system to predict the same pose in all views, and proposes a method to estimate camera pose jointly with human pose, which lets us utilize multiview footage where calibration is difficult, e.g., for pan-tilt or moving handheld cameras. Expand
Exploiting Temporal Information for 3D Human Pose Estimation
A sequence-to-sequence network composed of layer-normalized LSTM units with shortcut connections connecting the input to the output on the decoder side and imposed temporal smoothness constraint during training is designed, which helps the network to recover temporally consistent 3D poses over a sequence of images even when the 2D pose detector fails. Expand
Self-Supervised Learning of 3D Human Pose Using Multi-View Geometry
EpipolarPose is presented, a self-supervised learning method for 3D human pose estimation, which does not need any 3D ground-truth data or camera extrinsics, and a new performance measure Pose Structure Score (PSS) which is a scale invariant, structure aware measure to evaluate the structural plausibility of a pose with respect to its ground truth. Expand
Multi-view Pictorial Structures for 3D Human Pose Estimation
This paper proposes a multi-view pictorial structures model that builds on recent advances in 2D pose estimation and incorporates evidence across multiple viewpoints to allow for robust 3D poses estimation. Expand
3D Human pose estimation: A review of the literature and analysis of covariates
An extensive experimental evaluation of state-of-the-art approaches in a synthetic dataset created specifically for 3D human pose estimation, which along with its ground truth is made publicly available for research purposes. Expand
3D Human Pose Estimation = 2D Pose Estimation + Matching
This work explores 3D human pose estimation from a single RGB image using a simple architecture that reasons through intermediate 2D pose predictions, and demonstrates how this architecture is straightforward to implement with off-the-shelf2D pose estimation systems and 3D mocap libraries. Expand
Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video
This paper addresses the challenge of 3D full-body human pose estimation from a monocular image sequence with a novel approach that integrates a sparsity-driven 3D geometric prior and temporal smoothness and outperforms a publicly available 2D pose estimation baseline on the challenging PennAction dataset. Expand
A generalizable approach for multi-view 3D human pose regression
This work proposes a two-stage approach to detect and estimate 3D human poses, which separates SV pose detection from MV 3D pose estimation and demonstrates that incorporating the detector's characteristics is important to build a robust 3D regression model and that the resulting regression model generalizes well to new MV environments. Expand