• Corpus ID: 238259392

InfiniteForm: A synthetic, minimal bias dataset for fitness applications

  title={InfiniteForm: A synthetic, minimal bias dataset for fitness applications},
  author={Andrew J. Weitz and Lina Colucci and Sidney R Primas and Brinnae Bent},
The growing popularity of remote fitness has increased the demand for highly accurate computer vision models that track human poses. However, the best methods still fail in many real-world fitness scenarios, suggesting that there is a domain gap between current datasets and real-world fitness data. To enable the field to address fitness-specific vision problems, we created InfiniteForm – an opensource synthetic dataset of 60k images with diverse fitness poses (15 categories), both singleand… 

Figures from this paper

Accurate 3D Body Shape Regression using Metric and Semantic Attributes
This is the first demonstration that 3D body shape regression from images can be trained from easy-to-obtain anthropometric measurements and linguistic shape attributes.


AGORA: Avatars in Geography Optimized for Regression Analysis
This work introduces AGORA, a synthetic dataset with high realism and highly accurate ground truth, and evaluates existing state-of-the-art methods for 3D human pose estimation on this dataset, finding that most methods perform poorly on images of children.
2D Human Pose Estimation: New Benchmark and State of the Art Analysis
A novel benchmark "MPII Human Pose" is introduced that makes a significant advance in terms of diversity and difficulty, a contribution that is required for future developments in human body models.
Learning from Synthetic Humans
This work presents SURREAL (Synthetic hUmans foR REAL tasks): a new large-scale dataset with synthetically-generated but realistic images of people rendered from 3D sequences of human motion capture data and shows that CNNs trained on this synthetic dataset allow for accurate human depth estimation and human part segmentation in real RGB images.
VIBE: Video Inference for Human Body Pose and Shape Estimation
This work defines a novel temporal network architecture with a self-attention mechanism and shows that adversarial training, at the sequence level, produces kinematically plausible motion sequences without in-the-wild ground-truth 3D labels.
Yoga-82: A New Dataset for Fine-grained Classification of Human Poses
This work presents the classification accuracy of the state-of-the-art convolutional neural network architectures on Yoga-82, a dataset for large-scale yoga pose recognition with 82 classes and provides hierarchical labels for yoga poses based on the body configuration of the pose.
DensePose: Dense Human Pose Estimation in the Wild
This work establishes dense correspondences between an RGB image and a surface-based representation of the human body, a task referred to as dense human pose estimation, and improves accuracy through cascading, obtaining a system that delivers highly-accurate results at multiple frames per second on a single gpu.
Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation
A new annotated database of challenging consumer images is introduced, an order of magnitude larger than currently available datasets, and over 50% relative improvement in pose estimation accuracy over a state-of-the-art method is demonstrated.
Pose Trainer: Correcting Exercise Posture using Pose Estimation
Pose Trainer is introduced, an application that detects the user's exercise pose and provides personalized, detailed recommendations on how the user can improve their form, using the state of the art in pose estimation.
Expressive Body Capture: 3D Hands, Face, and Body From a Single Image
This work uses the new method, SMPLify-X, to fit SMPL-X to both controlled images and images in the wild, and evaluates 3D accuracy on a new curated dataset comprising 100 images with pseudo ground-truth.
Microsoft COCO: Common Objects in Context
We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene