KeypointNet: A Large-Scale 3D Keypoint Dataset Aggregated From Numerous Human Annotations
- Yang You, Yujing Lou, Weiming Wang
- Computer ScienceComputer Vision and Pattern Recognition
- 28 February 2020
This work presents KeypointNet: the first large-scale and diverse 3D keypoint dataset that contains 83,231 keypoints and 8,329 3D models from 16 object categories, by leveraging numerous human annotations.
Pointwise Rotation-Invariant Network with Adaptive Sampling and 3D Spherical Voxel Convolution
- Yang You, Yujing Lou, Weiming Wang
- Computer ScienceAAAI Conference on Artificial Intelligence
- 23 November 2018
A brand new point-set learning framework PRIN is proposed, namely, Pointwise Rotation-Invariant Network, focusing on rotation-invariant feature extraction in point clouds analysis, which shows that, on the dataset with randomly rotated point clouds, PRIN demonstrates better performance than state-of-the-art methods without any data augmentation.
PRIN: Pointwise Rotation-Invariant Network
- Yang You, Yujing Lou, Cewu Lu
- Computer ScienceArXiv
- 23 November 2018
This paper proposes a new point-set learning framework named Pointwise Rotation-Invariant Network (PRIN), focusing on achieving rotation-invariance in point clouds, and shows performance better than state-of-the-art methods on part segmentation without data augmentation.
Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes
This work disentangle the direct offset into Local Canonical Coordinates (LCC), box scales and box orientations, which achieves state-of-the-art performance on three standard real-world benchmarks: ScanNet, SceneNN and SUN RGB-D.
PRIN/SPRIN: On Extracting Point-Wise Rotation Invariant Features
- Yang You, Yujing Lou, Cewu Lu
- Computer ScienceIEEE Transactions on Pattern Analysis and Machine…
- 24 February 2021
This paper proposes a brand new point-set learning framework PRIN, namely, Point-wise Rotation Invariant Network, focusing on rotation invariant feature extraction in point clouds analysis, and extends PRIN to a sparse version called SPRIN, which directly operates on sparse point clouds.
Semantic Correspondence via 2D-3D-2D Cycle
- Yang You, Chengkun Li, Weiming Wang
- Computer ScienceArXiv
- 20 April 2020
This paper proposes a new method on predicting semantic correspondences by leveraging it to 3D domain and then project corresponding 3D models back to 2D domain, with their semantic labels.
Fine-grained Object Semantic Understanding from Correspondences
- Yang You, Chengkun Li, Cewu Lu
- Computer ScienceArXiv
- 29 December 2019
This paper proposes a method that outputs dense semantic embeddings based on a novel geodesic consistency loss and shows that it could boost the fine-grained understanding of heterogeneous objects and the inference of dense semantic information is possible.
Human Correspondence Consensus for 3D Object Semantic Understanding
- Yujing Lou, Yang You, Cewu Lu
- Computer ScienceEuropean Conference on Computer Vision
- 29 November 2020
This paper argues that by providing human labeled correspondences between different objects from the same category instead of explicit semantic labels, one can recover rich semantic information of an object and shows that CorresPondenceNet could not only boost fine-grained understanding of heterogeneous objects but also cross-object registration and partial object matching.
Understanding Pixel-Level 2D Image Semantics With 3D Keypoint Knowledge Engine
- Yang You, Chengkun Li, Cewu Lu
- Computer ScienceIEEE Transactions on Pattern Analysis and Machine…
- 13 April 2021
This paper proposes a new method on predicting image corresponding semantics in 3D domain and then projecting them back onto 2D images to achieve pixel-level understanding and shows that this method gives comparative and even superior results on standard semantic benchmarks.
Localization with Sampling-Argmax
- Jiefeng Li, Tong Chen, Ruiqi Shi, Yujing Lou, Yong-Lu Li, Cewu Lu
- Computer ScienceNeural Information Processing Systems
- 17 October 2021
Soft-argmax operation is commonly adopted in detection-based methods to localize the target position in a differentiable manner. However, training the neural network with soft-argmax makes the shape…
...
...