Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
- Jia Zheng, Junfei Zhang, Jing Li, Rui Tang, Shenghua Gao, Zihan Zhou
- Computer ScienceEuropean Conference on Computer Vision
- 1 August 2019
This paper presents a new synthetic dataset, Structured3D, with the aim of providing large-scale photo-realistic images with rich 3D structure annotations for a wide spectrum of structured 3D modeling tasks, and takes advantage of the availability of professional interior designs to automatically extract 3D structures from them.
Single-Image Piece-Wise Planar 3D Reconstruction via Associative Embedding
- Zehao Yu, Jia Zheng, Dongze Lian, Zihan Zhou, Shenghua Gao
- Computer ScienceComputer Vision and Pattern Recognition
- 26 February 2019
A novel two-stage method based on associative embedding, inspired by its recent success in instance segmentation, that is able to detect an arbitrary number of planes and facilitate many real-time applications such as visual SLAM and human-robot interaction.
Density Map Regression Guided Detection Network for RGB-D Crowd Counting and Localization
- Dongze Lian, Jing Li, Jia Zheng, Weixin Luo, Shenghua Gao
- Computer ScienceComputer Vision and Pattern Recognition
- 1 June 2019
A regression guided detection network (RDNet) is proposed for RGB-D crowd counting and a depth-aware anchor is designed for better initialization of anchor sizes in detection framework to improve the robustness of detection-based approaches for small/tiny heads.
PPGNet: Learning Point-Pair Graph for Line Segment Detection
- Ziheng Zhang, Zhengxin Li, Shenghua Gao
- Computer ScienceComputer Vision and Pattern Recognition
- 9 May 2019
This paper proposes to describe junctions, line segments and relationships between them with a simple graph, which is more structured and informative than end-point representation used in existing line segment detection methods and introduces the PPGNet, a convolutional neural network that directly infers a graph from an image.
Geometric Structure Based and Regularized Depth Estimation From 360 Indoor Imagery
- Lei Jin, Yanyu Xu, Shenghua Gao
- Computer ScienceComputer Vision and Pattern Recognition
- 1 June 2020
A novel learning-based depth estimation framework that leverages the geometric structure of a scene to conduct depth estimation and demonstrates that the method can be applied to counterfactual depth.
Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image
- Cheng Yang, Jia Zheng, Xili Dai, Rui Tang, Yi Ma, Xiaojun Yuan
- EngineeringIEEE Workshop/Winter Conference on Applications…
- 16 April 2021
This paper employs Convolutional Neural Networks to detect planes and vertical lines between adjacent walls, and optimize the 3D plane parameters to reconstruct a geometrically consistent room layout between planes and lines.
Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild
- Zhao-Heng Yin, Jia Zheng, Weixin Luo, Shenhan Qian, Hanling Zhang, Shenghua Gao
- Computer ScienceComputer Vision and Pattern Recognition
- 18 March 2021
The frame selection problem in the interactive VOS is formulated as a Markov Decision Process, where an agent is learned to recommend the frame under a deep reinforcement learning framework, making the interactive setting more practical in the wild.
Layout-Guided Novel View Synthesis from a Single Indoor Panorama
- Jiale Xu, Jia Zheng, Yanyu Xu, Rui Tang, Shenghua Gao
- Computer ScienceComputer Vision and Pattern Recognition
- 31 March 2021
This paper makes the first attempt to generate novel views from a single indoor panorama and take the large camera translations into consideration and uses Convolutional Neural Networks to extract the deep features and estimate the depth map from the source-view image.
Neural Face Identification in a 2D Wireframe Projection of a Manifold Object
- Kehan Wang, Jia Zheng, Zihan Zhou
- Computer ScienceComputer Vision and Pattern Recognition
- 8 March 2022
This paper approaches the classical problem of face identification from a novel data-driven point of view, and adopts a variant of the popular Transformer model to predict the edges associated with the same face in a natural order.
MINERVAS: Massive INterior EnviRonments VirtuAl Synthesis
- Haocheng Ren, Hao Zhang, H. Bao
- Computer ScienceArXiv
- 13 July 2021
MINERVAS, a Massive INterior EnviRonments VirtuAl Synthesis system, to facilitate the 3D scene modification and the 2D image synthesis for various vision tasks, and empowers users to access commercial scene databases with millions of indoor scenes and protects the copyright of core data assets.
...
...