Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning
@article{Chung2021BrickbyBrickCC, title={Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning}, author={H. Chung and Jungtaek Kim and Boris Knyazev and Jinhwi Lee and Graham W. Taylor and Jaesik Park and Minsu Cho}, journal={ArXiv}, year={2021}, volume={abs/2110.15481} }
Discovering a solution in a combinatorial space is prevalent in many real-world problems but it is also challenging due to diverse complex constraints and the vast number of possible combinations. To address such a problem, we introduce a novel formulation, combinatorial construction , which requires a building agent to assemble unit primitives (i.e., LEGO bricks) sequentially – every connection between two bricks must follow a fixed rule, while no bricks mutually overlap. To construct a target…
4 Citations
Sequential Brick Assembly with Efficient Constraint Satisfaction
- Computer ScienceArXiv
- 2022
This work addresses the problem of generating a sequence of LEGO brick assembly with high-fidelity structures, satisfying physical constraints between bricks, by employing a U-shaped sparse 3D convolutional network and devise a sampling strategy to determine the next brick position by considering attachable positions under physical constraints.
Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning
- Computer ScienceICML
- 2022
It is found that the combination of large-scale reinforcement learning and graph-based policies – surprisingly without any additional complexity – is an effective recipe for training agents that not only generalize to complex unseen blueprints in a zero-shot manner, but even operate in a reset-free setting without being trained to do so.
Learning to Assemble Geometric Shapes
- Computer ScienceIJCAI
- 2022
This work introduces the more challenging problem of shape assembly, which involves textureless fragments of arbitrary shapes with indistinctive junctions, and proposes a learning-based approach to solving it, and demonstrates the effectiveness on shape assembly tasks with various scenarios.
IKEA-Manual: Seeing Shape Assembly Step by Step
- Business
- 2022
Human-designed visual manuals are crucial components in shape assembly activities. They provide step-by-step guidance on how we should move and connect different parts in a convenient and…
References
SHOWING 1-10 OF 46 REFERENCES
Learning 3D Part Assembly from a Single Image
- Computer ScienceECCV
- 2020
A novel problem, single-image-guided 3D part assembly, along with a learningbased solution, and proposes a two-module pipeline that leverages strong 2D-3D correspondences and assembly-oriented graph message-passing to infer part relationships.
Structured agents for physical construction
- Computer ScienceICML
- 2019
A suite of challenging physical construction tasks inspired by how children play with blocks are introduced, such as matching a target configuration, stacking blocks to connect objects together, and creating shelter-like structures over target objects.
Visual Reinforcement Learning with Imagined Goals
- Computer ScienceNeurIPS
- 2018
An algorithm is proposed that acquires general-purpose skills by combining unsupervised representation learning and reinforcement learning of goal-conditioned policies, efficient enough to learn policies that operate on raw image observations and goals for a real-world robotic system, and substantially outperforms prior techniques.
Relational inductive biases, deep learning, and graph networks
- Computer ScienceArXiv
- 2018
It is argued that combinatorial generalization must be a top priority for AI to achieve human-like abilities, and that structured representations and computations are key to realizing this objective.
Relational inductive bias for physical construction in humans and machines
- Computer Science, PsychologyCogSci
- 2018
This work introduces a deep reinforcement learning agent which uses object- and relation-centric scene and policy representations and shows that these structured representations allow the agent to outperform both humans and more naive approaches, suggesting that relational inductive bias is an important component in solving structured reasoning problems and for building more intelligent, flexible machines.
Synthesizing Programs for Images using Reinforced Adversarial Learning
- Computer ScienceICML
- 2018
SPIRAL is an adversarially trained agent that generates a program which is executed by a graphics engine to interpret and sample images, and a surprising finding is that using the discriminator's output as a reward signal is the key to allow the agent to make meaningful progress at matching the desired output rendering.
Combinatorial 3D Shape Generation via Sequential Assembly
- Computer ScienceArXiv
- 2020
This work proposes a new 3D shape generation algorithm that aims to create a combinatorial configuration from a set of volumetric primitives, and adopts sequential model-based optimization to tackle the exponential growth of feasible combinations in terms of the number of primitives.
SPARE3D: A Dataset for SPAtial REasoning on Three-View Line Drawings
- Computer Science2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
- 2020
The SPARE3D dataset contains three types of 2D-3D reasoning tasks on view consistency, camera pose, and shape generation, with increasing difficulty, and it is shown that although convolutional networks have achieved superhuman performance in many visual learning tasks, their spatial reasoning performance in SPARE2D is almost equal to random guesses.
Pointer Networks
- Computer ScienceNIPS
- 2015
A new neural architecture to learn the conditional probability of an output sequence with elements that are discrete tokens corresponding to positions in an input sequence using a recently proposed mechanism of neural attention, called Ptr-Nets, which improves over sequence-to-sequence with input attention, but also allows it to generalize to variable size output dictionaries.
High-Dimensional Continuous Control Using Generalized Advantage Estimation
- Computer ScienceICLR
- 2016
This work addresses the large number of samples typically required and the difficulty of obtaining stable and steady improvement despite the nonstationarity of the incoming data by using value functions to substantially reduce the variance of policy gradient estimates at the cost of some bias.