Author pages are created from data sourced from our academic publisher partnerships and public sources.
Share This Author
Hindsight Experience Replay
- Marcin Andrychowicz, Dwight Crow, +7 authors Wojciech Zaremba
- Computer Science, MathematicsNIPS
- 5 July 2017
A novel technique is presented which allows sample-efficient learning from rewards which are sparse and binary and therefore avoid the need for complicated reward engineering and may be seen as a form of implicit curriculum.
Learning to learn by gradient descent by gradient descent
This paper shows how the design of an optimization algorithm can be cast as a learning problem, allowing the algorithm to learn to exploit structure in the problems of interest in an automatic way.
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
- Matthias Plappert, Marcin Andrychowicz, +9 authors Wojciech Zaremba
- Computer Science, MathematicsArXiv
- 26 February 2018
A suite of challenging continuous control tasks (integrated with OpenAI Gym) based on currently existing robotics hardware and following a Multi-Goal Reinforcement Learning (RL) framework are introduced.
Learning dexterous in-hand manipulation
- Marcin Andrychowicz, Bowen Baker, +13 authors Wojciech Zaremba
- Computer Science, MathematicsInt. J. Robotics Res.
- 1 August 2018
This work uses reinforcement learning (RL) to learn dexterous in-hand manipulation policies that can perform vision-based object reorientation on a physical Shadow Dexterous Hand, and these policies transfer to the physical robot despite being trained entirely in simulation.
Overcoming Exploration in Reinforcement Learning with Demonstrations
- Ashvin Nair, Bob McGrew, Marcin Andrychowicz, Wojciech Zaremba, P. Abbeel
- Computer Science, MathematicsIEEE International Conference on Robotics and…
- 28 September 2017
This work uses demonstrations to overcome the exploration problem and successfully learn to perform long-horizon, multi-step robotics tasks with continuous control such as stacking blocks with a robot arm.
Parameter Space Noise for Exploration
- Matthias Plappert, Rein Houthooft, +6 authors Marcin Andrychowicz
- Computer Science, MathematicsICLR
- 6 June 2017
This work demonstrates that RL with parameter noise learns more efficiently than traditional RL with action space noise and evolutionary strategies individually through experimental comparison of DQN, DDPG, and TRPO on high-dimensional discrete action environments as well as continuous control tasks.
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
- Xue Bin Peng, Marcin Andrychowicz, Wojciech Zaremba, P. Abbeel
- Computer Science, EngineeringIEEE International Conference on Robotics and…
- 18 October 2017
By randomizing the dynamics of the simulator during training, this paper is able to develop policies that are capable of adapting to very different dynamics, including ones that differ significantly from the dynamics on which the policies were trained.
Secure Multiparty Computations on Bitcoin
- Marcin Andrychowicz, Stefan Dziembowski, Daniel Malinowski, Lukasz Mazurek
- Computer ScienceIEEE Symposium on Security and Privacy
- 23 March 2016
The Bit coin system can be used to go beyond the standard "emulation-based" definition of the MPCs, by constructing protocols that link their inputs and the outputs with the real Bit coin transactions.
One-Shot Imitation Learning
A meta-learning framework for achieving one-shot imitation learning, where ideally, robots should be able to learn from very few demonstrations of any given task, and instantly generalize to new situations of the same task, without requiring task-specific engineering.
Solving Rubik's Cube with a Robot Hand
It is demonstrated that models trained only in simulation can be used to solve a manipulation problem of unprecedented complexity on a real robot, made possible by a novel algorithm, which is called automatic domain randomization (ADR), and a robot platform built for machine learning.