Learning local trajectories for high precision robotic tasks: Application to KUKA LBR iiwa Cartesian positioning

Joris Guérin, O. Gibaru, E. Nyiri, Stéphane Thiery
IECON 2016 - 42nd Annual Conference of the IEEE Industrial Electronics Society
To ease the development of robot learning in industry, two conditions need to be fulfilled. Manipulators must be able to learn high-accuracy and high-precision tasks while being safe for workers in the factory. In this paper, we extend previously submitted work which consists of rapid learning of local high-accuracy behaviors. By exploration and regression, linear and quadratic models are learnt for the dynamics and the cost function, respectively. Iterative Linear Quadratic Gaussian Regulator combined…
Adaptive Force Tracking Control of Redundant Robot Manipulator for Task-space
This study considers the problem of controlling a redundant robot manipulator in the task space. We discuss the relation between the task space and joint space, and also introduce the knowledge of…
A Generic Decision Support Tool to Planning and Assignment Problems: Industrial Application & Industry 4.0
Decision support tools are essential to help the management of industrial systems at different levels: strategic to size the system; tactical to plan activities or assign resources; operational to…


Locally optimal control under unknown dynamics with learnt cost function: application to industrial robot positioning
This work proposes a method to learn the cost function directly from the data, in the same way as for the dynamics, which can be defined in terms of any measurable quantity and thus can be chosen more appropriately for the task to be carried out.
Learning to Control a Low-Cost Manipulator using Data-Efficient Reinforcement Learning
It is demonstrated how a low-cost off-the-shelf robotic system can learn closed-loop policies for a stacking task in only a handful of trials, from scratch.
Adaptive Optimal Feedback Control with Learned Internal Dynamics Models
This chapter combines the ILQG framework with learning the forward dynamics for simulated arms, which exhibit large redundancies both in kinematics and in actuation, to demonstrate how the approach can compensate for complex dynamic perturbations in an online fashion.
Learning contact-rich manipulation skills with guided policy search
This paper extends a recently developed policy search method and uses it to learn a range of dynamic manipulation behaviors with highly general policy representations, without using known models or example demonstrations, and shows that this method can acquire fast, fluent behaviors after only minutes of interaction time.
Path Planning for a Robot Manipulator based on Probabilistic Roadmap and Reinforcement Learning
The probabilistic roadmap (PRM) method, which is a popular path planning scheme for a manipulator, can find a collision-free path by connecting the start and goal poses through a roadmap constructed…
Reinforcement learning in robotics: A survey
This article attempts to strengthen the links between the two research communities by providing a survey of work in reinforcement learning for behavior generation in robots, highlighting both key challenges in robot reinforcement learning and notable successes.
H∞ reinforcement learning control of robot manipulators using fuzzy wavelet networks
In this paper, an H∞ reinforcement learning controller based on a fuzzy wavelet network (FWN) is proposed to perform a position-tracking task for a robot manipulator using the actor-critic reinforcement learning control scheme.
A Survey on Policy Search for Robotics
This work classifies model-free methods based on their policy evaluation strategy, policy update strategy, and exploration strategy and presents a unified view on existing algorithms.
Synthesis and stabilization of complex behaviors through online trajectory optimization
We present an online trajectory optimization method and software platform applicable to complex humanoid robots performing challenging tasks such as getting up from an arbitrary pose on the ground…
A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems
  • E. Todorov, Weiwei Li
  • Mathematics
  • Proceedings of the 2005, American Control Conference, 2005.
  • 2005
We present an iterative linear-quadratic-Gaussian method for locally-optimal feedback control of nonlinear stochastic systems subject to control constraints. Previously, similar methods have been…