Learning to walk with prior knowledge


In this work a novel approach to Transfer Learning for the use in Deep Reinforcement Learning is introduced. The agent is realized as an actor-critic framework, namely the Deep Deterministic Policy Gradient algorithm. The Q-function and the policy are represented as deep feed-forward networks, that are trained by minimizing the mean squared Bellman error… (More)
DOI: 10.1109/AIM.2017.8014209


4 Figures and Tables

Slides referencing similar topics