Corpus ID: 14273320

PILCO: A Model-Based and Data-Efficient Approach to Policy Search

@inproceedings{Deisenroth2011PILCOAM,
  title={PILCO: A Model-Based and Data-Efficient Approach to Policy Search},
  author={M. Deisenroth and C. Rasmussen},
  booktitle={ICML},
  year={2011}
}
  • M. Deisenroth, C. Rasmussen
  • Published in ICML 2011
  • Computer Science
  • In this paper, we introduce PILCO, a practical, data-efficient model-based policy search method. [...] Key Method Policy evaluation is performed in closed form using state-of-the-art approximate inference. Furthermore, policy gradients are computed analytically for policy improvement. We report unprecedented learning efficiency on challenging and high-dimensional control tasks.Expand Abstract
    935 Citations

    Figures, Tables, and Topics from this paper

    Explore Further: Topics Discussed in This Paper

    Gradient-Aware Model-based Policy Search
    • 3
    • Highly Influenced
    • PDF
    Model-Based Reinforcement Learning via Proximal Policy Optimization
    Data-Efficient Reinforcement Learning in Continuous-State POMDPs
    • 7
    • PDF
    Uncertainty-aware Model-based Policy Optimization
    • 5
    • PDF
    SAMBA: Safe Model-Based & Active Reinforcement Learning
    • 3
    • Highly Influenced
    • PDF
    Data-Efficient Reinforcement Learning in Continuous State-Action Gaussian-POMDPs
    • 16
    • PDF
    Regularizing Model-Based Planning with Energy-Based Models
    • 6
    • PDF
    Hierarchical model-based policy optimization: from actions to action sequences and back
    • PDF
    Exploring Model-based Planning with Policy Networks
    • 49
    • PDF
    Model-Based Policy Gradients with Parameter-Based Exploration by Least-Squares Conditional Density Estimation
    • 20
    • Highly Influenced
    • PDF

    References

    SHOWING 1-10 OF 39 REFERENCES
    Using inaccurate models in reinforcement learning
    • 202
    • Highly Influential
    • PDF
    Probabilistic Inference for Fast Learning in Control
    • 24
    • PDF
    Model-free off-policy reinforcement learning in continuous environment
    • P. Wawrzynski, A. Pacut
    • Computer Science
    • 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541)
    • 2004
    • 15
    • PDF
    Incorporating Domain Models into Bayesian Optimization for RL
    • 10
    • PDF
    Efficient reinforcement learning using Gaussian processes
    • 164
    • PDF
    Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning
    • 82
    • Highly Influential
    • PDF
    Learning to Control a Low-Cost Manipulator using Data-Efficient Reinforcement Learning
    • 175
    • PDF
    Reinforcement Learning in Continuous Time and Space
    • K. Doya
    • Mathematics, Medicine
    • Neural Computation
    • 2000
    • 785
    • Highly Influential
    • PDF
    Policy Gradient Methods for Robotics
    • Jan Peters, S. Schaal
    • Engineering, Computer Science
    • 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems
    • 2006
    • 467
    • PDF