Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning
@article{Moerland2020ThinkTF, title={Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning}, author={T. M. Moerland and Anna Deichler and S. Baldi and J. Broekens and Catholijn M. Jonker}, journal={ArXiv}, year={2020}, volume={abs/2005.07404} }
Planning and reinforcement learning are two key approaches to sequential decision making. Multi-step approximate real-time dynamic programming, a recently successful algorithm class of which AlphaZero [Silver et al., 2018] is an example, combines both by nesting planning within a learning loop. However, the combination of planning and learning introduces a new question: how should we balance time spend on planning, learning and acting? The importance of this trade-off has not been explicitly… CONTINUE READING
3 Citations
CAMPs: Learning Context-Specific Abstractions for Efficient Planning in Factored MDPs
- Computer Science
- ArXiv
- 2020
- PDF
References
SHOWING 1-10 OF 30 REFERENCES
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
- Computer Science, Mathematics
- NeurIPS
- 2018
- 314
- PDF
Beyond the One Step Greedy Approach in Reinforcement Learning
- Computer Science, Mathematics
- ICML
- 2018
- 17
- PDF
Reinforcement Learning: An Introduction
- Computer Science
- IEEE Transactions on Neural Networks
- 2005
- 26,976
- Highly Influential
- PDF
Multi-Step Greedy and Approximate Real Time Dynamic Programming
- Computer Science, Mathematics
- ArXiv
- 2019
- 4
Mastering the game of Go without human knowledge
- Medicine, Computer Science
- Nature
- 2017
- 3,773
- Highly Influential
- PDF
Exploration Strategies for Model-based Learning in Multi-agent Systems: Exploration Strategies
- Computer Science
- Autonomous Agents and Multi-Agent Systems
- 2004
- 62
- PDF