Reinforcement Learning: An Introduction
@article{Sutton2005ReinforcementLA,
title={Reinforcement Learning: An Introduction},
author={R. Sutton and A. Barto},
journal={IEEE Transactions on Neural Networks},
year={2005},
volume={16},
pages={285-286}
}Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. [...] Key Method Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view…Expand Abstract
Figures, Tables, and Topics from this paper.
Figures and Tables

figure 1.1 
figure 2.1 
figure 2.2 
figure 2.3 
figure 2.4 
figure 2.5 
figure 2.6 
figure 3.1 
figure 3.1 
figure 3.2 
figure 3.3 
figure 3.4 
figure 3.5 
figure 3.6 
figure 3.7 
figure 3.8 
figure 4.1 
figure 4.2 
figure 4.3 
figure 5.1 
figure 5.2 
figure 5.3 
figure 5.4 
figure 5.5 
figure 5.6 
figure 6.1 
figure 6.2 
figure 6.3 
figure 6.4 
figure 6.5 
figure 6.6 
figure 6.7 
figure 6.8 
figure 7.1 
figure 7.2 
figure 7.3 
figure 7.4 
figure 7.5 
figure 8.1 
figure 8.11 
figure 8.2 
figure 8.3 
figure 8.4 
figure 8.5 
figure 8.6 
figure 8.7 
figure 8.8 
figure 8.9 
figure 9.1 
figure 9.10 
figure 9.11 
figure 9.12 
figure 9.13 
figure 9.14 
figure 9.15 
figure 9.2 
figure 9.3 
figure 9.4 
figure 9.5 
figure 9.6 
figure 9.7 
figure 9.8 
figure 9.9 
figure 10.1 
figure 10.2 
figure 10.3 
figure 10.4 
figure 10.5 
figure 11.1 
figure 11.2 
figure 11.3 
figure 12.1 
figure 12.2 
figure 12.3 
figure 12.4 
figure 12.5 
figure 12.6 
figure 12.7 
figure 13.2 
figure 14.1 
figure 14.2 
figure 14.7 
figure 14.8 
figure 14.9 
figure 15.1 
figure 15.2 
figure 15.3 
figure 15.4 
figure 15.5 
figure 15.6 
figure 16.1 
table 16.1 
figure 16.11 
figure 16.12 
figure 16.16 
figure 16.2 
figure 16.3 
figure 16.4 
figure 16.6 
figure 16.7 
figure 16.8 
figure 16.9 
figure 17.1 
figure 20 
figure 21 
figure 22 
figure 23
25,306 Citations
Collectives for the Optimal Combination of Imperfect Objects
- Physics, Computer Science
- 2003
Open Access
Reciprocity inspired learning for opportunistic spectrum access in cognitive radio networks
- Computer Science
- 2013
Open Access
Машинное обучение - состояние и перспективы (Machine Learning: State of the Art and Perspectives)
- Computer Science
- 2013
REINFORCEMENT LEARNING IN THE CONTROL OF A SIMULATED LIFE SUPPORT SYSTEM
- Computer Science
- 2004
Highly Influenced
Open Access
References
Publications referenced by this paper.
SHOWING 1-10 OF 632 REFERENCES
Self-improving reactive agents based on reinforcement learning, planning and teaching
- Computer Science
- 2004
564
Open Access
Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons
- Computer Science
- 1991
284
Open Access
Importance sampling for reinforcement learning with multiple objectives
- Computer Science
- 2001
66
Open Access