Reinforcement Learning: A Survey

@article{Kaelbling1996ReinforcementLA,
  title={Reinforcement Learning: A Survey},
  author={L. Kaelbling and M. Littman and A. Moore},
  journal={J. Artif. Intell. Res.},
  year={1996},
  volume={4},
  pages={237-285}
}
  • L. Kaelbling, M. Littman, A. Moore
  • Published 1996
  • Mathematics, Computer Science
  • J. Artif. Intell. Res.
  • This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized. Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work described here has a resemblance to work in psychology, but differs considerably in… CONTINUE READING

    Figures, Tables, and Topics from this paper.

    Reinforcement Learning: A Tutorial.
    • 158
    • PDF
    A survey of inverse reinforcement learning techniques
    • 35
    Algorithms for Reinforcement Learning
    • 798
    • PDF
    A survey of reinforcement learning in relational domains
    • 67
    • PDF
    Reinforcement Learning: An Introduction
    • 25,543
    • PDF
    Use of Reinforcement Learning as a Challenge: A Review
    • 3
    • Highly Influenced
    A survey of inverse reinforcement learning techniques
    • 36
    • PDF
    Reinforcement learning in robotics: A survey
    • 1,426
    • PDF

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 230 REFERENCES
    Reinforcement learning for robots using neural networks
    • 744
    • PDF
    Learning in embedded systems
    • 714
    On-line Q-learning using connectionist systems
    • 1,159
    Q-learning
    • 5,538
    • Highly Influential
    • PDF
    Reward Functions for Accelerated Learning
    • 399
    • PDF
    Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • 2,917
    • PDF
    To Discount or Not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning
    • 48