Application of reinforcement learning to the game of Othello

  title={Application of reinforcement learning to the game of Othello},
  author={Nees Jan van Eck and Michiel C. van Wezel},
  journal={Computers & OR},
Operations research and management science are often confronted with sequential decision making problems with large state spaces. Standard methods that are used for solving such complex problems are associated with some difficulties. As we discuss in this article, these methods are plagued by the so-called curse of dimensionality and the curse of modelling. In this article, we discuss reinforcement learning, a machine learning technique for solving sequential decision making problems with large… CONTINUE READING
Highly Cited
This paper has 37 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 17 extracted citations

Reinforcement Learning: Psychologische und neurobiologische Aspekte

KI - Künstliche Intelligenz • 2013
View 5 Excerpts
Method Support
Highly Influenced

Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play

2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) • 2013
View 2 Excerpts


Publications referenced by this paper.
Showing 1-10 of 30 references

Reinforcement Learning: An Introduction

IEEE Transactions on Neural Networks • 1988
View 12 Excerpts
Highly Influenced

Othello: a minute to learn—a lifetime to master

B. Rose
View 1 Excerpt

Reinforcement learning for long-run average cost

European Journal of Operational Research • 2004
View 1 Excerpt

Nash Q-Learning for General-Sum Stochastic Games

Journal of Machine Learning Research • 2003
View 1 Excerpt

TD - gammon a selfteaching backgammon program , achieves masterlevel play

G Tesauro
Simulation - based optimization : parametric optimization techniques and reinforcement learning • 2003

A reinforcement learning approach to a single leg airline revenue management problem with multiple fare classes and overbooking

A Gosavi, N Bandla, TK. Das
IIE Transactions • 2002
View 1 Excerpt