Mastering the game of Go without human knowledge
@article{Silver2017MasteringTG,
  title={Mastering the game of Go without human knowledge},
  author={David Silver and Julian Schrittwieser and Karen Simonyan and Ioannis Antonoglou and Aja Huang and Arthur Guez and Thomas Hubert and Lucas Baker and Matthew Lai and Adrian Bolton and Yutian Chen and Timothy P. Lillicrap and Fan Hui and L. Sifre and George van den Driessche and Thore Graepel and Demis Hassabis},
  journal={Nature},
  year={2017},
  volume={550},
  pages={354--359}
}
A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa, superhuman proficiency in challenging domains. […] AlphaGo becomes its own teacher: a neural network is trained to predict AlphaGo’s own move selections and also the winner of AlphaGo’s games. This neural network improves the strength of the tree search, resulting in higher quality move selection and stronger self-play in the next iteration. Starting tabula rasa, our new program AlphaGo Zero achieved…
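The training signal described in the abstract (predict the search's move selections and the eventual winner) can be sketched as a two-term loss. This is a minimal illustration, not the authors' implementation: it assumes a cross-entropy term for the move prediction and a squared-error term for the winner prediction, and the function name `policy_value_loss` and its parameters are hypothetical.

```python
import math

def policy_value_loss(policy_logits, value, search_probs, outcome):
    """Hypothetical combined loss for a single position.

    policy_logits: raw network scores, one per candidate move
    value:         network's predicted game outcome, in [-1, 1]
    search_probs:  move distribution produced by the tree search (target)
    outcome:       actual result for the player to move (+1 win, -1 loss)
    """
    # Numerically stable log-softmax over the move scores.
    max_logit = max(policy_logits)
    log_norm = max_logit + math.log(
        sum(math.exp(x - max_logit) for x in policy_logits)
    )
    log_probs = [x - log_norm for x in policy_logits]

    # Cross-entropy between the search's move distribution and the network's.
    policy_term = -sum(t * lp for t, lp in zip(search_probs, log_probs))

    # Squared error between the predicted and the actual game outcome.
    value_term = (outcome - value) ** 2

    return policy_term + value_term
```

Minimising a loss of this shape pulls the network's move probabilities towards the search's choices and its value estimate towards the observed winner, which is the sense in which the program "becomes its own teacher".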
7,027 Citations
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
- Computer Science
- ArXiv
- 2017
This paper generalises the approach into a single AlphaZero algorithm that can achieve, tabula rasa, superhuman performance in many challenging domains, and convincingly defeated a world-champion program in each case.
Artificial intelligence: Learning to play Go from scratch
- Computer Science
- Nature
- 2017
An artificial-intelligence program called AlphaGo Zero has mastered the game of Go without any human data or guidance, and the work suggests that the same fundamental principles of the game have some universal character, beyond human bias.
Efficiently Mastering the Game of NoGo with Deep Reinforcement Learning Supported by Domain Knowledge
- Computer Science
- Electronics
- 2021
The ultimate goal of this paper is to provide exploratory insights and mature auxiliary tools to enable AI researchers and computer-game communities to study, test, and improve these promising state-of-the-art methods at a much lower cost of computing resources.
Mastering the game of Go with deep neural networks and tree search
- Computer Science
- Nature
- 2016
Using this search algorithm, the program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0, the first time that a computer program has defeated a human professional player in the full-sized game of Go.
AlphaDDA: strategies for adjusting the playing strength of a fully trained AlphaZero system to a suitable human training partner
- Computer Science
- PeerJ Comput. Sci.
- 2022
This study shows that AlphaDDA can balance its skill with that of the other AI agents, except for a random player, and suggests that the AlphaDDA approach to DDA can be used for any game AI system if the DNN can accurately estimate the value of the game state and the parameter controlling the skills of the AI system is known.
AlphaDDA: game artificial intelligence with dynamic difficulty adjustment using AlphaZero
- Computer Science
- ArXiv
- 2021
This study shows that AlphaDDA can balance its skill with that of the other AI agents, except for a random player, and suggests that the AlphaDDA approach can be used for any game in which the DNN can estimate the value from the state.
Scaffolding Learning for the Novice Players of Go
- Computer Science, Biology
- ICITL
- 2019
This work uses deep learning and reinforcement learning to develop a scaffolding learning system for Go that contains human-like Go programs of various strengths, allowing novice players to learn the game progressively.
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
- Computer Science
- Science
- 2018
This paper generalizes the AlphaGo Zero approach into a single AlphaZero algorithm that can achieve superhuman performance in many challenging games, and that convincingly defeated a world champion program in the games of chess and shogi (Japanese chess), as well as Go.
A Game-centric Approach to Teaching Artificial Intelligence
- Computer Science
- CSEDU
- 2019
This paper reflects on a game-centric approach to teaching artificial intelligence that follows the historical development of algorithms by popping the hood of champion game-playing bots, and makes available a server infrastructure for playing card games in perfect-information and imperfect-information modes.
Mastering the game of Stratego with model-free multiagent reinforcement learning
- Art
- Science
- 2022
We introduce DeepNash, an autonomous agent that plays the imperfect information game Stratego at a human expert level. Stratego is one of the few iconic board games that artificial intelligence (AI)…
References
SHOWING 1-10 OF 67 REFERENCES
Mastering the game of Go with deep neural networks and tree search
- Computer Science
- Nature
- 2016
Using this search algorithm, the program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0, the first time that a computer program has defeated a human professional player in the full-sized game of Go.
TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play
- Computer Science
- Neural Computation
- 1994
The latest version of TD-Gammon is now estimated to play at a strong master level that is extremely close to the world's best human players.
Bootstrapping from Game Tree Search
- Computer Science
- NIPS
- 2009
This paper introduces a new algorithm for updating the parameters of a heuristic evaluation function by moving the heuristic towards the values computed by an alpha-beta search, and implements it in the chess program Meep using a linear heuristic function.
DeepStack: Expert-level artificial intelligence in heads-up no-limit poker
- Computer Science
- Science
- 2017
DeepStack is introduced, an algorithm for imperfect-information settings that combines recursive reasoning to handle information asymmetry, decomposition to focus computation on the relevant decision, and a form of intuition that is automatically learned from self-play using deep learning.
Human-level control through deep reinforcement learning
- Computer Science
- Nature
- 2015
This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging tasks.
The Integration of A Priori Knowledge into a Go Playing Neural Network
- Computer Science
- 1996
Methods for integrating expert Go knowledge into a learning artificial neural network are implemented in the program NeuroGo, which is able to achieve a playing strength which is equal to a conventional program playing at a medium level.
Reinforcement Learning: An Introduction
- Computer Science
- IEEE Transactions on Neural Networks
- 2005
This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning
- Computer Science
- NIPS
- 2014
The central idea is to use the slow planning-based agents to provide training data for a deep-learning architecture capable of real-time play, and new agents based on this idea are proposed and shown to outperform DQN.
Giraffe: Using Deep Reinforcement Learning to Play Chess
- Computer Science
- ArXiv
- 2015
Giraffe is the most successful attempt thus far at using end-to-end machine learning to play chess, with minimal hand-crafted knowledge given by the programmer.
Computing "Elo Ratings" of Move Patterns in the Game of Go
- Computer Science
- J. Int. Comput. Games Assoc.
- 2007
A new Bayesian technique for supervised learning of move patterns from game records, based on a generalization of Elo ratings, which outperforms most previous pattern-learning algorithms, both in terms of mean log-evidence, and prediction rate.