Mastering the game of Go with deep neural networks and tree search

@article{Silver2016MasteringTG,
  title={Mastering the game of {Go} with deep neural networks and tree search},
  author={D. Silver and Aja Huang and Chris J. Maddison and A. Guez and L. Sifre and George van den Driessche and Julian Schrittwieser and Ioannis Antonoglou and Vedavyas Panneershelvam and Marc Lanctot and S. Dieleman and Dominik Grewe and John Nham and Nal Kalchbrenner and Ilya Sutskever and T. Lillicrap and M. Leach and K. Kavukcuoglu and T. Graepel and Demis Hassabis},
  journal={Nature},
  year={2016},
  volume={529},
  pages={484--489}
}
  • D. Silver, Aja Huang, +17 authors Demis Hassabis
  • Published 2016
  • Computer Science, Medicine
  • Nature
  • The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. [...] Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this…
    Playing Go without Game Tree Search Using Convolutional Neural Networks (cited by 2)
    GoGoGo : Improving Deep Neural Network Based Go Playing AI with Residual Networks
    Google AI algorithm masters ancient game of Go (cited by 83)
    Mastering the game of Go without human knowledge (cited by 3111)
    Learning Self-Game-Play Agents for Combinatorial Optimization Problems (cited by 3)
    Learning to Play the Chess Variant Crazyhouse Above World Champion Level With Deep Neural Networks and Human Data (cited by 2)
    Monster Carlo 2: Integrating Learning and Tree Search for Machine Playtesting (cited by 1)
