Human-level control through deep reinforcement learning

@article{Mnih2015HumanlevelCT,
  title={Human-level control through deep reinforcement learning},
  author={Volodymyr Mnih and Koray Kavukcuoglu and David Silver and Andrei A. Rusu and Joel Veness and Marc G. Bellemare and Alex Graves and Martin A. Riedmiller and Andreas Fidjeland and Georg Ostrovski and Stig Petersen and Charles Beattie and Amir Sadik and Ioannis Antonoglou and Helen King and Dharshan Kumaran and Daan Wierstra and Shane Legg and Demis Hassabis},
  journal={Nature},
  year={2015},
  volume={518},
  pages={529-533}
}
The theory of reinforcement learning provides a normative account, deeply rooted in psychological and neuroscientific perspectives on animal behaviour, of how agents may optimize their control of an environment. To use reinforcement learning successfully in situations approaching real-world complexity, however, agents are confronted with a difficult task: they must derive efficient representations of the environment from high-dimensional sensory inputs, and use these to generalize past… CONTINUE READING
BETA

Citations

Publications citing this paper.
SHOWING 1-10 OF 3,856 CITATIONS, ESTIMATED 32% COVERAGE

A Kernel Loss for Solving the Bellman Equation

VIEW 5 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Autonomous Penetration Testing using Reinforcement Learning

VIEW 32 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Deep Reinforcement Learning via L-BFGS Optimization.

VIEW 19 EXCERPTS
CITES METHODS, BACKGROUND & RESULTS
HIGHLY INFLUENCED

Deep Reinforcement Learning with Decorrelation

VIEW 8 EXCERPTS
CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

Energy Storage Management via Deep Q-Networks

VIEW 10 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

HQTimer: A Hybrid ${Q}$ -Learning-Based Timeout Mechanism in Software-Defined Networks

  • IEEE Transactions on Network and Service Management
  • 2019
VIEW 25 EXCERPTS
CITES BACKGROUND, METHODS & RESULTS
HIGHLY INFLUENCED

Latent Dirichlet Allocation for Internet Price War

  • AAAI 2019
  • 2019
VIEW 4 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Learning Feature Extraction for Transfer from Simulation to Reality

  • 2019
VIEW 7 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

FILTER CITATIONS BY YEAR

2000
2019

CITATION STATISTICS

  • 844 Highly Influenced Citations

  • Averaged 1,018 Citations per year over the last 3 years

  • 33% Increase in citations per year in 2018 over 2017

References

Publications referenced by this paper.
SHOWING 1-2 OF 2 REFERENCES

Human-level control through deep reinforcement learning

V Mnih
  • Nature 518,
  • 2015

Similar Papers

Loading similar papers…