Deterministic Policy Gradient Algorithms

@inproceedings{Silver2014DeterministicPG,
  title={Deterministic Policy Gradient Algorithms},
  author={David Silver and Guy Lever and Nicolas Heess and Thomas Degris and Daan Wierstra and Martin A. Riedmiller},
  booktitle={ICML},
  year={2014}
}
In this paper we consider deterministic policy gradient algorithms for reinforcement learning with continuous actions. The deterministic policy gradient has a particularly appealing form: it is the expected gradient of the action-value function. This simple form means that the deterministic policy gradient can be estimated much more efficiently than the usual stochastic policy gradient. To ensure adequate exploration, we introduce an off-policy actor-critic algorithm that learns a deterministic… CONTINUE READING
Highly Influential
This paper has highly influenced 56 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 546 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.

Citations

Publications citing this paper.
Showing 1-10 of 357 extracted citations

Controlling bicycle using deep deterministic policy gradient algorithm

2017 14th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI) • 2017
View 6 Excerpts
Highly Influenced

A Deep Deterministic Policy Gradient Approach to Medication Dosing and Surveillance in the ICU

2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) • 2018
View 7 Excerpts
Highly Influenced

Data Science

Communications in Computer and Information Science • 2018
View 10 Excerpts
Highly Influenced

546 Citations

010020030020152016201720182019
Citations per Year
Semantic Scholar estimates that this publication has 546 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
Showing 1-10 of 21 references

Reinforcement Learning: An Introduction

IEEE Transactions on Neural Networks • 1998
View 4 Excerpts
Highly Influenced

Natural Actor-Critic

Neurocomputing • 2008
View 6 Excerpts
Highly Influenced

Some notes on gradient descent

M. Toussaint
http://ipvs.informatik.uni-stuttgart. de/mlr/marc/notes/gradientDescent.pdf. • 2012
View 2 Excerpts

Similar Papers

Loading similar papers…