Author pages are created from data sourced from our academic publisher partnerships and public sources.
- Publications
- Influence
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
- Lasse Espeholt, Hubert Soyer, +9 authors K. Kavukcuoglu
- Computer Science, Mathematics
- ICML
- 5 February 2018
TLDR
A Distributional Perspective on Reinforcement Learning
- Marc G. Bellemare, W. Dabney, R. Munos
- Computer Science, Mathematics
- ICML
- 17 July 2017
TLDR
Unifying Count-Based Exploration and Intrinsic Motivation
- Marc G. Bellemare, S. Srinivasan, Georg Ostrovski, T. Schaul, D. Saxton, R. Munos
- Computer Science
- NIPS
- 6 June 2016
TLDR
Minimax Regret Bounds for Reinforcement Learning
- Mohammad Gheshlaghi Azar, Ian Osband, R. Munos
- Mathematics, Computer Science
- ICML
- 16 March 2017
TLDR
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- J. Audibert, R. Munos, Csaba Szepesvari
- Mathematics, Computer Science
- Theor. Comput. Sci.
- 1 April 2009
TLDR
Finite-Time Bounds for Fitted Value Iteration
- R. Munos, Csaba Szepesvari
- Mathematics, Computer Science
- J. Mach. Learn. Res.
- 1 June 2008
TLDR
Safe and Efficient Off-Policy Reinforcement Learning
- R. Munos, Tom Stepleton, A. Harutyunyan, Marc G. Bellemare
- Computer Science, Mathematics
- NIPS
- 8 June 2016
TLDR
Distributional Reinforcement Learning with Quantile Regression
- W. Dabney, M. Rowland, Marc G. Bellemare, R. Munos
- Computer Science, Mathematics
- AAAI
- 27 October 2017
TLDR
Sample Efficient Actor-Critic with Experience Replay
- Ziyu Wang, V. Bapst, +4 authors N. D. Freitas
- Computer Science, Mathematics
- ICLR
- 3 November 2016
TLDR
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
- A. Antos, Csaba Szepesvari, R. Munos
- Mathematics, Computer Science
- Machine Learning
- 1 April 2008
TLDR