Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence
- Victor Gabillon, M. Ghavamzadeh, A. Lazaric
- Computer Science, Mathematics
- NIPS
- 3 December 2012
Linear Thompson Sampling Revisited
- Marc Abeille, A. Lazaric
- Mathematics, Computer Science
- AISTATS
- 1 November 2016
Risk-Aversion in Multi-armed Bandits
- A. Sani, A. Lazaric, R. Munos
- Computer Science
- NIPS
- 3 December 2012
Finite-Sample Analysis of Least-Squares Policy Iteration
- A. Lazaric, M. Ghavamzadeh, R. Munos
- Mathematics, Computer Science
- J. Mach. Learn. Res.
- 2012
Online Stochastic Optimization under Correlated Bandit Feedback
- Mohammad Gheshlaghi Azar, A. Lazaric, Emma Brunskill
- Mathematics, Computer Science
- ICML
- 3 February 2014
Best-Arm Identification in Linear Bandits
- Marta Soare, A. Lazaric, R. Munos
- Computer Science, Mathematics
- NIPS
- 22 September 2014
Analysis of a Classification-based Policy Iteration Algorithm
- A. Lazaric, M. Ghavamzadeh, R. Munos
- Mathematics, Computer Science
- ICML
- 21 June 2010
Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits
- A. Carpentier, A. Lazaric, Mohammad Ghavamzadeh, Rémi Munos, P. Auer, A. Antos
- Computer Science, Mathematics
- ALT
- 5 October 2011
LSTD with Random Projections
- M. Ghavamzadeh, A. Lazaric, O. Maillard, R. Munos
- Computer Science, Mathematics
- NIPS
- 6 December 2010
Transfer in Reinforcement Learning: A Framework and a Survey
- A. Lazaric
- Computer Science
- Reinforcement Learning
- 2012