Kullback–Leibler upper confidence bounds for optimal sequential allocation
- O. Cappé, A. Garivier, O. Maillard, Rémi Munos, Gilles Stoltz
- Mathematics
- 3 October 2012
We consider optimal sequential allocation in the context of the so-called stochastic multi-armed bandit model. We describe a generic index policy, in the sense of Gittins (1979), based on upper…
Concentration inequalities for sampling without replacement
- R. Bardenet, O. Maillard
- Mathematics
- 16 September 2013
Concentration inequalities quantify the deviation of a random variable from a fixed value. In spite of numerous applications, such as opinion surveys or ecological counting procedures, few…
A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences
- O. Maillard, R. Munos, G. Stoltz
- Mathematics, Computer Science
- COLT
- 29 May 2011
LSTD with Random Projections
- M. Ghavamzadeh, A. Lazaric, O. Maillard, R. Munos
- Mathematics, Computer Science
- NIPS
- 6 December 2010
Compressed Least-Squares Regression
- O. Maillard, R. Munos
- Mathematics, Computer Science
- NIPS
- 7 December 2009
Latent Bandits
- O. Maillard, Shie Mannor
- Mathematics, Computer Science
- ICML
- 21 June 2014
The non-stationary stochastic multi-armed bandit problem
- Robin Allesiardo, R. Féraud, O. Maillard
- Mathematics, Computer Science
- International Journal of Data Science and…
- 30 March 2017
Robust Risk-Averse Stochastic Multi-armed Bandits
- O. Maillard
- Computer Science
- ALT
- 6 October 2013
Streaming kernel regression with provably adaptive mean, variance, and regularization
- A. Durand, O. Maillard, Joelle Pineau
- Mathematics, Computer Science
- J. Mach. Learn. Res.
- 2 August 2017
Sequential change-point detection: Laplace concentration of scan statistics and non-asymptotic delay bounds
- O. Maillard
- Computer Science
- ALT
- 10 March 2019