Model Selection in Contextual Stochastic Bandit Problems
@article{Pacchiano2020ModelSI,
  title={Model Selection in Contextual Stochastic Bandit Problems},
  author={Aldo Pacchiano and My Phan and Yasin Abbasi-Yadkori and A. Rao and Julian Zimmert and Tor Lattimore and Csaba Szepesvari},
  journal={ArXiv},
  year={2020},
  volume={abs/2003.01704}
}
We study model selection in stochastic bandit problems. Our approach relies on a master algorithm that selects its actions among candidate base algorithms. While this problem is studied for specific classes of stochastic base algorithms, our objective is to provide a method that can work with more general classes of stochastic base algorithms. We propose a master algorithm inspired by CORRAL \cite{DBLP:conf/colt/AgarwalLNS17} and introduce a novel and generic smoothing transformation for…
15 Citations
Rate-adaptive model selection over a collection of black-box contextual bandit algorithms
- Computer Science, Mathematics
- ArXiv
- 2020
- 1
- Highly Influenced
Regret Bound Balancing and Elimination for Model Selection in Bandits and RL
- Computer Science, Mathematics
- ArXiv
- 2020
- 3
Pareto Optimal Model Selection in Linear Bandits
- Computer Science, Mathematics
- ArXiv
- 2021
- Highly Influenced
Smooth Bandit Optimization: Generalization to Hölder Space
- Computer Science, Mathematics
- AISTATS
- 2021
- Highly Influenced
Adapting to misspecification in contextual bandits with offline regression oracles
- Computer Science, Mathematics
- ArXiv
- 2021
Upper Confidence Bounds for Combining Stochastic Bandits
- Computer Science, Mathematics
- ArXiv
- 2020
- 1
- Highly Influenced
Multitask Bandit Learning through Heterogeneous Feedback Aggregation
- Computer Science, Mathematics
- AISTATS
- 2021
References
Model selection for contextual bandits
- Computer Science, Mathematics
- NeurIPS
- 2019
- 24
- Highly Influential
Mostly Exploration-Free Algorithms for Contextual Bandits
- Computer Science, Mathematics
- Manag. Sci.
- 2021
- 59
Provably Optimal Algorithms for Generalized Linear Contextual Bandits
- Computer Science, Mathematics
- ICML
- 2017
- 107
A Smoothed Analysis of the Greedy Algorithm for the Linear Contextual Bandit Problem
- Computer Science, Mathematics
- NeurIPS
- 2018
- 50
Almost Optimal Algorithms for Linear Stochastic Bandits with Heavy-Tailed Payoffs
- Computer Science, Mathematics
- NeurIPS
- 2018
- 17
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Computer Science, Mathematics
- Theor. Comput. Sci.
- 2009
- 412
Learning with Good Feature Representations in Bandits and in RL with a Generative Model
- Mathematics, Computer Science
- ICML
- 2020
- 35