Efficient Selection of Multiple Bandit Arms: Theory and Practice

Abstract

We consider the general, widely applicable problem of selecting, from n real-valued random variables, a subset of size m containing those with the highest means, based on as few samples as possible. This problem, which we denote Explore-m, is a core component of several stochastic optimization algorithms and of applications in simulation and industrial engineering. The…
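
To make the problem statement concrete, the following is a minimal sketch of the Explore-m setting using a naive uniform-sampling baseline: every arm is sampled the same number of times and the m arms with the largest empirical means are returned. This is not the algorithm studied in the paper, whose details are not given in the abstract above. The names `explore_m_naive` and `make_bernoulli_arm`, the Bernoulli reward model, and the fixed per-arm budget are all assumptions made for illustration.

```python
import random


def explore_m_naive(arms, m, samples_per_arm, rng):
    """Naive baseline (an assumption, not the paper's method).

    Samples every arm `samples_per_arm` times and returns the sorted
    indices of the m arms with the highest empirical means.
    """
    empirical_means = []
    for pull in arms:
        total = sum(pull(rng) for _ in range(samples_per_arm))
        empirical_means.append(total / samples_per_arm)
    # Rank arm indices by empirical mean, best first.
    ranked = sorted(range(len(arms)), key=lambda i: empirical_means[i], reverse=True)
    return sorted(ranked[:m])


def make_bernoulli_arm(p):
    """Hypothetical arm: returns 1.0 with probability p, otherwise 0.0."""
    return lambda rng: 1.0 if rng.random() < p else 0.0


# n = 5 arms with illustrative true means; we want the best m = 2.
true_means = [0.9, 0.8, 0.3, 0.2, 0.1]
arms = [make_bernoulli_arm(p) for p in true_means]
selected = explore_m_naive(arms, m=2, samples_per_arm=2000, rng=random.Random(0))
print(selected)
```

The baseline spends n times `samples_per_arm` samples regardless of how well separated the arms are. The stated goal of using as few samples as possible is what motivates methods that adapt the sampling effort to the observed rewards.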

59 Citations

Semantic Scholar estimates that this publication has 59 citations based on the available data.