In probability theory, the multi-armed bandit problem (sometimes called the K- or N-armed bandit problem) is a problem in which a gambler at a row ofâ€¦Â (More)

Semantic Scholar uses AI to extract papers important to this topic.

2015

2015

This paper is about the study of Multiâ€“Armed Bandit (MAB) approaches for pricing applications, where a seller needs to identifyâ€¦Â (More)

Is this relevant?

Highly Cited

2012

Highly Cited

2012

- SÃ©bastien Bubeck, NicolÃ² Cesa-Bianchi
- Foundations and Trends in Machine Learning
- 2012

Multi-armed bandit problems are the most basic examples of sequential decision problems with an explorationâ€“exploitation tradeâ€¦Â (More)

Is this relevant?

Highly Cited

2012

Highly Cited

2012

- Shipra Agrawal, Navin Goyal
- COLT
- 2012

The multi-armed bandit problem is a popular model for studying exploration/exploitation trade-off in sequential decision problemsâ€¦Â (More)

Is this relevant?

Highly Cited

2010

Highly Cited

2010

We formulate and study a decentralized multi-armed bandit (MAB) problem. There are M distributed players competing for Nâ€¦Â (More)

Is this relevant?

2008

2008

This paper considers the multi-armed bandit problem with multiple simultaneous arm pulls. We develop a new â€˜irrevocableâ€¦Â (More)

Is this relevant?

Highly Cited

2007

Highly Cited

2007

- BANDIT PROBLEMS, Aditya Mahajan, Demosthenis Teneketzis
- 2007

Multi-armed bandit (MAB) problems are a class of sequential resource allocation problems concerned with allocating one or moreâ€¦Â (More)

Is this relevant?

Highly Cited

2007

Highly Cited

2007

- Sandeep Pandey, Deepayan Chakrabarti, Deepak Agarwal
- ICML
- 2007

We provide a framework to exploit dependencies among arms in multi-armed bandit problems, when the dependencies are in the formâ€¦Â (More)

Is this relevant?

Highly Cited

2005

Highly Cited

2005

- JoannÃ¨s Vermorel, Mehryar Mohri
- ECML
- 2005

The multi-armed bandit problem for a gambler is to decide which arm of a K-slot machine to pull to maximize his total reward in aâ€¦Â (More)

Is this relevant?

Highly Cited

2002

Highly Cited

2002

- Eyal Even-Dar, Shie Mannor, Yishay Mansour
- COLT
- 2002

The bandit problem is revisited and considered under the PAC model. Our main contribution in this part is to show that given nâ€¦Â (More)

Is this relevant?

Highly Cited

1987

Highly Cited

1987

- Michael N. Katehakis, Arthur F. Veinott
- Math. Oper. Res.
- 1987

The multi-armed bandit problem arises in sequentially allocating effcot to one of N prefects and sequentially asngning patientsâ€¦Â (More)

Is this relevant?