Davis Gilton

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
This paper describes a linear multi-armed bandit algorithm that exploits sparsity in the underlying unknown weight vector controlling rewards. In linear multi-armed bandits, a user chooses a sequence of (slot machine) “arms” to pull, and each arm pull results in the user receiving a stochastic reward with mean equal to the inner product(More)
  • 1