#### Filter Results:

- Full text PDF available (33)

#### Publication Year

2002

2017

- This year (3)
- Last 5 years (28)
- Last 10 years (30)

#### Publication Type

#### Co-author

#### Publication Venue

#### Key Phrases

Learn More

- Zohar S. Karnin, Tomer Koren, Oren Somekh
- ICML
- 2013

We study the problem of exploration in stochastic Multi-Armed Bandits. Even in the simplest setting of identifying the best arm, there remains a logarithmic multiplica-tive gap between the known lower and upper bounds for the number of arm pulls required for the task. This extra logarithmic factor is quite meaningful in nowadays large-scale applications. We… (More)

- Elad Hazan, Tomer Koren, Roi Livni, Yishay Mansour
- COLT
- 2016

We consider the problem of prediction with expert advice when the losses of the experts have low-dimensional structure: they are restricted to an unknown d-dimensional subspace. We devise algorithms with regret bounds that are independent of the number of experts and depend only on the rank d. For the stochastic model we show a tight bound of Θp ? dT q, and… (More)

- T. Koren
- 2005

This document describes a method to improve the bandwidth utilization of RTP streams over network paths that carry multiple Real-time Transport Protocol (RTP) streams in parallel between two endpoints, as in voice trunking. The method combines standard protocols that provide compression, multiplexing, and tunneling over a network path for the purpose of… (More)

- Elad Hazan, Tomer Koren
- Math. Program.
- 2016

We consider the fundamental problem of maximizing a general quadratic function over an ellipsoidal domain, also known as the trust region problem. We give the first provable linear-time (in the number of non-zero entries of the input) algorithm for approximately solving this problem. Specifically, our algorithm returns an ǫ-approximate solution in time˜O(N/… (More)

- Tomer Koren, Kfir Y. Levy
- NIPS
- 2015

We consider Empirical Risk Minimization (ERM) in the context of stochastic optimization with exp-concave and smooth losses—a general optimization framework that captures several important learning problems including linear and logistic regression, learning SVMs with the squared hinge-loss, portfolio selection and more. In this setting, we establish the… (More)

- Aharon Ben-Tal, Elad Hazan, Tomer Koren, Shie Mannor
- Operations Research
- 2015

Robust optimization is a common framework in optimization under uncertainty when the problem parameters are not known, but it is rather known that the parameters belong to some given uncertainty set. In the robust optimization framework the problem solved is a min-max problem where a solution is judged according to its performance on the worst possible… (More)

- Ofer Dekel, Jian Ding, Tomer Koren, Yuval Peres
- STOC
- 2014

We study the adversarial multi-armed bandit problem in a setting where the player incurs a unit cost each time he switches actions. We prove that the player's <i>T</i>-round minimax regret in this setting is [EQUATION], thereby closing a fundamental gap in our understanding of learning with bandit feedback. In the corresponding full-information version of… (More)

- Tomer Koren
- COLT
- 2013

Stochastic exp-concave optimization is an important primitive in machine learning that captures several fundamental problems, including linear regression, logistic regression and more. The exp-concavity property allows for fast convergence rates, as compared to general stochastic optimization. However, current algorithms that attain such rates scale poorly… (More)

- Ofer Dekel, Ronen Eldan, Tomer Koren
- NIPS
- 2015

Bandit convex optimization is one of the fundamental problems in the field of online learning. The best algorithm for the general bandit convex optimization problem guarantees a regret of e O(T 5/6), while the best known lower bound is ⌦(T 1/2). Many attempts have been made to bridge the huge gap between these bounds. A particularly interesting special case… (More)

- Noga Alon, Nicolò Cesa-Bianchi, Ofer Dekel, Tomer Koren
- COLT
- 2015

We study a general class of online learning problems where the feedback is specified by a graph. This class includes online prediction with expert advice and the multi-armed bandit problem, but also several learning problems where the online player does not necessarily observe his own loss. We analyze how the structure of the feedback graph controls the… (More)