Corpus ID: 7299259

PAC-Bayesian Analysis of Martingales and Multiarmed Bandits

  • Yevgeny Seldin, F. Laviolette, J. Shawe-Taylor, Jan Peters, P. Auer
  • Published 2011
  • Computer Science, Mathematics
  • ArXiv
  • We present two alternative ways to apply PAC-Bayesian analysis to sequences of dependent random variables. The first is based on a new lemma that makes it possible to bound expectations of convex functions of certain dependent random variables by expectations of the same functions of independent Bernoulli random variables. This lemma provides an alternative to the Hoeffding-Azuma inequality for bounding the concentration of martingale values. Our second approach is based on integration of Hoeffding-Azuma…
    5 Citations
    PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits
    • 12
    APPRENTISSAGE SÉQUENTIEL : Bandits, Statistique et Renforcement
    • 7
    PAC-Bayesian Analysis of the Exploration-Exploitation Trade-off
    • 7
    Regulating Greed Over Time
    • 4

