Corpus ID: 18670075

A Multiworld Testing Decision Service

  title={A Multiworld Testing Decision Service},
  author={A. Agarwal and S. Bird and Markus Cozowicz and Luong Hoang and J. Langford and Stephen Lee and J. Li and D. Melamed and Gal Oshri and Oswaldo Ribas and S. Sen and Alex Slivkins},
  • A. Agarwal, S. Bird, +9 authors Alex Slivkins
  • Published 2016
  • Computer Science, Mathematics
  • ArXiv
  • Applications and systems are constantly faced with decisions to make, often using a policy to pick from a set of actions based on some contextual information. We create a service that uses machine learning to accomplish this goal. The service uses exploration, logging, and online learning to create a counterfactually sound system supporting a full data lifecycle. The system is general: it works for any discrete choices, with respect to any reward metric, and can work with many learning… CONTINUE READING
    26 Citations
    ItemSuggest: A Data Management Platform for Machine Learned Ranking Services
    • 2
    • PDF
    Programming by Rewards
    • 1
    • Highly Influenced
    • PDF
    Ray: A Distributed Execution Engine for the Machine Learning Ecosystem
    • PDF
    Resonance: Replacing Software Constants with Context-Aware Models in Real-time Communication
    • PDF
    Ray: A Distributed Framework for Emerging AI Applications
    • 251
    • PDF
    Augmenting Policy Learning with Routines Discovered from a Demonstration
    • PDF
    Pyramid: Enhancing Selectivity in Big Data Protection with Count Featurization
    • 7
    • PDF


    Doubly Robust Policy Evaluation and Learning
    • 355
    • PDF
    Contextual Bandit Learning with Predictable Rewards
    • 33
    • PDF
    Machine Learning: The High Interest Credit Card of Technical Debt
    • 188
    • PDF
    Efficient Contextual Semi-Bandit Learning
    • 7
    Using Confidence Bounds for Exploitation-Exploration Trade-offs
    • P. Auer
    • Mathematics, Computer Science
    • J. Mach. Learn. Res.
    • 2002
    • 1,050
    • PDF
    Controlled experiments on the web: survey and practical guide
    • 541
    • PDF
    Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
    • 293
    • PDF