Off-policy evaluation for slate recommendation

  title={Off-policy evaluation for slate recommendation},
  author={Adith Swaminathan and Akshay Krishnamurthy and Alekh Agarwal and Miroslav Dud{\'i}k and John Langford and Damien Jose and Imed Zitouni},
This paper studies the evaluation of policies that recommend an ordered set of items (e.g., a ranking) based on some context—a common scenario in web search, ads, and recommendation. We build on techniques from combinatorial bandits to introduce a new practical estimator that uses logged data to estimate a policy’s performance. A thorough empirical evaluation on real-world data reveals that our estimator is accurate in a variety of settings, including as a subroutine in a learningto-rank task… CONTINUE READING
This paper has been referenced on Twitter 21 times. VIEW TWEETS


Publications referenced by this paper.

Similar Papers

Loading similar papers…