On Optimistic versus Randomized Exploration in Reinforcement Learning


We discuss the relative merits of optimistic and randomized approaches to exploration in reinforcement learning. Optimistic approaches presented in the literature apply an optimistic boost to the value estimate at each state-action pair and select actions that are greedy with respect to the resulting optimistic value function. Randomized approaches sample… (More)

4 Figures and Tables


  • Presentations referencing similar topics