Generalization in Reinforcement Learning by Wouter Josemans Born February

@inproceedings{Dimitrakakis2009GeneralizationIR,
  title={Generalization in Reinforcement Learning by Wouter Josemans Born February},
  author={Christos Dimitrakakis and Dr. Shimon Whiteson},
  year={2009}
}
In this paper we evaluate two Temporal Difference Reinforcement Learning methods on several different tasks to see how well these methods generalize. The tasks were modeled as Markov Decision Processes with a continuous observation space and a discrete action space. Function approximation was done using linear gradient descent with RBFs as features. The tasks were taken from the Polyathlon domain of the 2009 Reinforcement Learning Competition. It was found that the more sophisticated method… CONTINUE READING