Many-objective stochastic path finding using reinforcement learning

  title={Many-objective stochastic path finding using reinforcement learning},
  author={Bentz Tozer and Thomas A. Mazzuchi and Shahram Sarkani},
  journal={Expert Syst. Appl.},
In this paper, we investigate solutions to path finding problems with many conflicting objectives, and introduce a new model-free many objective reinforcement learning algorithm, called Voting Q-learning, that is capable of finding a set of optimal policies in an initially unknown, stochastic environment with several conflicting objectives. Current methods for solving this type of problem rely on Pareto dominance to determine which actions are optimal, which decreases in effectiveness as the… CONTINUE READING
Recent Discussions
This paper has been referenced on Twitter 1 time over the past 90 days. VIEW TWEETS