Pure Stationary Optimal Strategies in Markov Decision Processes

  title={Pure Stationary Optimal Strategies in Markov Decision Processes},
  author={Hugo Gimbert},
Markov decision processes (MDPs) are controllable discrete event systems with stochastic transitions. Performances of an MDP are evaluated by a payoff function. The controller of the MDP seeks to optimize those performances, using optimal strategies. There exists various ways of measuring performances, i.e. various classes of payoff functions. For example, average performances can be evaluated by a mean-payoff function, peak performances by a limsup payoff function, and the parity payoff… CONTINUE READING

From This Paper

Topics from this paper.

Similar Papers

Loading similar papers…