Skip to search formSkip to main contentSkip to account menu
You are currently offline. Some features of the site may not work correctly.

Markov decision process

Known as: Value iteration, Policy iteration, Markov decision problems 
Markov decision processes (MDPs) provide a mathematical framework for modeling decision making in situations where outcomes are partly random and… 
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2005
Highly Cited
2005
Optimal solutions to Markov decision problems may be very sensitive with respect to the state transition probabilities. In many… 
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Highly Cited
2004
Highly Cited
2004
A critical issue for the application of Markov decision processes (MDPs) to realistic problems is how the complexity of planning… 
  • figure 1
  • figure 2
Highly Cited
2004
Highly Cited
2004
Formal treatment of collaborative multi-agent systems has been lagging behind the rapid progress in sequential decision making by… 
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Highly Cited
2002
Highly Cited
2002
The bandit problem is revisited and considered under the PAC model. Our main contribution in this part is to show that given n… 
Highly Cited
2001
Highly Cited
2001
  • F. Jensen
  • Statistics for Engineering and Information…
  • 2001
  • Corpus ID: 42979791
Probabilistic graphical models and decision graphs are powerful modeling tools for reasoning and decision making under… 
  • figure 1.1
  • figure 1.2
  • figure 1.3
Highly Cited
1999
Highly Cited
1999
INTRODUCTION Examples of Constrained Dynamic Control Problems On Solution Approaches for CMDPs with Expected Costs Other Types of… 
  • figure 8.1
  • figure 8.2
  • figure 8.3
Highly Cited
1996
Highly Cited
1996
1 Introduction.- 1.0 Background.- 1.1 Raison d'Etre and Limitations.- 1.2 A Menu of Courses and Prerequisites.- 1.3 For the… 
Highly Cited
1994
Highly Cited
1994
  • M. Puterman
  • Wiley Series in Probability and Statistics
  • 1994
  • Corpus ID: 122678161
From the Publisher: The past decade has seen considerable theoretical and applied research on Markov decision processes, as well… 
Highly Cited
1987
Highly Cited
1987
We investigate the complexity of the classical problem of optimal policy computation in Markov decision processes. All three… 
Highly Cited
1986
Highly Cited
1986
Publisher Summary This chapter summarizes the ability of the models to track the shift in departure rates induced by the 1982… 
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5