Semantic Scholar
Markov decision process
Known as: Value iteration, Policy iteration, Markov decision problems
Markov decision processes (MDPs) provide a mathematical framework for modeling decision making in situations where outcomes are partly random and…
Wikipedia
Related topics
49 relations
Apprenticeship learning
Artificial neural network
Automatic control
Backward induction
Broader (2)
Dynamic programming
Stochastic control
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2012
Computing Game Metrics on Markov Decision Processes
Hongfei Fu
International Colloquium on Automata, Languages…
2012
Corpus ID: 11528801
In this paper we study the complexity of computing the game bisimulation metric defined by de Alfaro et al. on Markov Decision…
2008
Hybrid ARQ-random network coding for wireless media streaming
Dong Nguyen, Tuan Tran, Thinh P. Q. Nguyen, B. Bose
Second International Conference on Communications…
2008
Corpus ID: 2099460
This paper proposes hybrid ARQ-random network coding frameworks for real-time media broadcast over single-hop wireless networks…
2006
Determining the Optimal Software Rejuvenation Schedule via Semi-Markov Decision Process
Hiroyuki Eto, T. Dohi
2006
Corpus ID: 62202
Software rejuvenation is a preventive and proactive maintenance policy that is particularly useful for counteracting the…
Highly Cited
2003
Dynamic Programming and Time-Varying Delay Systems
B. Lincoln
2003
Corpus ID: 60337047
This thesis is divided into two separate parts. The first part is about Dynamic Programming for non-trivial optimal control…
2003
Markov decision processes with fuzzy rewards
M. Kurano, M. Yasuda, J. Nakagami, Y. Yoshida
2003
Corpus ID: 9228601
In this paper, we consider the model that the information on the rewards in vector-valued Markov decision processes includes…
2002
First-Order Markov Decision Processes
Matthew Greig
2002
Corpus ID: 17143749
Markov Decision Processes (MDPs) [7] have developed lately as a standard method for representing uncertainty in decision…
1998
Theoretical Results on Reinforcement Learning with Temporally Abstract Behaviors
Doina Precup, R. Sutton, Satinder Singh
1998
Corpus ID: 14007696
We present new theoretical results on planning within the framework of temporally abstract reinforcement learning (Precup & Sut…
1991
Average Cost Markov Decision Processes: Optimality Conditions
J. Hennet, A. Lasserre
1991
Corpus ID: 124121972
(i.e., a Borel subset of a complete separable metric space), most of the available results impose on the MDP very restrictive…
1986
Markov decision drift processes
F. D. D. Schouten
1986
Corpus ID: 124922742
In Markov decision theory we distinguish (a) discrete-time Markov decision processes (b) semi-Markov decision…
1969
Perturbation Theory and Undiscounted Markov Renewal Programming
P. Schweitzer
Operational Research
1969
Corpus ID: 35962482
A recently-developed perturbation formalism for finite Markov chains is used here to analyze the policy iteration algorithm for…