Koichi Nakade

Suggest Changes
Learn More
Undiscounted Markov decision processes (UMDP’s) can formulate optimal stochastic control problems that minimize the expected total cost per period for various systems. We propose new approximate(More)