Tae Yoon Chun

  • Citations Per Year
Learn More
In this paper, we investigate the mathematical properties of generalized policy iteration (GPI) applied to a class of continuous-time linear systems with unknown internal dynamics. GPI is a class of dynamic programming method to solve a optimal control problem by using two consecutive steps—policy evaluation and policy improvement. We first provide several(More)
This paper presents the properties of policy iteration (PI)-mode monotone convergence and stability of generalized policy iteration (OPI) algorithms for discrete-time (DT) linear systems. OPI is one of the reinforcement learning based dynamic programming (DP) methods for solving optimal control problems, interacting policy evaluation and policy improvement(More)
  • 1