Mean-variance optimization of discrete time discounted Markov decision processes

  title={Mean-variance optimization of discrete time discounted Markov decision processes},
  author={Li Xia},
  • Li Xia
  • Published 2018
  • Mathematics, Computer Science
  • Autom.
In this paper, we study a mean-variance optimization problem in an infinite horizon discrete time discounted Markov decision process (MDP). The objective is to minimize the variance of system rewards with the constraint of mean performance. Different from most of works in the literature which require the mean performance already achieve optimum, we can let the mean discounted performance equal any constant. The difficulty of this problem is caused by the quadratic form of the variance function… Expand
Risk-Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance
  • L. Xia
  • Computer Science, Mathematics
  • ArXiv
  • 2020
A New Method for Mean-Variance Optimization of Stochastic Dynamic Systems
  • L. Xia, Zhen Yang
  • Computer Science
  • 2019 IEEE Conference on Control Technology and Applications (CCTA)
  • 2019
Variance-Based Risk Estimations in Markov Processes via Transformation with State Lumping
  • Shuai Ma, Jia Yuan Yu
  • Computer Science, Mathematics
  • 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC)
  • 2019
Modelling uncertainty in reinforcement learning
Multi-objective virtual network embedding algorithm based on Q-learning and curiosity-driven


Optimization of Markov decision processes under the variance criterion
  • Li Xia
  • Mathematics, Computer Science
  • Autom.
  • 2016
Variance minimization of parameterized Markov decision processes
Mean-Variance Criteria for Finite Continuous-Time Markov Decision Processes
On the First Passage g-Mean-Variance Optimality for Discounted Continuous-Time Markov Decision Processes
The risk probability criterion for discounted continuous-time Markov decision processes
Actor-Critic Algorithms for Risk-Sensitive MDPs
Mean-Variance Tradeoffs in an Undiscounted MDP
Sample-Path Optimality and Variance-Minimization of Average Cost Markov Control Processes
Constrained Markov Decision Processes
The $n$th-Order Bias Optimality for Multichain Markov Decision Processes
  • X. Cao, Junyu Zhang
  • Mathematics, Computer Science
  • IEEE Transactions on Automatic Control
  • 2008