Learn More
In this paper, we consider the model that the information on the rewards in vector-valued Markov decision processes includes imprecision or ambiguity. The fuzzy reward model is analyzed as follows: The fuzzy reward is represented by the fuzzy set on the multi-dimensional Euclidian space R p and the infinite horizon fuzzy expected discounted reward(FEDR)(More)
We consider utility-constrained Markov decision processes. The expected utility of the total discounted reward is maximized subject to multiple expected utility constraints. By introducing a corresponding Lagrange function, a saddle-point theorem of the utility constrained optimization is derived. The existence of a constrained optimal policy is(More)
In this paper, a Markov decision model with uncertain transition matrices, which allow a matrix to fluctuate at each step in time, is described by the use of fuzzy sets. We find a pareto optimal policy maximizing the infinite horizon fuzzy expected discounted reward over all stationary policies under some partial order. The pareto optimal policies are(More)
This paper discusses two topics on fuzzy random variables in decision making. One is a new evaluation method of fuzzy random variables, and the other is to present a mathematical model in financial engineering by fuzzy random variables. The evaluation method is introduced as mean values defined by fuzzy measures, and it is also applicable to fuzzy numbers(More)
We formulate a stopping problem for dynamic fuzzy systems concerning with fuzzy decision environment. It could be regarded as a natural fuzzification of non-fuzzy stopping problem with a deterministic dynamic system. The validity of the approach by α-cuts of fuzzy sets will be discussed in constructing One-step Look Ahead policy of an optimal fuzzy stopping(More)
In this paper, the average cases of Markov decision processes with uncertainty is considered. That is, a controlled Markov set-chain model with a finite state and action space is developed by an interval arithmetic analysis, and we will find a Pareto optimal policy which maximizes the average expected rewards over all stationary policies under a new partial(More)