Manasa Mandava

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
— In continuous-time Markov decision processes (CTMDPs) with Borel state and action spaces, unbounded transition rates, for an arbitrary policy, we construct a relaxed Markov policy such that the marginal distribution on the state-action pairs at any time instant is the same for both the policies. This result implies the existence of a relaxed Markov policy(More)
  • 1