Dotan Di Castro

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
In adaptive control, agents interacting with Markov Decision Processes typically face two types of setups. In the first setup, the environment's model is known and dynamic programming and related methods are used to obtain the optimal control. In the second setup, the environment's model is unknown and reinforcement learning methods are used. In this work(More)
  • 1