TD Models: Modeling the World at a Mixture of Time Scales

@inproceedings{Sutton1995TDMM,
  title={TD Models: Modeling the World at a Mixture of Time Scales},
  author={Richard S. Sutton},
  booktitle={ICML},
  year={1995}
}
Temporal-diierence (TD) learning can be used not just to predict rewards, as is commonly done in reinforcement learning, but also to predict states, i.e., to learn a model of the world's dynamics. We present theory and algorithms for intermixing TD models of the world at diierent levels of temporal abstraction within a single structure. Such multi-scale TD models can be used in model-based reinforcement-learning architectures and dynamic programming methods in place of conventional Markov… CONTINUE READING
95 Citations
26 References
Similar Papers

Similar Papers

Loading similar papers…