Learning to Act Using Real-Time Dynamic Programming

@article{Barto1995LearningTA,
  title={Learning to Act Using Real-Time Dynamic Programming},
  author={Andrew G. Barto and Steven J. Bradtke and Satinder P. Singh},
  journal={Artif. Intell.},
  year={1995},
  volume={72},
  pages={81-138}
}
Learning methods based on dynamic programming (DP) are receiving increasing attention in arti cial intelligence. Researchers have argued that DP provides the appropriate basis for compiling planning results into reactive strategies for real-time control, as well as for learning such strategies when the system being controlled is incompletely known. We introduce an algorithm based on DP, which we call Real-Time DP (RTDP), by which an embedded system can improve its performance with experience… CONTINUE READING
BETA

Citations

Publications citing this paper.
SHOWING 1-10 OF 57 CITATIONS, ESTIMATED 37% COVERAGE

153 Citations

01020'95'00'06'12'18
Citations per Year
Semantic Scholar estimates that this publication has 153 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
SHOWING 1-10 OF 79 REFERENCES

Practical issues in temporal di erence learning

  • G. J. Tesauro
  • Machine Learning, 8:257{277,
  • 1992
Highly Influential
11 Excerpts

Similar Papers

Loading similar papers…