Incremental Natural Actor-Critic Algorithms

    Abstract

    We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning methods are online approximations to policy iteration in which the value-function parameters are estimated using temporal difference learning and the policy parameters are updated…
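
    The actor-critic structure described above can be illustrated with a minimal sketch. This is not one of the paper's four algorithms: it uses a plain (not natural) policy-gradient step for the actor, a tabular TD(0) critic, and a hypothetical two-state toy MDP. The names `step` and `run_actor_critic`, the step sizes, and the discount factor are all assumptions chosen for illustration.

```python
import math
import random


def softmax(prefs):
    """Turn action preferences into probabilities (shifted for numerical stability)."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def step(state, action):
    """Hypothetical toy MDP: the action chosen is the next state; entering state 1 pays reward 1."""
    next_state = action
    reward = 1.0 if next_state == 1 else 0.0
    return next_state, reward


def run_actor_critic(num_steps=5000, alpha=0.1, beta=0.05, gamma=0.9, seed=0):
    """Run a plain (non-natural) actor-critic loop; all hyperparameters are illustrative."""
    rng = random.Random(seed)
    n_states, n_actions = 2, 2
    v = [0.0] * n_states                                  # critic: tabular value estimates
    theta = [[0.0] * n_actions for _ in range(n_states)]  # actor: softmax action preferences
    state = 0
    for _ in range(num_steps):
        probs = softmax(theta[state])
        action = rng.choices(range(n_actions), weights=probs)[0]
        next_state, reward = step(state, action)

        # Critic: temporal-difference (TD(0)) error and value update.
        delta = reward + gamma * v[next_state] - v[state]
        v[state] += alpha * delta

        # Actor: move preferences along grad log pi(action | state), scaled by the TD error.
        for a in range(n_actions):
            grad_log_pi = (1.0 if a == action else 0.0) - probs[a]
            theta[state][a] += beta * delta * grad_log_pi

        state = next_state
    return v, theta


if __name__ == "__main__":
    values, preferences = run_actor_critic()
    print("value estimates:", values)
    print("policy in state 0:", softmax(preferences[0]))
    print("policy in state 1:", softmax(preferences[1]))
```

    The critic runs with a larger step size than the actor (`alpha > beta`), so the value estimates track the slowly changing policy. A natural-gradient variant would replace the actor's update direction and is not shown here.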
