Corpus ID: 198897852

Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning

@article{Kartal2019TerminalPA,
  title={Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning},
  author={Bilal Kartal and Pablo Hernandez-Leal and Matthew E. Taylor},
  journal={ArXiv},
  year={2019},
  volume={abs/1907.10827}
}
  • Bilal Kartal, Pablo Hernandez-Leal, Matthew E. Taylor
  • Published 2019
  • Computer Science, Mathematics
  • ArXiv
  • Deep reinforcement learning has achieved great successes in recent years, but there are still open challenges, such as convergence to locally optimal policies and sample inefficiency. [...] Key Result Our results on Atari games and the BipedalWalker domain suggest that A3C-TP outperforms standard A3C in most of the tested domains and in others it has similar performance.Expand Abstract

    Figures and Topics from this paper.

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 43 REFERENCES
    Reinforcement Learning with Unsupervised Auxiliary Tasks
    598
    Deep Reinforcement Learning: A Brief Survey
    502
    Towards Sample Efficient Reinforcement Learning
    18
    Human-level control through deep reinforcement learning
    9336