Yuenan Hou

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
Recently, a state-of-the-art algorithm, called deep deterministic policy gradient (DDPG), has achieved good performance in many continuous control tasks in the MuJoCo simulator. To further improve the efficiency of the experience replay mechanism in DDPG and thus speeding up the training process, in this paper, a prioritized experience replay method is(More)
  • 1