Model-Ensemble Trust-Region Policy Optimization

@article{Kurutach2018ModelEnsembleTP,
  title={Model-Ensemble Trust-Region Policy Optimization},
  author={Thanard Kurutach and Ignasi Clavera and Yan Duan and Aviv Tamar and Pieter Abbeel},
  journal={CoRR},
  year={2018},
  volume={abs/1802.10592}
}
Model-free reinforcement learning (RL) methods are succeeding in a growing number of tasks, aided by recent advances in deep learning. However, they tend to suffer from high sample complexity, which hinders their use in real-world domains. Model-based reinforcement learning, by contrast, promises to reduce sample complexity, but it tends to require careful tuning and, to date, has succeeded mainly in restrictive domains where simple models are sufficient for learning. In this paper, we analyze…
