Learning from Limited Demonstrations

@inproceedings{Kim2013LearningFL,
  title={Learning from Limited Demonstrations},
  author={Beomjoon Kim and Amir-massoud Farahmand and Joelle Pineau and Doina Precup},
  booktitle={NIPS},
  year={2013}
}
We propose a Learning from Demonstration (LfD) algorithm which leverages expert data, even if they are very few or inaccurate. We achieve this by using both expert data, as well as reinforcement signals gathered through trial-and-error interactions with the environment. The key idea of our approach, Approximate Policy Iteration with Demonstration (APID), is that expert’s suggestions are used to define linear constraints which guide the optimization performed by Approximate Policy Iteration. We… CONTINUE READING
Highly Cited
This paper has 30 citations. REVIEW CITATIONS
19 Citations
29 References
Similar Papers

Similar Papers

Loading similar papers…