A Bayesian Approach for Policy Learning from Trajectory Preference Queries

Aaron Wilson, Alan Fern, Prasad Tadepalli
We consider the problem of learning control policies via trajectory preference queries to an expert. In particular, the agent presents an expert with short runs of a pair of policies originating from the same state, and the expert indicates which trajectory is preferred. The agent’s goal is to elicit a latent target policy from the expert with as few queries as possible. To tackle this problem, we propose a novel Bayesian model of the querying process and introduce two methods that exploit this…
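
The query protocol described in the abstract can be illustrated with a small sketch. This is not the authors' Bayesian model, which is not given on this page; it only shows the shape of a single trajectory preference query. The toy 1-D chain dynamics, the noiseless simulated expert, and every name below (`rollout`, `agreement`, `expert_prefers_first`, `target_policy`) are assumptions made for illustration.

```python
def rollout(policy, start_state, horizon):
    """Run a policy for `horizon` steps on a toy 1-D chain (assumed dynamics).

    Returns the trajectory as a list of (state, action) pairs.
    """
    state, trajectory = start_state, []
    for _ in range(horizon):
        action = policy(state)
        trajectory.append((state, action))
        state += action  # toy dynamics: each action is a step of -1 or +1
    return trajectory


def agreement(trajectory, target_policy):
    """Count the steps where the trajectory's action matches the target policy."""
    return sum(1 for state, action in trajectory if action == target_policy(state))


def expert_prefers_first(traj_a, traj_b, target_policy):
    """Hypothetical noiseless expert: prefer the trajectory that agrees more
    often with the latent target policy (ties go to the first trajectory)."""
    return agreement(traj_a, target_policy) >= agreement(traj_b, target_policy)


# Hypothetical latent target policy: always walk toward state 0.
def target_policy(state):
    return -1 if state > 0 else 1


# Two candidate policies the agent wants the expert to compare.
def always_left(state):
    return -1


def always_right(state):
    return 1


# One query: short runs of both policies originating from the same state.
start = 3
traj_a = rollout(always_left, start, horizon=3)
traj_b = rollout(always_right, start, horizon=3)
answer = expert_prefers_first(traj_a, traj_b, target_policy)
```

Here both short runs start from the same state, and the expert's single-bit answer is the only feedback the agent receives. A learner in this setting would use many such answers to narrow down which policy the expert has in mind.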

