A Bayesian Approach for Policy Learning from Trajectory Preference Queries

@inproceedings{Wilson2012ABA,
  title={A Bayesian Approach for Policy Learning from Trajectory Preference Queries},
  author={Aaron Wilson and Alan Fern and Prasad Tadepalli},
  booktitle={NIPS},
  year={2012}
}
We consider the problem of learning control policies via trajectory preference queries to an expert. In particular, the agent presents an expert with short runs of a pair of policies originating from the same state and the expert indicates which trajectory is preferred. The agent’s goal is to elicit a latent target policy from the expert with as few queries as possible. To tackle this problem we propose a novel Bayesian model of the querying process and introduce two methods that exploit this… CONTINUE READING
Highly Cited
This paper has 63 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.

Citations

Publications citing this paper.
Showing 1-10 of 37 extracted citations

64 Citations

01020'13'14'15'16'17'18
Citations per Year
Semantic Scholar estimates that this publication has 64 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
Showing 1-10 of 21 references

Hybrid monte carlo

  • Simon Duane, A. D. Kennedy, Brian J. Pendleton, Duncan Roweth
  • Physics Letters B,
  • 1987
Highly Influential
8 Excerpts

Jordan . An introduction to mcmc for machine learning

  • Brenna D. Argall, Sonia Chernova, Manuela Veloso, Brett Browning
  • Machine Learning
  • 2003

Similar Papers

Loading similar papers…