Corpus ID: 102350686

Reinforced Imitation in Heterogeneous Action Space

@article{Zolna2019ReinforcedII,
  title={Reinforced Imitation in Heterogeneous Action Space},
  author={Konrad Zolna and N. Rostamzadeh and Yoshua Bengio and Sungjin Ahn and Pedro H. O. Pinheiro},
  journal={ArXiv},
  year={2019},
  volume={abs/1904.03438}
}
  • Konrad Zolna, N. Rostamzadeh, +2 authors Pedro H. O. Pinheiro
  • Published 2019
  • Computer Science, Mathematics
  • ArXiv
  • Imitation learning is an effective alternative approach to learn a policy when the reward function is sparse. In this paper, we consider a challenging setting where an agent and an expert use different actions from each other. We assume that the agent has access to a sparse reward function and state-only expert observations. We propose a method which gradually balances between the imitation learning cost and the reinforcement learning objective. In addition, this method adapts the agent's… CONTINUE READING
    5 Citations

    Figures, Tables, and Topics from this paper.

    Explore Further: Topics Discussed in This Paper

    Positive-Unlabeled Reward Learning
    • 3
    • Highly Influenced
    • PDF
    ING IN LEARNING AGENTS
    Towards intervention-centric causal reasoning in learning agents
    Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
    • 16
    • PDF

    References

    SHOWING 1-10 OF 32 REFERENCES
    Internal Model from Observations for Reward Shaping
    • 9
    • PDF
    Generative Adversarial Imitation Learning
    • 883
    • Highly Influential
    • PDF
    Overcoming Exploration in Reinforcement Learning with Demonstrations
    • 238
    • PDF
    Third-Person Imitation Learning
    • 121
    • PDF
    Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
    • 136
    • PDF
    Reinforcement Learning from Imperfect Demonstrations
    • 91
    • PDF
    Apprenticeship learning via inverse reinforcement learning
    • 1,979
    • PDF
    Reinforcement and Imitation Learning for Diverse Visuomotor Skills
    • 135
    • PDF
    Policy Optimization with Demonstrations
    • 41
    • PDF
    Observational Learning by Reinforcement Learning
    • 19
    • PDF