Corpus ID: 201319046

A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation

@inproceedings{Yang2019AGA,
  title={A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation},
  author={Runzhe Yang and Xingyuan Sun and Karthik Narasimhan},
  booktitle={NeurIPS},
  year={2019}
}
We introduce a new algorithm for multi-objective reinforcement learning (MORL) with linear preferences, with the goal of enabling few-shot adaptation to new tasks. In MORL, the aim is to learn policies over multiple competing objectives whose relative importance (preferences) is unknown to the agent. While this alleviates dependence on scalar reward design, the expected return of a policy can change significantly with varying preferences, making it challenging to learn a single model to produce…
24 Citations
Provable Multi-Objective Reinforcement Learning with Generative Models
  • Highly Influenced
A Distributional View on Multi-Objective Policy Optimization
  • 5
  • Highly Influenced
A Practical Guide to Multi-Objective Reinforcement Learning and Planning
Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms
Effective Diversity in Population-Based Reinforcement Learning
  • 9
Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning
  • 1
  • Highly Influenced

References

SHOWING 1-10 OF 56 REFERENCES
Learning all optimal policies with multiple criteria
  • 119
The Steering Approach for Multi-Criteria Reinforcement Learning
  • 24
Dynamic preferences in multi-criteria reinforcement learning
  • 101
Tree-based Fitted Q-iteration for Multi-Objective Markov Decision problems
  • 21
Dynamic Weights in Multi-Objective Deep Reinforcement Learning
  • 23
  • Highly Influential
Manifold-based multi-objective policy search with sample reuse
  • 14
Multiobjective Reinforcement Learning: A Comprehensive Overview
  • C. Liu, X. Xu, D. Hu
  • Computer Science
  • IEEE Transactions on Systems, Man, and Cybernetics: Systems
  • 2015
  • 162
Continuous control with deep reinforcement learning
  • 4,494
Parallel reinforcement learning for weighted multi-criteria model with adaptive margin
  • 11