• Corpus ID: 52896216

Interactive Agent Modeling by Learning to Probe

@article{Shu2018InteractiveAM,
  title={Interactive Agent Modeling by Learning to Probe},
  author={Tianmin Shu and Caiming Xiong and Ying Nian Wu and Song-Chun Zhu},
  journal={ArXiv},
  year={2018},
  volume={abs/1810.00510}
}
The ability of modeling the other agents, such as understanding their intentions and skills, is essential to an agent's interactions with other agents. Conventional agent modeling relies on passive observation from demonstrations. In this work, we propose an interactive agent modeling scheme enabled by encouraging an agent to learn to probe. In particular, the probing agent (i.e. a learner) learns to interact with the environment and with a target agent (i.e., a demonstrator) to maximize the… 
Towards Continual Reinforcement Learning: A Review and Perspectives
TLDR
A taxonomy of different continual RL formulations and mathematically characterize the non-stationary dynamics of each setting is provided, providing an overview of benchmarks used in the literature and important metrics for understanding agent performance.
Modeling Conceptual Understanding in Image Reference Games
TLDR
This work presents an image reference game between a speaker and a population of listeners where reasoning about the concepts other agents can comprehend is necessary and a model formulation with this capability, and suggests that the learner indeed encodes information directly pertaining to the understanding of other agents.

References

SHOWING 1-10 OF 41 REFERENCES
Learning with Opponent-Learning Awareness
TLDR
Results show that the encounter of two LOLA agents leads to the emergence of tit-for-tat and therefore cooperation in the iterated prisoners' dilemma, while independent learning does not, and LOLA also receives higher payouts compared to a naive learner, and is robust against exploitation by higher order gradient-based methods.
The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems
TLDR
This work distinguishes reinforcement learners that are unaware of (or ignore) the presence of other agents from those that explicitly attempt to learn the value of joint actions and the strategies of their counterparts, and proposes alternative optimistic exploration strategies that increase the likelihood of convergence to an optimal equilibrium.
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
TLDR
An algorithm is described, based on approximate best responses to mixtures of policies generated using deep reinforcement learning, and empirical game-theoretic analysis to compute meta-strategies for policy selection, which generalizes previous ones such as InRL.
Extending Q-Learning to General Adaptive Multi-Agent Systems
TLDR
This paper proposes a fundamentally different approach to Q-Learning, dubbed Hyper-Q, in which values of mixed strategies rather than base actions are learned, and in which other agents' strategies are estimated from observed actions via Bayesian inference.
Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning
TLDR
This paper proposes a novel framework for efficient multi-task reinforcement learning that trains agents to employ hierarchical policies that decide when to use a previously learned policy and when to learn a new skill.
Curiosity-Driven Exploration by Self-Supervised Prediction
TLDR
This work forms curiosity as the error in an agent's ability to predict the consequence of its own actions in a visual feature space learned by a self-supervised inverse dynamics model, which scales to high-dimensional continuous state spaces like images, bypasses the difficulties of directly predicting pixels, and ignores the aspects of the environment that cannot affect the agent.
One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning
TLDR
This work presents an approach for one-shot learning from a video of a human by using human and robot demonstration data from a variety of previous tasks to build up prior knowledge through meta-learning, then combining this prior knowledge and only a single video demonstration from a human, the robot can perform the task that the human demonstrated.
A Comprehensive Survey of Multiagent Reinforcement Learning
TLDR
The benefits and challenges of MARL are described along with some of the problem domains where the MARL techniques have been applied, and an outlook for the field is provided.
Markov Games as a Framework for Multi-Agent Reinforcement Learning
...
...