Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient
- Shihui Li, Yi Wu, Xinyue Cui, Honghua Dong, Fei Fang, Stuart J. Russell
- Computer ScienceAAAI Conference on Artificial Intelligence
- 17 July 2019
This paper proposes a new algorithm, MiniMax Multi-agent Deep Deterministic Policy Gradient (M3DDPG) with the following contributions: a minimax extension of the popular multi-agent deep deterministic policy gradient algorithm (MADDPG), for robust policy learning; and a Multi-Agent Adversarial Learning (MAAL) to efficiently solve the proposed formulation.
Artificial Intelligence for Social Good: A Survey
- Zheyuan Ryan Shi, Claire Wang, Fei Fang
- Computer SciencearXiv.org
- 7 January 2020
This work quantitatively analyzes the distribution and trend of the AI4SG literature in terms of application domains and AI techniques used and proposes three conceptual methods to systematically group the existing literature and analyze the eight AI4 SG application domains in a unified framework.
"A Game of Thrones": When Human Behavior Models Compete in Repeated Stackelberg Security Games
- Debarun Kar, Fei Fang, F. D. Fave, Nicole D. Sintov, Milind Tambe
- Computer ScienceAdaptive Agents and Multi-Agent Systems
- 4 May 2015
A new human behavior model, SHARP, which mitigates these three limitations as follows: (i) SHARP reasons based on success or failure of the adversary's past actions on exposed portions of the attack surface to model adversary adaptiveness; (ii)SHARP reasons about similarity between exposed and unexposed areas of the attacked surface, and incorporates a discounting parameter to mitigate adversary's lack of exposure to enough of theattack surface.
Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning
- Qian Long, Zihan Zhou, A. Gupta, Fei Fang, Yi Wu, Xiaolong Wang
- Computer ScienceInternational Conference on Learning…
- 23 March 2020
EPC is introduced, a curriculum learning paradigm that scales up Multi-Agent Reinforcement Learning (MARL) by progressively increasing the population of training agents in a stage-wise manner and uses an evolutionary approach to fix an objective misalignment issue throughout the curriculum.
When Security Games Go Green: Designing Defender Strategies to Prevent Poaching and Illegal Fishing
- Fei Fang, P. Stone, Milind Tambe
- Computer ScienceInternational Joint Conference on Artificial…
- 25 July 2015
Green Security Games (GSGs) is introduced, a novel game model for green security domains with a generalized Stackelberg assumption; algorithms to plan effective sequential defender strategies are provided; a novel approach to learn adversary models that further improves defender performance is proposed.
Deploying PAWS: Field Optimization of the Protection Assistant for Wildlife Security
- Fei Fang, T. Nguyen, A. Lemieux
- Computer ScienceAAAI Conference on Artificial Intelligence
- 12 February 2016
This paper reports on PAWS’s significant evolution from a proposed decision aid to a regularly deployed application, reporting on the lessons from the first tests in Africa in Spring 2014, through its continued evolution since then, to current regular use in Southeast Asia and plans for future worldwide deployment.
Stackelberg Security Games: Looking Beyond a Decade of Success
- Arunesh Sinha, Fei Fang, Bo An, Christopher Kiekintveld, Milind Tambe
- Computer ScienceInternational Joint Conference on Artificial…
- 1 July 2018
A broad survey of recent technical advances in Stackelberg Security Game and related literature is presented, and the future is highlighted by highlighting the new potential applications and open research problems in SSG.
Optimal patrol strategy for protecting moving targets with multiple mobile resources
- Fei Fang, A. Jiang, Milind Tambe
- Computer ScienceAdaptive Agents and Multi-Agent Systems
- 6 May 2013
This paper focuses on protecting mobile targets that lead to a continuous set of strategies for the players, motivated by several real-world domains including protecting ferries with escorts and protecting refugee supply lines.
What game are we playing? End-to-end learning in normal and extensive form games
- Chun Kai Ling, Fei Fang, J. Z. Kolter
- Computer ScienceInternational Joint Conference on Artificial…
- 7 May 2018
A differentiable, end-to-end learning framework for addressing the relatively under-explored but equally important "inverse" setting, where the parameters of the underlying game are not known to all agents, but must be learned through observations.
Deep Reinforcement Learning for Green Security Games with Real-Time Information
- Yufei Wang, Zheyuan Ryan Shi, Fei Fang
- Computer ScienceAAAI Conference on Artificial Intelligence
- 6 November 2018
This work designs a novel deep reinforcement learning-based algorithm, DeDOL, to compute a patrolling strategy that adapts to the real-time information against a best-responding attacker, and is the first attempt to use Deep Q-Learning for security games.
...
...