
Convex-Concave Min-Max Stackelberg Games

Denizalp Goktas and Amy Greenwald
Published in: Neural Information Processing Systems

Min-max optimization problems (i.e., min-max games) have been attracting a great deal of attention because of their applicability to a wide range of machine learning problems. Although significant progress has been made recently, the literature to date has focused on games with independent strategy sets; little is known about solving games with dependent strategy sets, which can be interpreted as min-max Stackelberg games. We introduce two first-order methods that solve a large class of convex… 
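As a rough sketch of what "dependent strategy sets" means here, one common way to write such a problem is shown below. The symbols $X$, $Y$, $f$, and $g$ are assumptions introduced for illustration; they do not appear in the truncated abstract above.

```latex
% Independent strategy sets: the inner player's feasible set is fixed.
\min_{x \in X} \; \max_{y \in Y} \; f(x, y)

% Dependent strategy sets (min-max Stackelberg game): the inner player's
% feasible set depends on the outer player's choice x through a
% constraint function g (assumed notation).
\min_{x \in X} \; \max_{y \in Y \,:\, g(x, y) \ge 0} \; f(x, y)
```

In the second form the outer player moves first and the inner player best-responds within the set that $x$ leaves feasible, which is why the problem can be read as a Stackelberg game.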


Zero-Sum Stochastic Stackelberg Games

This paper proves the existence of recursive (i.e., Markov perfect) Stackelberg equilibria (recSE) in zero-sum stochastic games, provides necessary and sufficient conditions for a policy to be a recSE, and shows that recSE can be computed in (weakly) polynomial time via value iteration.

Robust No-Regret Learning in Min-Max Stackelberg Games

This paper investigates the behavior of no-regret learning in min-max games with dependent strategy sets, and shows that if both players minimize their Lagrangian regrets, then play converges to a Stackelberg equilibrium.

Gradient Descent Ascent in Min-Max Stackelberg Games

It is shown that solving Fisher markets, a canonical example of a min-max Stackelberg game, using a novel algorithm, corresponds to buyers and sellers using myopic best-response dynamics in a repeated market, allowing the convergence of these dynamics in $O(1/\varepsilon^2)$ iterations in Fisher markets.

Exploitability Minimization in Games and Beyond

The exploitability-minimization problem can be recast as a min-max optimization problem, and polynomial-time first-order methods are obtained to compute a refinement of GNE, namely the variational equilibria (VE), in convex-concave cumulative regret pseudo-games with jointly convex constraints.

Synthesizing Reactive Test Environments for Autonomous Systems: Testing Reach-Avoid Specifications with Multi-Commodity Flows

An optimization problem, framed as a multi-commodity network flow problem, solves for constraints on the virtual product graph, which can then be projected to the test environment. The result of the optimization problem is a reactive test synthesis that ensures that the system meets the test specifications while also satisfying the system specifications.

Learning Stackelberg Equilibria and Applications to Economic Design Games

We study the use of reinforcement learning to learn the optimal leader’s strategy in Stackelberg games. Learning a leader’s strategy has an innate stationarity problem—when optimizing the leader’s

Learning Autonomous Vehicle Safety Concepts from Demonstrations

A data-driven AV safety design methodology is proposed that first learns “reasonable” behavioral assumptions from data, and then synthesizes an AV safety concept using these learned behavioral assumptions.

Examining Responsibility and Deliberation in AI Impact Statements and Ethics Reviews

The artificial intelligence research community is continuing to grapple with the ethics of its work by encouraging researchers to discuss potential positive and negative consequences. Neural

A Survey of Decision Making in Adversarial Games

This paper provides a systematic survey of three main game models widely employed in adversarial games, i.e., zero-sum normal-form and extensive-form games, Stackelberg (security) games, and zero-sum differential games, from an array of perspectives, including basic knowledge of game models, (approximate) equilibrium concepts, problem classifications, research frontiers, (assumed) optimal strategy seeking techniques, prevailing algorithms, and practical applications.

Minimally Constrained Testing for Autonomy with Temporal Logic Specifications

This paper considers a subset of Linear Temporal Logic to represent formal requirements on the system and the test environment, and presents a framework to construct a minimally constrained test.



Near-Optimal Algorithms for Minimax Optimization

The first algorithm with $\tilde{O}(\sqrt{\kappa_{\mathbf x}\kappa_{\mathbf y}})$ gradient complexity is presented, matching the lower bound up to logarithmic factors.

On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems

This is the first nonasymptotic analysis for two-time-scale GDA in this setting, shedding light on its superior practical performance in training generative adversarial networks (GANs) and other real applications.
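For readers unfamiliar with the term, two-time-scale gradient descent ascent (GDA) simply runs simultaneous descent and ascent steps with different step sizes for the two players. The sketch below is a minimal illustration under stated assumptions: the toy objective $f(x, y) = \tfrac{1}{2}x^2 + xy - \tfrac{1}{2}y^2$, the step sizes, and the function names are all hypothetical choices, and the objective is convex-concave rather than the nonconvex-concave setting the summary refers to.

```python
# Toy objective (an assumption for illustration, not from the cited paper):
#   f(x, y) = 0.5 * x**2 + x * y - 0.5 * y**2
# It is convex in x and concave in y, with a saddle point at (0, 0).

def grad_x(x, y):
    # Partial derivative of f with respect to x.
    return x + y

def grad_y(x, y):
    # Partial derivative of f with respect to y.
    return x - y

def two_time_scale_gda(x, y, eta_x=0.01, eta_y=0.1, steps=2000):
    """Simultaneous GDA: the min player (x) uses a smaller step size
    than the max player (y). Step sizes are hypothetical."""
    for _ in range(steps):
        gx, gy = grad_x(x, y), grad_y(x, y)
        x, y = x - eta_x * gx, y + eta_y * gy  # descent in x, ascent in y
    return x, y

x_final, y_final = two_time_scale_gda(1.0, -1.0)
```

On this toy problem both iterates shrink toward the saddle point at the origin; the behavior on nonconvex problems can differ substantially.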

Block Alternating Optimization for Non-convex Min-max Problems: Algorithms and Applications in Signal Processing and Communications

This work proposes two simple algorithms, which alternatingly perform one gradient descent-type step for each minimization block and one gradient ascent-type step for the maximization problem, and shows that such simple alternating min-max algorithms converge to first-order stationary solutions.

Variational Analysis

Tatonnement beyond gross substitutes?: gradient descent to the rescue

A class of markets for which tatonnement is equivalent to gradient descent is defined and all processes in this family converge to an equilibrium for any Convex Potential Function (CPF) market.

Minimax Problems with Coupled Linear Constraints: Computational Complexity, Duality and Solution Methods

This work studies a special minimax problem where there are linear constraints that couple both the minimization and maximization decision variables, and shows that the considered problem is challenging, in the sense that it violates the classical max-min inequality.
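A small numerical illustration of how a coupling constraint can break the max-min inequality is sketched below. The example is an assumption made for illustration and is not taken from the cited work: it uses $f(x, y) = y - x$ on $[0, 1]^2$ with the coupled linear constraint $x + y \le 1$, evaluated on an integer grid to avoid floating-point feasibility issues.

```python
# Toy example (hypothetical, not from the cited paper):
#   f(x, y) = y - x,  x, y in [0, 1],  coupled constraint x + y <= 1.
N = 10  # grid resolution; x = i / N and y = j / N

def f(i, j):
    # Objective evaluated at the grid point (i / N, j / N).
    return (j - i) / N

# Outer min over x, inner max over the y values feasible for that x.
min_max = min(
    max(f(i, j) for j in range(N + 1) if i + j <= N)
    for i in range(N + 1)
)

# Outer max over y, inner min over the x values feasible for that y.
max_min = max(
    min(f(i, j) for i in range(N + 1) if i + j <= N)
    for j in range(N + 1)
)

print(min_max, max_min)
```

Here the max-min value exceeds the min-max value, which cannot happen when the two players' feasible sets are independent.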

What is Local Optimality in Nonconvex-Nonconcave Minimax Optimization?

A proper mathematical definition of local optimality for this sequential setting, local minimax, is proposed, and its properties and existence results are presented.

Generalized Nash Equilibrium Problems

The Generalized Nash Equilibrium Problem is an important model that has its roots in the economic sciences but is being fruitfully used in many different fields and its main properties and solution algorithms are discussed.


A. Wald has presented a model of production and a model of exchange and proofs of the existence of an equilibrium for each of them. Here proofs of the existence of an equilibrium are given for an

Envelope Theorems for Arbitrary Choice Sets

The standard envelope theorems apply to choice sets with convex and topological structure, providing sufficient conditions for the value function to be differentiable in a parameter and