Function Approximation for Solving Stackelberg Equilibrium in Large Perfect Information Games

Ling, Chun Kai; Kolter, J. Zico; Fang, Fei

doi:10.1609/aaai.v37i5.25715

Citation Details

This content will become publicly available on June 27, 2024

Function Approximation for Solving Stackelberg Equilibrium in Large Perfect Information Games

Function approximation (FA) has been a critical component in solving large zero-sum games. Yet, little attention has been given towards FA in solving general-sum extensive-form games, despite them being widely regarded as being computationally more challenging than their fully competitive or cooperative counterparts. A key challenge is that for many equilibria in general-sum games, no simple analogue to the state value function used in Markov Decision Processes and zero-sum games exists. In this paper, we propose learning the Enforceable Payoff Frontier (EPF)---a generalization of the state value function for general-sum games. We approximate the optimal Stackelberg extensive-form correlated equilibrium by representing EPFs with neural networks and training them by using appropriate backup operations and loss functions. This is the first method that applies FA to the Stackelberg setting, allowing us to scale to much larger games while still enjoying performance guarantees based on FA error. Additionally, our proposed method guarantees incentive compatibility and is easy to evaluate without having to depend on self-play or approximate best-response oracles.

Award ID(s):: 2046640

NSF-PAR ID:: 10490632

Author(s) / Creator(s):: Ling, Chun Kai; Kolter, J. Zico; Fang, Fei

Publisher / Repository:: AAAI

Date Published:: 2023-06-27

Journal Name:: Proceedings of the AAAI Conference on Artificial Intelligence

Volume:: 37

Issue:: 5

ISSN:: 2159-5399

Page Range / eLocation ID:: 5764 to 5772

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on June 27, 2024
Journal Article:
https://doi.org/10.1609/aaai.v37i5.25715

More Like this