Online Learning in Stackelberg Games with an Omniscient Follower

Zhao, Geng; Zhu, Banghua; Jiao, Jiantao; Jordan, Michael

Citation Details

We study the problem of online learning in a two-player decentralized cooperative Stackelberg game. In each round, the leader first takes an action, followed by the follower who takes their action after observing the leader’s move. The goal of the leader is to learn to minimize the cumulative regret based on the history of interactions. Differing from the traditional formulation of repeated Stackelberg games, we assume the follower is omniscient, with full knowledge of the true reward, and that they always best-respond to the leader’s actions. We analyze the sample complexity of regret minimization in this repeated Stackelberg game. We show that depending on the reward structure, the existence of the omniscient follower may change the sample complexity drastically, from constant to exponential, even for linear cooperative Stackelberg games. more »

Award ID(s):: 1909499 2211209 1901252

PAR ID:: 10472682

Author(s) / Creator(s):: Zhao, Geng; Zhu, Banghua; Jiao, Jiantao; Jordan, Michael

Publisher / Repository:: Proceedings of Machine Learning Research

Date Published:: 2023-08-01

Journal Name:: Proceedings of Machine Learning Research

ISSN:: 2640-3498

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
The DOI is not currently available.

More Like this