In a Stackelberg game, a leader commits to a randomized strategy and a follower chooses their best strategy in response. We consider an extension of a standard Stackelberg game, called a discrete-time dynamic Stackelberg game, that has an underlying state space that affects the leader's rewards and available strategies and evolves in a Markovian manner depending on both the leader's and the follower's selected strategies. Although standard Stackelberg games have been utilized to improve scheduling in security domains, their deployment is often limited by the requirement of complete information about the follower's utility function. In contrast, we consider scenarios where the follower's utility function is unknown to the leader but can be linearly parameterized. Our objective is then to provide an algorithm that prescribes a randomized strategy to the leader at each step of the game based on observations of how the follower responded in previous steps. We design an online learning algorithm that, with high probability, is no-regret, i.e., achieves a regret bound (compared to the best policy in hindsight) that is sublinear in the number of time steps; the degree of sublinearity depends on the number of features representing the follower's utility function. The regret of the proposed learning algorithm is independent of the size of the state space and polynomial in the rest of the parameters of the game. We show that the proposed learning algorithm outperforms existing model-free reinforcement learning approaches.
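As a rough illustration of the learning problem (not the paper's algorithm), the sketch below drops the Markovian state and shows how a linearly parameterized follower utility can be refined from observed best responses. All names (`phi`, `theta_hat`, the epsilon-greedy leader rule, the perceptron-style update) are invented stand-ins for the paper's actual machinery.

```python
import numpy as np

# Minimal sketch: the follower's utility is linear in a known feature map,
# u_f(x, y) = phi[x, y] @ theta_star, with theta_star unknown to the leader.
rng = np.random.default_rng(0)
d, n_leader, n_follower, T, eta = 4, 3, 3, 500, 0.1
theta_star = rng.normal(size=d)                   # unknown to the leader
phi = rng.normal(size=(n_leader, n_follower, d))  # known feature map
R = rng.normal(size=(n_leader, n_follower))       # leader's reward matrix

theta_hat = np.zeros(d)
for t in range(T):
    # Epsilon-greedy stands in for the paper's randomized-strategy rule.
    if rng.random() < 0.1:
        x = int(rng.integers(n_leader))
    else:
        y_pred = np.argmax(phi @ theta_hat, axis=1)        # predicted responses
        x = int(np.argmax(R[np.arange(n_leader), y_pred])) # greedy leader pick

    y = int(np.argmax(phi[x] @ theta_star))                # true best response

    # The observed response implies phi[x, y] @ theta >= phi[x, y'] @ theta;
    # perceptron-style corrections enforce these constraints on the estimate.
    for yp in range(n_follower):
        diff = phi[x, y] - phi[x, yp]
        if diff @ theta_hat < 0:
            theta_hat += eta * diff
```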
Encouraging Inferable Behavior for Autonomy: Repeated Bimatrix Stackelberg Games with Observations
When interacting with other non-competitive decision-making agents, it is critical for an autonomous agent to have inferable behavior: its actions must convey its intention and strategy. For example, an autonomous car's strategy must be inferable by the pedestrians interacting with the car. We model the inferability problem using a repeated bimatrix Stackelberg game with observations, in which a leader and a follower repeatedly interact. During the interactions, the leader uses a fixed, potentially mixed strategy. The follower, on the other hand, does not know the leader's strategy and reacts dynamically based on observations of the leader's previous actions. In this setting, the leader may suffer an inferability loss, i.e., a performance loss compared to the setting where the follower has perfect information about the leader's strategy. We show that the inferability loss is upper-bounded by a function of the number of interactions and the stochasticity level of the leader's strategy, encouraging the use of inferable strategies with lower stochasticity levels. As a converse result, we also provide a game where the required number of interactions is lower-bounded by a function of the desired inferability loss.
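A small simulation can make the inferability loss concrete. In the hypothetical setup below, the follower best-responds to the empirical frequency of the leader's past actions rather than to the true mixed strategy `p`; the payoff matrices, sizes, and smoothing are invented for the example.

```python
import numpy as np

# Toy repeated bimatrix Stackelberg game with observations.
rng = np.random.default_rng(1)
m, n, T = 3, 3, 200
A = rng.random((m, n))            # leader payoff matrix (invented)
B = rng.random((m, n))            # follower payoff matrix (invented)
p = np.array([0.7, 0.2, 0.1])     # leader's fixed mixed strategy

# Perfect-information benchmark: follower best-responds to p itself.
y_star = int(np.argmax(p @ B))
benchmark = p @ A[:, y_star]

# With observations: follower best-responds to the empirical frequency
# of the leader's past actions, which only converges to p over time.
counts = np.ones(m)               # Laplace-smoothed initial belief
total = 0.0
for t in range(T):
    p_hat = counts / counts.sum()
    y = int(np.argmax(p_hat @ B))             # follower's reaction
    x = rng.choice(m, p=p)                    # leader samples its strategy
    counts[x] += 1
    total += A[x, y]

inferability_loss = benchmark - total / T     # shrinks as T grows
print(inferability_loss)
```

Less stochastic leader strategies make `p_hat` converge faster, which is the intuition behind the upper bound in the abstract.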
- PAR ID: 10585017
- Publisher / Repository: IEEE
- Date Published:
- Journal Name: Proceedings of the American Control Conference
- ISSN: 2378-5861
- ISBN: 979-8-3503-8265-5
- Page Range / eLocation ID: 360 to 366
- Format(s): Medium: X
- Location: Toronto, ON, Canada
- Sponsoring Org: National Science Foundation
More Like this
We study the problem of online learning in a two-player decentralized cooperative Stackelberg game. In each round, the leader first takes an action, followed by the follower, who takes their action after observing the leader's move. The goal of the leader is to learn to minimize the cumulative regret based on the history of interactions. Differing from the traditional formulation of repeated Stackelberg games, we assume the follower is omniscient, with full knowledge of the true reward, and that they always best-respond to the leader's actions. We analyze the sample complexity of regret minimization in this repeated Stackelberg game. We show that, depending on the reward structure, the existence of the omniscient follower may change the sample complexity drastically, from constant to exponential, even for linear cooperative Stackelberg games.
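The interaction protocol can be sketched as follows. This toy simulation only illustrates the setting (an omniscient follower best-responding to each leader action); it is not the paper's sample-complexity analysis, and the bandit-style leader update is an invented placeholder.

```python
import numpy as np

# Toy decentralized cooperative Stackelberg loop with an omniscient follower.
rng = np.random.default_rng(2)
m, n, T = 4, 4, 2000
R = rng.random((m, n))                    # shared (cooperative) reward, invented

est = np.zeros(m)                         # leader's per-action value estimates
cnt = np.zeros(m)
for t in range(T):
    # Epsilon-greedy leader: a placeholder, not the paper's learning rule.
    x = int(rng.integers(m)) if rng.random() < 0.1 else int(np.argmax(est))
    y = int(np.argmax(R[x]))              # omniscient follower best-responds
    r = R[x, y] + 0.1 * rng.normal()      # leader observes a noisy reward
    cnt[x] += 1
    est[x] += (r - est[x]) / cnt[x]       # incremental mean update
```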
Traditionally, in the bilevel optimization framework, a leader chooses her actions by solving an upper-level problem, assuming that a follower chooses an optimal reaction by solving a lower-level problem. However, in many settings the lower-level problems are nontrivial and require tailored algorithms for their solution. More importantly, in practice such problems may be solved only inexactly by heuristics and approximation algorithms. Motivated by this consideration, we study a broad class of bilevel optimization problems where the follower might not optimally react to the leader's actions. In particular, we present a modeling framework in which the leader considers that the follower might use one of a number of known algorithms to solve the lower-level problem, either approximately or heuristically. Thus, the leader can hedge against the follower's use of suboptimal solutions. We provide algorithmic implementations of the framework for a class of nonlinear bilevel knapsack problems (BKPs), and we illustrate the potential impact of incorporating this realistic feature through numerical experiments in the context of defender-attacker problems.
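A minimal sketch of the hedging idea, with invented data: the follower may solve the lower-level knapsack exactly or with a density-greedy heuristic, and an interdicting leader plans against the worse of the two outcomes. The `damage` vector and the one-item-blocking action set are hypothetical simplifications of the paper's BKP setting.

```python
from itertools import combinations

values  = [8, 11, 6, 4]   # follower's item values (invented)
weights = [5, 7, 4, 3]    # item weights (invented)
damage  = [9, 3, 7, 5]    # harm to the leader if the follower takes item i
CAP = 10

def exact(avail):
    # Brute-force optimal knapsack over the items the leader left available.
    best, best_set = 0, ()
    for r in range(len(avail) + 1):
        for S in combinations(avail, r):
            if sum(weights[i] for i in S) <= CAP:
                v = sum(values[i] for i in S)
                if v > best:
                    best, best_set = v, S
    return best_set

def greedy(avail):
    # Density-ordered greedy heuristic the follower might use instead.
    S, w = [], 0
    for i in sorted(avail, key=lambda i: -values[i] / weights[i]):
        if w + weights[i] <= CAP:
            S.append(i)
            w += weights[i]
    return S

def worst_damage(blocked):
    avail = [i for i in range(len(values)) if i != blocked]
    # Hedge over the follower's candidate algorithms: plan for whichever
    # resulting selection hurts the leader most.
    return max(sum(damage[i] for i in alg(avail)) for alg in (exact, greedy))

best_block = min(range(len(values)), key=worst_damage)  # leader's interdiction
```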
Addressing the urgent global challenge of man-made greenhouse gas emissions and climate change necessitates collaborative action between shipping lines and government regulatory agencies. Aligning with the International Maritime Organization's emissions reduction strategy, this paper presents a novel bi-level programming model that unifies these stakeholders. On the upper level of the proposed bi-level model, a number of shipping lines optimize retrofitting plans for their vessels to maximize economic benefits. On the lower level, the regulatory agency responds to the carbon reduction efforts by setting retrofitting subsidies and emission penalty rates. This framework represents a multi-leader-single-follower game involving shipping lines and the regulatory agency, and its equilibrium is determined through an equilibrium problem with equilibrium constraints (EPEC). The EPEC comprises multiple single-leader-follower problems, each of which can be formulated as a mathematical program with equilibrium constraints (MPEC). The diagonalization method (DM) is employed for its solution. Simulation studies performed based on a ten-year planning period show that the proposed approach can effectively promote vessel retrofitting and the use of green fuels, leading to an annual emission reduction of over 50%.
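The diagonalization loop itself is simple to sketch. In the hypothetical version below, each leader's MPEC is collapsed to a one-dimensional grid search over a retrofit level, and the regulator's response is collapsed to a closed-form rule; all functional forms and parameters are invented for illustration.

```python
import numpy as np

def regulator(total_retrofit):
    # Stand-in follower response: subsidy shrinks and the penalty rate
    # rises as the fleet-wide retrofit level grows (invented forms).
    subsidy = max(0.0, 1.0 - total_retrofit)
    penalty = 0.5 + total_retrofit
    return subsidy, penalty

def line_profit(x_i, x_others, cost=0.8):
    subsidy, penalty = regulator(x_i + x_others)
    # Retrofitting costs money, earns subsidy, and avoids emission penalties.
    return subsidy * x_i - cost * x_i**2 - penalty * (1.0 - x_i)

n_lines, grid = 3, np.linspace(0.0, 1.0, 101)
x = np.zeros(n_lines)                     # each line's retrofit level
for sweep in range(50):
    x_prev = x.copy()
    for i in range(n_lines):
        others = x.sum() - x[i]
        # Each leader's MPEC is reduced here to a 1-D grid search.
        x[i] = grid[np.argmax([line_profit(g, others) for g in grid])]
    if np.max(np.abs(x - x_prev)) < 1e-6:
        break    # Gauss-Seidel sweep converged to an equilibrium candidate
```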
Learning Environment Constraints in Collaborative Robotics: A Decentralized Leader-Follower Approach

In this paper, we propose a leader-follower hierarchical strategy for two robots collaboratively transporting an object in a partially known environment with obstacles. Both robots sense the local surrounding environment and react to obstacles in their proximity. We consider no explicit communication, so the local environment information and the control actions are not shared between the robots. At any given time step, the leader solves a model predictive control (MPC) problem with its known set of obstacles and plans a feasible trajectory to complete the task. The follower estimates the inputs of the leader and uses a policy to assist the leader while reacting to obstacles in its proximity. The leader infers obstacles in the follower's vicinity by using the difference between the predicted and the real-time estimated follower control action. A method to switch the leader-follower roles is used to improve the control performance in tight environments. The efficacy of our approach is demonstrated with detailed comparisons to two alternative strategies, where it achieves the highest success rate while completing the task fastest.
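As a highly simplified sketch of the communication-free estimation step, the follower below back-solves the leader's previous input from the carried object's observed displacement. Single-integrator dynamics, the goal-seeking leader rule, and the assist gain are invented stand-ins for the paper's MPC formulation.

```python
import numpy as np

dt = 0.1
pos = np.zeros(2)                 # object position
prev_pos = pos.copy()
goal = np.array([5.0, 0.0])
u_f_prev = np.zeros(2)            # follower's own previous input

for t in range(200):
    # Leader's input (unknown to the follower): head toward the goal.
    u_leader = 0.5 * (goal - pos) / max(np.linalg.norm(goal - pos), 1e-6)

    # Follower: recover the leader's previous input from the object's
    # displacement, after removing its own prior contribution.
    net = (pos - prev_pos) / dt            # net velocity of the object
    u_leader_est = 2.0 * net - u_f_prev    # = leader's input one step ago
    u_f = 0.8 * u_leader_est               # assist along the estimated intent

    prev_pos = pos.copy()
    pos = pos + dt * (u_leader + u_f) / 2.0   # object moves under both inputs
    u_f_prev = u_f
```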