Asynchronous Multi-Agent Bandits: Fully Distributed vs . Leader-Coordinated Algorithms

Wang, Xuchuang; Chen, Yu-Zhen Janice; Liu, Xutong; Yang, Lin; Hajiesmaili, Mohammad; Towsley, Don; Lui, John CS

doi:10.1145/3711696

Citation Details

This content will become publicly available on March 6, 2026

Asynchronous Multi-Agent Bandits: Fully Distributed vs . Leader-Coordinated Algorithms

We study the cooperative asynchronous multi-agent multi-armed bandits problem, where each agent's active (arm pulling) decision rounds are asynchronous. That is, in each round, only a subset of agents is active to pull arms, and this subset is unknown and time-varying. We consider two models of multi-agent cooperation, fully distributed and leader-coordinated, and propose algorithms for both models that attain near-optimal regret and communications bounds, both of which are almost as good as their synchronous counterparts. The fully distributed algorithm relies on a novel communication policy consisting of accuracy adaptive and on-demand components, and successive arm elimination for decision-making. For leader-coordinated algorithms, a single leader explores arms and recommends them to other agents (followers) to exploit. As agents' active rounds are unknown, a competent leader must be chosen dynamically. We propose a variant of the Tsallis-INF algorithm with low switches to choose such a leader sequence. Lastly, we report numerical simulations of our new asynchronous algorithms with other known baselines. more »

Award ID(s):: 2325956

PAR ID:: 10591421

Author(s) / Creator(s):: Wang, Xuchuang; Chen, Yu-Zhen Janice; Liu, Xutong; Yang, Lin; Hajiesmaili, Mohammad; Towsley, Don; Lui, John CS

Publisher / Repository:: ACM

Date Published:: 2025-03-06

Journal Name:: Proceedings of the ACM on Measurement and Analysis of Computing Systems

Volume:: 9

Issue:: 1

ISSN:: 2476-1249

Page Range / eLocation ID:: 1 to 39

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on March 6, 2026
Journal Article:
https://doi.org/10.1145/3711696

More Like this