Asynchronous Upper Confidence Bound Algorithms for Federated Linear Bandits

Li, Chuanhao; Wang, Hongning

Citation Details

Linear contextual bandit is a popular online learning problem. It has been mostly studied in centralized learning settings. With the surging demand of large-scale decentralized model learning, e.g., federated learning, how to retain regret minimization while reducing communication cost becomes an open challenge. In this paper, we study linear contextual bandit in a federated learning setting. We propose a general framework with asynchronous model update and communication for a collection of homogeneous clients and heterogeneous clients, respectively. Rigorous theoretical analysis is provided about the regret and communication cost under this distributed learning framework; and extensive empirical evaluations demonstrate the effectiveness of our solution. more »

Award ID(s):: 1838615 2128019 1553568

PAR ID:: 10381225

Author(s) / Creator(s):: Li, Chuanhao; Wang, Hongning

Editor(s):: Camps-Valls, Gustau; Ruiz, Francisco J.; Valera, Isabel

Date Published:: 2022-03-28

Journal Name:: Proceedings of The 25th International Conference on Artificial Intelligence and Statistics

Volume:: 151

Page Range / eLocation ID:: 6529-6553

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this