NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Single-Loop Federated Actor-Critic across Heterogeneous Environments

https://doi.org/10.1609/aaai.v39i21.34469

Zhu, Ye; Gong, Xiaowen (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Federated reinforcement learning (FRL) has emerged as a promising paradigm, enabling multiple agents to collaborate and learn a shared policy adaptable across heterogeneous environments. Among the various reinforcement learning (RL) algorithms, the actor-critic (AC) algorithm stands out for its low variance and high sample efficiency. However, little to nothing is known theoretically about AC in a federated manner, especially each agent interacts with a potentially different environment. The lack of such results is attributed to various technical challenges: a two-level structure illustrating the coupling effect between the actor and the critic, heterogeneous environments, Markovian sampling and multiple local updates. In response, we study Single-Loop Federated Actor Critic (SFAC) where agents perform AC learning in a two-level federated manner while interacting with heterogeneous environments. We then provide bounds on the convergence error of SFAC. The results show that the convergence error asymptotically converges to a near-stationary point, with the extent proportional to environment heterogeneity. Moreover, the sample complexity exhibits a linear speed-up through the federation of agents. We evaluate the performance of SFAC through numerical experiments using common RL benchmarks, which demonstrate its effectiveness.
more » « less
Free, publicly-accessible full text available April 11, 2026
Distributed Policy Gradient with Heterogeneous Computations for Federated Reinforcement Learning

https://doi.org/10.1109/CISS56502.2023.10089771

Zhu, Ye; Gong, Xiaowen (March 2023, 2023 57th Annual Conference on Information Sciences and Systems (CISS))

Full Text Available
Truthful Incentive Mechanism for Federated Learning with Crowdsourced Data Labeling

Zhao, Yuxi; Gong, Xiaowen; Mao, Shiwen (January 2023, Proceedings IEEE INFOCOM)

Full Text Available
Quality-Aware Distributed Computation for Cost-Effective Non-Convex and Asynchronous Wireless Federated Learning

https://doi.org/10.23919/WiOpt52861.2021.9589660

Zhao, Yuxi; Gong, Xiaowen (October 2021, 2021 19th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt))

Full Text Available
Quality-Aware Distributed Computation and Communication Scheduling for Fast Convergent Wireless Federated Learning

https://doi.org/10.23919/WiOpt52861.2021.9589802

Li, Dongsheng; Zhao, Yuxi; Gong, Xiaowen (October 2021, 2021 19th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt))

Full Text Available

Search for: All records