Multi-Agent Meta-Reinforcement Learning: Sharper Convergence Rates with Task Similarity

Mao, W; Qiu, H; Wang, C; Franke, H; Kalbarczyk, Z; Iyer, R; Basar, T

Citation Details

Multi-agent reinforcement learning (MARL) has primarily focused on solving a single task in isolation, while in practice the environment is often evolving, leaving many related tasks to be solved. In this paper, we investigate the benefits of meta-learning in solving multiple MARL tasks collectively. We establish the first line of theoretical results for meta-learning in a wide range of fundamental MARL settings, including learning Nash equilibria in two-player zero-sum Markov games and Markov potential games, as well as learning coarse correlated equilibria in general-sum Markov games. Under natural notions of task similarity, we show that meta-learning achieves provable sharper convergence to various game-theoretical solution concepts than learning each task separately. As an important intermediate step, we develop multiple MARL algorithms with initialization-dependent convergence guarantees. Such algorithms integrate optimistic policy mirror descents with stage-based value updates, and their refined convergence guarantees (nearly) recover the best known results even when a good initialization is unknown. To our best knowledge, such results are also new and might be of independent interest. We further provide numerical simulations to corroborate our theoretical findings. more »

Award ID(s):: 2029049

PAR ID:: 10546464

Author(s) / Creator(s):: Mao, W; Qiu, H; Wang, C; Franke, H; Kalbarczyk, Z; Iyer, R; Basar, T

Corporate Creator(s):: NeurIPS

Editor(s):: Oh, A; Naumann, T; Globerson, A; Saenko, K; Hardt, M; Levine, S

Publisher / Repository:: NeurIPS

Date Published:: 2024-04-01

Edition / Version:: 1

Volume:: 1

Issue:: 1

ISBN:: 9781713899921

Subject(s) / Keyword(s):: multi-agent reinforcement learning, convergence, meta-learning

Format(s):: Medium: X Size: 284 kb Other: pdf

Size(s):: 284 kb

Location:: New Orleans, LA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
The DOI is not currently available.

More Like this