Scalable spectral representations for multiagent reinforcement learning in network MDPs

Ren, Zhaolin; Zhang, Runyu; Dai, Bo; Li, Na

This content will become publicly available on May 3, 2026

Network Markov Decision Processes (MDPs), the de facto model for multi-agent control, pose a significant challenge to efficient learning because the global state-action space grows exponentially with the number of agents. In this work, utilizing the exponential decay property of network dynamics, we first derive scalable spectral local representations for multiagent reinforcement learning in network MDPs, which induce a network linear subspace for the local $Q$-function of each agent. Building on these local spectral representations, we design a scalable algorithmic framework for multiagent reinforcement learning in continuous state-action network MDPs, and provide end-to-end guarantees for the convergence of our algorithm. Empirically, we validate the effectiveness of our scalable representation-based approach on two benchmark problems, and demonstrate its advantages over generic function approximation approaches to representing the local $Q$-functions.

Award ID(s): 2328241

PAR ID: 10630949

Author(s) / Creator(s): Ren, Zhaolin; Zhang, Runyu; Dai, Bo; Li, Na

Editor(s): Li, Yingzhen; Mandt, Stephan; Agrawal, Shipra; Khan, Emtiyaz

Publisher / Repository: Proceedings of Machine Learning Research

Date Published: 2025-05-03

Volume: 258

Page Range / eLocation ID: 550-558

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Proceeding:
The DOI is not currently available.