Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning

Xueguang Lyu, Yuchen Xiao

Citation Details

Centralized Training for Decentralized Execution, where agents are trained offline using centralized information but execute in a decentralized manner online, has gained popularity in the multi-agent reinforcement learning community. In particular, actor-critic methods with a centralized critic and decentralized actors are a common instance of this idea. However, the implications of using a centralized critic in this context are not fully discussed and understood even though it is the standard choice of many algorithms. We therefore formally analyze centralized and decentralized critic approaches, providing a deeper understanding of the implications of critic choice. Because our theory makes unrealistic assumptions, we also empirically compare the centralized and decentralized critic methods over a wide set of environments to validate our theories and to provide practical advice. We show that there exist misconceptions regarding centralized critics in the current literature and show that the centralized critic design is not strictly beneficial, but rather both centralized and decentralized critics have different pros and cons that should be taken into account by algorithm designers more »

Award ID(s):: 2044993

PAR ID:: 10315762

Author(s) / Creator(s):: Xueguang Lyu, Yuchen Xiao

Date Published:: 2021-01-01

Journal Name:: Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems

ISSN:: 1558-2914

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this