Communication-Free Two-Stage Multi-Agent DDPG under Partial States and Observations

Cho, Joohyun; Liu, Mingxi; Zhou, Yi; Chen, Rong-Rong

doi:10.1109/IEEECONF53345.2021.9723197

Citation Details

In this work, we propose a two-stage multi-agent deep deterministic policy gradient (TS-MADDPG) algorithm for communication-free multi-agent reinforcement learning (MARL) under partial states and observations. In the first stage, we train prototype actor-critic networks using only partial states at the actors. In the second stage, we incorporate the partial observations that result from the prototype actions as side information at the actors to enhance actor-critic training. This side information is useful for inferring the unobserved states and hence can help reduce the performance gap between a network with fully observable states and a partially observable one. Using a case study of building energy control in a power distribution network, we demonstrate that the proposed TS-MADDPG can greatly improve on the performance of single-stage MADDPG algorithms that use partial states only. This is the first work that utilizes partial local voltage measurements as observations to improve MARL performance for a distributed power network.
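The difference between the two stages lies in what each actor receives as input. The following is a minimal Python sketch of that data flow only, not the authors' implementation: the function names, the toy prototype actor, and the toy observation function are all hypothetical stand-ins, and no network training is shown.

```python
from typing import Callable, List, Sequence

Vector = List[float]


def stage1_actor_input(partial_state: Sequence[float]) -> Vector:
    """Stage 1: the actor sees only its own partial state."""
    return list(partial_state)


def stage2_actor_input(
    partial_state: Sequence[float],
    prototype_actor: Callable[[Vector], Vector],
    observe: Callable[[Vector], Vector],
) -> Vector:
    """Stage 2: append the observation produced by the prototype action.

    The prototype actor (trained in stage 1) proposes an action from the
    partial state; the resulting local observation is used as side
    information alongside the partial state.
    """
    prototype_action = prototype_actor(stage1_actor_input(partial_state))
    side_info = observe(prototype_action)
    return list(partial_state) + list(side_info)


# Hypothetical stand-ins, for illustration only.
def toy_prototype_actor(x: Vector) -> Vector:
    # Placeholder for a trained stage-1 actor network.
    return [0.5 * sum(x)]


def toy_observe(action: Vector) -> Vector:
    # Placeholder for a local measurement (e.g. a voltage reading)
    # that depends on the action taken.
    return [1.0 - 0.1 * action[0]]


state = [0.2, 0.4]
stage1_input = stage1_actor_input(state)
stage2_input = stage2_actor_input(state, toy_prototype_actor, toy_observe)
print(len(stage1_input), len(stage2_input))
```

In this sketch the stage-2 input is simply the partial state with the observation appended, so the second-stage actor has a wider input than the prototype. How the paper actually encodes and feeds the side information is not specified in the abstract.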

Award ID(s): 1817154

PAR ID: 10393715

Author(s) / Creator(s): Cho, Joohyun; Liu, Mingxi; Zhou, Yi; Chen, Rong-Rong

Date Published: 2022-03-04

Journal Name: 2021 55th Asilomar Conference on Signals, Systems, and Computers

Page Range / eLocation ID: 459 to 463

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1109/IEEECONF53345.2021.9723197
