Accelerating Multi-Agent DDPG on CPU-FPGA Heterogeneous Platform

Wiggins, Samuel; Meng, Yuan; Kannan, Rajgopal; Prasanna, Viktor

doi:10.1109/HPEC58863.2023.10363567

Citation Details

Accelerating Multi-Agent DDPG on CPU-FPGA Heterogeneous Platform

Multi-Agent Reinforcement Learning (MARL) is a key technology in artificial intelligence applications such as robotics, surveillance, energy systems, etc. Multi-Agent Deep Deterministic Policy Gradient (MADDPG) is a state-of-the-art MARL algorithm that has been widely adopted and considered a popular baseline for novel MARL algorithms. However, existing implementations of MADDPG on CPU and CPU-GPU platforms do not exploit fine-grained parallelism between cooperative agents and handle inter-agent communication sequentially, leading to sub-optimal throughput performance in MADDPG training. In this work, we develop the first high-throughput MADDPG accelerator on a CPU-FPGA heterogeneous platform. Specifically, we develop dedicated hardware modules that enable parallel training of each agent's internal Deep Neural Networks (DNNs) and support low-latency inter-agent communication using an on-chip agent interconnection network. Our experimental results show that the speed performance of agent neural network training improves by a factor of 3.6×−24.3× and 1.5×−29.5× compared with state-of-the-art CPU and CPU-GPU implementations. Our design achieves up to a 1.99× and 1.93× improvement in overall system throughput compared with CPU and CPU-GPU implementations, respectively. more »

Award ID(s):: 2209563 2104264

PAR ID:: 10513024

Author(s) / Creator(s):: Wiggins, Samuel; Meng, Yuan; Kannan, Rajgopal; Prasanna, Viktor

Publisher / Repository:: IEEE

Date Published:: 2023-09-25

ISBN:: 979-8-3503-0860-0

Page Range / eLocation ID:: 1 to 7

Format(s):: Medium: X

Location:: Boston, MA, USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/HPEC58863.2023.10363567

More Like this