Chiplet-Gym: Optimizing Chiplet-Based AI Accelerator Design With Reinforcement Learning

Mishty, Kaniz  (ORCID:0009000346236940); Sadi, Mehdi  (ORCID:0000000298203695)

doi:10.1109/TC.2024.3457740

Citation Details

Chiplet-Gym: Optimizing Chiplet-Based AI Accelerator Design With Reinforcement Learning

Not AvailableModern Artificial Intelligence (AI) workloads demand computing systems with large silicon area to sustain throughput and competitive performance. However, prohibitive manufacturing costs and yield limitations at advanced tech nodes and die-size reaching the reticle limit restrain us from achieving this. With the recent innovations in advanced packaging technologies, chiplet-based architectures have gained significant attention in the AI hardware domain. However, the vast design space of chiplet-based AI accelerator design and the absence of system and package-level co-design methodology make it difficult for the designer to find the optimum design point regarding Power, Performance, Area, and manufacturing Cost (PPAC). This paper presents Chiplet-Gym, a Reinforcement Learning (RL)-based optimization framework to explore the vast design space of chiplet-based AI accelerators, encompassing the resource allocation, placement, and packaging architecture. We analytically model the PPAC of the chiplet-based AI accelerator and integrate it into an OpenAI gym environment to evaluate the design points. We also explore non-RL-based optimization approaches and combine these two approaches to ensure the robustness of the optimizer. The optimizer-suggested design point achieves 1.52× throughput, 0.27× energy, and 0.89× cost of its monolithic counterpart at iso-area. more »

Award ID(s):: 2153394

PAR ID:: 10638459

Author(s) / Creator(s):: Mishty, Kaniz ; Sadi, Mehdi

Publisher / Repository:: IEEE

Date Published:: 2025-01-01

Journal Name:: IEEE Transactions on Computers

Volume:: 74

Issue:: 1

ISSN:: 0018-9340

Page Range / eLocation ID:: 43 to 56

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1109/TC.2024.3457740

More Like this