Simulation Support for Fast and Accurate Large-Scale GPGPU & Accelerator Workloads

Ramadas, Vishnu; Poremba, Matthew; Beckmann, Bradford; Sinclair, Matthew D

Citation Details

In recent years deep neural networks (DNNs) have emerged as an important application domain driving the requirements for future systems. As DNNs get more sophisticated, their compute requirements and the datasets they are trained on continue to grow at a fast rate. For example, Gholami showed that compute in Transformer networks grew 750X over 2 years, while other work projects DNN compute and memory requirements to grow by 1.5X per year. Given their growing requirements and importance, heterogeneous systems often add machine learning (ML) specific features (e.g., TensorCores) to improve their efficiency. However, given ML’s voracious rate of growth and size, there is a growing challenge in performing early-system exploration based on sound simulation methodology. In this work we discuss our efforts to enhance gem5’s support to make these workloads practical to run while retaining accuracy. more »

Award ID(s):: 2311889

PAR ID:: 10542853

Author(s) / Creator(s):: Ramadas, Vishnu; Poremba, Matthew; Beckmann, Bradford; Sinclair, Matthew D

Publisher / Repository:: 3rd Open-Source Computer Architecture Research Workshop (OSCAR)

Date Published:: 2024-06-29

Subject(s) / Keyword(s):: simulation GPGPU ML gem5

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Workshop Report:
The DOI is not currently available.

More Like this