Towards Improved Power Management in Cloud GPUs

Patel, Pratyush; Gong, Zibo; Rizvi, Syeda; Choukse, Esha; Misra, Pulkit; Anderson, Thomas; Sriraman, Akshitha

doi:10.1109/LCA.2023.3278652

Citation Details

Towards Improved Power Management in Cloud GPUs

As modern server GPUs are increasingly power intensive, better power management mechanisms can significantly reduce the power consumption, capital costs, and carbon emissions in large cloud datacenters. This letter uses diverse datacenter workloads to study the power management capabilities of modern GPUs. We find that current GPU management mechanisms have limited compatibility and monitoring support under cloud virtualization. They have sub-optimal, imprecise, and non-intuitive implementations of Dynamic Voltage and Frequency Scaling (DVFS) and power capping. Consequently, efficient GPU power management is not widely deployed in clouds today. To address these issues, we make actionable recommendations for GPU vendors and researchers. more »

Award ID(s):: 2104548

PAR ID:: 10505207

Author(s) / Creator(s):: Patel, Pratyush; Gong, Zibo; Rizvi, Syeda; Choukse, Esha; Misra, Pulkit; Anderson, Thomas; Sriraman, Akshitha

Publisher / Repository:: IEEE

Date Published:: 2023-07-01

Journal Name:: IEEE Computer Architecture Letters

Volume:: 22

Issue:: 2

ISSN:: 1556-6056

Page Range / eLocation ID:: 141 to 144

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1109/LCA.2023.3278652

More Like this