Hardware Compute Partitioning on NVIDIA GPUs*

Bakita, Joshua; Anderson, James H.

doi:10.1109/RTAS58335.2023.00012

Citation Details

Hardware Compute Partitioning on NVIDIA GPUs*

Embedded and autonomous systems are increasingly integrating AI/ML features, often enabled by a hardware accelerator such as a GPU. As these workloads become increasingly demanding, but size, weight, power, and cost constraints remain unyielding, ways to increase GPU capacity are an urgent need. In this work, we provide a means by which to spatially partition the computing units of NVIDIA GPUs transparently, allowing oft-idled capacity to be reclaimed via safe and effcient GPU sharing. Our approach works on any NVIDIA GPU since 2013, and can be applied via our easy-to-use, user-space library titled libsmctrl. We back the design of our system with deep investigations into the hardware scheduling pipeline of NVIDIA GPUs. We provide guidelines for the use of our system, and demonstrate it via an object detection case study using YOLOv2. more »

Award ID(s):: 2038855 1837337

PAR ID:: 10480322

Author(s) / Creator(s):: Bakita, Joshua; Anderson, James H.

Publisher / Repository:: IEEE

Date Published:: 2023-05-01

Journal Name:: Proceedings of the 29th IEEE Real-Time and Embedded Technology and Applications Symposium

ISBN:: 979-8-3503-2176-0

Page Range / eLocation ID:: 54 to 66

Format(s):: Medium: X

Location:: San Antonio, TX, USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/RTAS58335.2023.00012

More Like this