GCAPS: GPU Context-Aware Preemptive Priority-Based Scheduling for Real-Time Tasks

Wang, Yidi; Liu, Cong; Wong, Daniel; Kim, Hyoseung

doi:10.4230/LIPIcs.ECRTS.2024.14

Citation Details

Scheduling real-time tasks that utilize GPUs with analyzable guarantees poses a significant challenge due to the intricate interaction between CPU and GPU resources, as well as the complex GPU hardware and software stack. While much research has been conducted in the real-time research community, several limitations persist, including the absence or limited availability of GPU-level preemption, extended blocking times, and/or the need for extensive modifications to program code. In this paper, we propose GCAPS, a GPU Context-Aware Preemptive Scheduling approach for real-time GPU tasks. Our approach exerts control over GPU context scheduling at the device driver level and enables preemption of GPU execution based on task priorities by simply adding one-line macros to GPU segment boundaries. In addition, we provide a comprehensive response time analysis of GPU-using tasks for both our proposed approach and the default Nvidia GPU driver scheduling, which follows a work-conserving round-robin policy. Through empirical evaluations and case studies, we demonstrate the effectiveness of the proposed approaches in improving taskset schedulability and response time. The results highlight significant improvements over prior work as well as the default scheduling approach, with up to 40% higher schedulability, while also achieving predictable worst-case behavior on Nvidia Jetson embedded platforms.

Award ID(s): 1955650; 2312395; 1943265; 2312397; 2230969

PAR ID: 10527894

Author(s) / Creator(s): Wang, Yidi; Liu, Cong; Wong, Daniel; Kim, Hyoseung

Editor(s): Pellizzoni, Rodolfo

Publisher / Repository: Schloss Dagstuhl – Leibniz-Zentrum für Informatik

Date Published: 2024-01-01

Volume: 298

ISSN: 1868-8969

ISBN: 978-3-95977-324-9

Page Range / eLocation ID: 298-298

Subject(s) / Keyword(s): Real-time systems; GPU scheduling; Computer systems organization → Real-time systems; Computer systems organization → Embedded and cyber-physical systems

Format(s): Medium: X; Size: 25 pages, 1914695 bytes; Other: application/pdf

Size(s): 25 pages; 1914695 bytes

Right(s): Creative Commons Attribution 4.0 International license; info:eu-repo/semantics/openAccess

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper: https://doi.org/10.4230/LIPIcs.ECRTS.2024.14
