A Modular Static Cost Analysis for GPU Warp-Level Parallelism - Artifact POPL2026

Blike, Gregory; Cogumbreiro, Tiago; Lange, Julien; Zicarelli, Hannah; Sathiyamoorthy, Udaya

doi:10.6084/m9.figshare.30689102

Citation Details

Graphics Processing Units (GPUs) are the accelerator of choice for performance-critical applications, yet optimizing for performance requires mastery of the complex interactions between their memory architecture and their execution model. Existing static analysis tools for GPU kernels either identify performance bugs without quantifying costs or cannot handle thread-divergent control flow, leading to significant over-approximations. We present the first static relational-cost analysis for GPU warp-level parallelism that can give exact bounds even in the presence of thread divergence. Our analysis is general and flexible, as it is parametric on the resource metric (uncoalesced accesses, bank conflicts) and on the cost relation (=, ≤, ≥). We establish a soundness theorem for our technique, provide mechanized proofs in Rocq, and implement our theory in a tool called Pico. In a reproducibility experiment, Pico produced the tightest bounds on every input, outperforming the state-of-the-art tool RaCUDA on 10 kernels (1.7× better), while RaCUDA produced 4 incorrect bounds and crashed on 2 kernels. In an experiment to measure the accuracy of Pico, we studied the impact of thread divergence in control flow on a dataset of 226 kernels. We found that at least 75.3% of conditionals and 85.4% of loops can be captured exactly, without introducing approximation.
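To make the "uncoalesced accesses" resource metric concrete, the following is a minimal illustrative sketch that counts, for a single warp, how many distinct memory segments one access touches. It is not Pico's analysis, which is static and symbolic; it only evaluates the quantity for concrete thread IDs. The warp size of 32, the 128-byte segment size, the 4-byte element size, and every name in the block are assumptions made for illustration.

```python
WARP_SIZE = 32        # threads per warp (assumed)
SEGMENT_BYTES = 128   # bytes served by one memory transaction (assumed)


def transactions_per_warp(index_of, elem_bytes=4, active=lambda tid: True):
    """Count the distinct memory segments touched by one warp.

    index_of: maps a thread ID to the array index that thread accesses.
    active:   predicate selecting the threads that execute the access;
              a thread-divergent branch disables some threads.
    """
    segments = {
        (index_of(tid) * elem_bytes) // SEGMENT_BYTES
        for tid in range(WARP_SIZE)
        if active(tid)
    }
    return len(segments)


# Hypothetical access patterns, named only for this example.
coalesced = transactions_per_warp(lambda tid: tid)        # a[tid]
strided = transactions_per_warp(lambda tid: 32 * tid)     # a[32 * tid]
divergent = transactions_per_warp(
    lambda tid: 32 * tid,
    active=lambda tid: tid % 2 == 0,                      # if (tid % 2 == 0) a[32 * tid]
)
```

Under these assumptions, the contiguous access needs a single transaction, while the strided access needs one per active thread. The divergent case shows why a bound that ignores which threads are active over-approximates the cost.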

Award ID(s): 2204986

PAR ID: 10668317

Author(s) / Creator(s): Blike, Gregory; Cogumbreiro, Tiago; Lange, Julien; Zicarelli, Hannah; Sathiyamoorthy, Udaya

Publisher / Repository: figshare

Date Published: 2025-01-01

Edition / Version: 1.0

Subject(s) / Keyword(s): Formal methods for software; Automated software engineering; Software testing, verification and validation

Format(s): Medium: X; Size: 1954620573 Bytes

Size(s): 1954620573 Bytes

Right(s): MIT

Sponsoring Org: National Science Foundation

Software:
https://doi.org/10.6084/m9.figshare.30689102
