AGILE: Lightweight and Efficient Asynchronous GPU-SSD Integration

Yang, Zhuoping (ORCID:0000000276554080); Zhuang, Jinming (ORCID:000000033659339X); Chen, Xingzhen (ORCID:0000000348653708); Jones, Alex (ORCID:0000000174980206); Zhou, Peipei (ORCID:0000000204931844)

doi:10.1145/3712285.3759778

This content will become publicly available on November 15, 2026

GPUs are critical for compute-intensive applications, yet emerging workloads such as recommender systems, graph analytics, and data analytics often exceed GPU memory capacity. Existing solutions allow GPUs to use CPU DRAM or SSDs as external memory, and the GPU-centric approach enables GPU threads to directly issue NVMe requests, further avoiding CPU intervention. However, current GPU-centric approaches adopt synchronous I/O, forcing threads to stall during long communication delays. We propose AGILE, a lightweight asynchronous GPU-centric I/O library that eliminates deadlock risks and integrates a flexible HBM-based software cache. AGILE overlaps computation and I/O, improving performance by up to 1.88× across workloads with diverse computation-to-communication ratios. Compared to BaM on DLRM, AGILE achieves up to 1.75× speedup through efficient design and overlapping; on graph applications, AGILE reduces software cache overhead by up to 3.12× and NVMe I/O overhead by up to 2.85×; AGILE also lowers per-thread register usage by up to 1.32×.
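To make the benefit of overlapping computation and I/O concrete, the following is a minimal back-of-the-envelope model, not AGILE's implementation or API. It assumes a hypothetical workload of n identical batches, each needing one I/O phase and one compute phase, and an idealized two-stage pipeline in which the I/O for the next batch runs while the current batch computes. The function names and the timing model are illustrative assumptions only.

```python
def sync_time(n, compute, io):
    """Total time when each batch waits for its own I/O before computing."""
    return n * (compute + io)


def overlapped_time(n, compute, io):
    """Total time for an idealized two-stage pipeline (assumed model).

    The first I/O cannot be hidden, the last compute cannot be hidden,
    and every step in between costs the slower of the two phases.
    """
    if n == 0:
        return 0.0
    return io + (n - 1) * max(compute, io) + compute


def overlap_speedup(n, compute, io):
    """Ratio of synchronous time to overlapped time under this model."""
    return sync_time(n, compute, io) / overlapped_time(n, compute, io)


if __name__ == "__main__":
    # Sweep the computation-to-communication ratio with io fixed at 1.0.
    for compute in (0.25, 0.5, 1.0, 2.0, 4.0):
        print(compute, round(overlap_speedup(1000, compute, 1.0), 3))
```

Under this simplified model the speedup approaches (compute + io) / max(compute, io), which peaks just below 2× when the two phases take equal time and shrinks as either phase dominates. Real systems add queueing, cache, and scheduling overheads that the model ignores.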

Award ID(s): 2217003; 2213701; 2511445; 2536952; 2328972; 2324864

PAR ID: 10647798

Author(s) / Creator(s): Yang, Zhuoping; Zhuang, Jinming; Chen, Xingzhen; Jones, Alex; Zhou, Peipei

Publisher / Repository: ACM

Date Published: 2025-11-15

Page Range / eLocation ID: 1028 to 1042

Subject(s) / Keyword(s): GPUs; SSDs; Asynchronous I/O; Software-managed cache; Memory hierarchy; Storage systems

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Paper:
https://doi.org/10.1145/3712285.3759778