Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way

Chen, Yuxin; Brock, Benjamin; Porumbescu, Șerban; Buluc, Aydın; Yelick, Katherine; Owens, John D.

doi:10.1109/SC41404.2022.00055

We present Atos, a dynamic scheduling framework for multi-node-GPU systems that supports PGAS-style lightweight one-sided memory operations within and between nodes. Atos's lightweight GPU-to-GPU communication enables latency hiding and can smooth interconnect usage for bisection-limited problems. These benefits are significant for dynamic, irregular applications, which often involve fine-grained communication at unpredictable times and without predetermined patterns. Some principles for high performance: (1) do not involve the CPU in the communication control path; (2) allow GPU communication within kernels, addressing memory consistency directly rather than relying on synchronization with the CPU; (3) perform dynamic communication aggregation when interconnects have limited bandwidth. By lowering the overhead of communication and allowing it within GPU kernels, we support large, high-utilization GPU kernels that communicate more frequently. We evaluate Atos on two irregular problems: breadth-first search and PageRank. Atos outperforms the state-of-the-art graph libraries Gunrock, Groute, and Galois in both single-node-multi-GPU and multi-node-GPU settings.

Award ID(s): 1740333, 1823037

PAR ID: 10397861

Author(s) / Creator(s): Chen, Yuxin; Brock, Benjamin; Porumbescu, Șerban; Buluc, Aydın; Yelick, Katherine; Owens, John D.

Date Published: 2022-11-16

Journal Name: International Conference for High Performance Computing, Networking, Storage and Analysis

ISSN: 2167-4329

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Conference Paper: https://doi.org/10.1109/SC41404.2022.00055
