Scalable Breadth-First Search on a GPU Cluster

Pan, Yuechao; Pearce, Roger; Owens, John D.

doi:10.1109/IPDPS.2018.00118

Citation Details

Scalable Breadth-First Search on a GPU Cluster

On a GPU cluster, the ratio of high computing power to communication bandwidth makes scaling breadth-first search (BFS) on a scale-free graph extremely challenging. By separating high and low out-degree vertices, we present an implementation with scalable computation and a model for scalable communication for BFS and direction-optimized BFS. Our communication model uses global reduction for high-degree vertices, and point-to-point transmission for low-degree vertices. Leveraging the characteristics of degree separation, we reduce the graph size to one third of the conventional edge list representation. With several other optimizations, we observe linear weak scaling as we increase the number of GPUs, and achieve 259.8 GTEPS on a scale-33 Graph500 RMAT graph with 124 GPUs on the latest CORAL early access system.Proceedings of the 31st IEEE International Parallel and Distributed Processing Symposium more »

Award ID(s):: 1740333 1629657

PAR ID:: 10066971

Author(s) / Creator(s):: Pan, Yuechao; Pearce, Roger; Owens, John D.

Date Published:: 2018-05-01

Journal Name:: Proceedings of the 31st IEEE International Parallel and Distributed Processing Symposium

Page Range / eLocation ID:: 1090 to 1101

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/IPDPS.2018.00118

More Like this