Node-Aware Stencil Communication for Heterogeneous Supercomputers

Pearson, Carl; Hidayetoglu, Mert; Almasri, Mohammad; Anjum, Omer; Chung, I-Hsin; Xiong, Jinjun; Hwu, Wen-Mei W.

doi:10.1109/IPDPSW50202.2020.00136

Citation Details

Node-Aware Stencil Communication for Heterogeneous Supercomputers

High-performance distributed computing systems increasingly feature nodes that have multiple CPU sockets and multiple GPUs. The communication bandwidth between these components is non-uniform. Furthermore, these systems can expose different communication capabilities between these components. For communication-heavy applications, optimally using these capabilities is challenging and essential for performance. Bespoke codes with optimized communication may be non-portable across run-time/software/hardware configurations, and existing stencil frameworks neglect optimized communication. This work presents node-aware approaches for automatic data placement and communication implementation for 3D stencil codes on multi-GPU nodes with non-homogeneous communication performance and capabilities. Benchmarking results in the Summit system show that choices in placement can result in a 20% improvement in single-node exchange, and communication specialization can yield a further 6x improvement in exchange time in a single node, and a 16% improvement at 1536 GPUs. more »

Award ID(s):: 1725729

PAR ID:: 10190061

Author(s) / Creator(s):: Pearson, Carl; Hidayetoglu, Mert; Almasri, Mohammad; Anjum, Omer; Chung, I-Hsin; Xiong, Jinjun; Hwu, Wen-Mei W.

Date Published:: 2020-05-01

Journal Name:: 2020 IEEE International Parallel and Distributed Processing Symposium Workshops

Page Range / eLocation ID:: 796 to 805

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/IPDPSW50202.2020.00136

More Like this