Title: Two-Face: Combining Collective and One-Sided Communication for Efficient Distributed SpMM
Sparse matrix-dense matrix multiplication (SpMM) is commonly used in applications ranging from scientific computing to graph neural networks. Typically, when SpMM is executed on a distributed platform, communication costs dominate, and those costs depend on how communication is scheduled. If communication is scheduled in a sparsity-unaware manner, such as with collectives, execution is often inefficient due to unnecessary data transfers. On the other hand, if it is scheduled in a fine-grained, sparsity-aware manner that communicates only the necessary data, execution can also be inefficient due to high software overhead. We observe that individual sparse matrices often contain regions that are denser and regions that are sparser. Based on this observation, we develop a model that partitions communication into sparsity-unaware and sparsity-aware components. Leveraging the partition, we develop a new algorithm, called Two-Face, that performs collective communication for the denser regions and fine-grained, one-sided communication for the sparser regions. We show that Two-Face attains an average speedup of 2.11x over prior work when evaluated on a 4096-core supercomputer. Additionally, Two-Face scales well with the machine size.
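A minimal Python sketch of one way the denser/sparser split described above could be computed: column blocks whose nonzero density exceeds a threshold would be candidates for sparsity-unaware (collective) communication, and the rest for fine-grained, one-sided communication. The block size, threshold, and function name are illustrative assumptions, not details taken from the paper.

    # Illustrative sketch (not the paper's implementation): split the columns of a
    # sparse matrix into "denser" blocks (candidates for collective communication)
    # and "sparser" blocks (candidates for one-sided communication) using a
    # nonzero-density threshold. Block size and threshold are hypothetical.
    import scipy.sparse as sp

    def partition_column_blocks(A, block_cols=256, density_threshold=0.05):
        """Return (denser, sparser) lists of [start, end) column ranges."""
        A = A.tocsc()
        n_rows, n_cols = A.shape
        denser, sparser = [], []
        for start in range(0, n_cols, block_cols):
            end = min(start + block_cols, n_cols)
            block = A[:, start:end]
            density = block.nnz / (n_rows * (end - start))
            (denser if density >= density_threshold else sparser).append((start, end))
        return denser, sparser

    if __name__ == "__main__":
        A = sp.random(4096, 4096, density=0.01, format="csr", random_state=0)
        denser, sparser = partition_column_blocks(A)
        print(len(denser), "denser blocks,", len(sparser), "sparser blocks")

In the system the abstract describes, the denser blocks would then be exchanged with collectives and the sparser blocks with one-sided transfers of only the needed data.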
Award ID(s): 2316234, 2316233
PAR ID: 10510623
Author(s) / Creator(s):
Publisher / Repository: ACM
Date Published:
ISBN: 9798400703850
Page Range / eLocation ID: 1200 to 1217
Format(s): Medium: X
Location: La Jolla, CA, USA
Sponsoring Org: National Science Foundation
More Like this
1. Sparse matrix-matrix multiplication (SpMM) is a critical computational kernel in numerous scientific and machine learning applications. SpMM involves massive irregular memory accesses and poses great challenges to conventional cache-based computer architectures. Recently, dedicated SpMM accelerators have been proposed to enhance SpMM performance. However, current SpMM accelerators still face challenges in adapting to varied sparse patterns, fully exploiting inherent parallelism, and optimizing cache performance. To address these issues, we introduce ACES, a novel SpMM accelerator. First, ACES features an adaptive execution flow that dynamically adjusts to diverse sparse patterns, balancing parallel computing efficiency and data reuse. Second, ACES incorporates locality-concurrency co-optimizations within the global cache: a concurrency-aware cache management policy considers both data locality and concurrency when making replacement decisions, and a non-blocking buffer integrated with the global cache enhances concurrency and reduces computational stalls. Third, the hardware architecture of ACES integrates all of these innovations, ensuring efficient support for the adaptive execution flow, the advanced cache optimizations, and fine-grained parallel processing. Our performance evaluation demonstrates that ACES significantly outperforms existing solutions, providing a 2.1× speedup and marking a substantial advancement in SpMM acceleration.
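The concurrency-aware replacement idea can be illustrated with a toy software model. Everything below (the scoring function, its weight, the field names) is an invented illustration of combining locality and concurrency in an eviction decision, not ACES's actual policy or hardware.

    # Toy software model (not the ACES hardware policy): pick a replacement victim
    # using a score that combines recency (locality) with the number of in-flight
    # requests that still reference a line (concurrency). The weighting is a
    # made-up illustration of "locality-concurrency co-optimization".
    from dataclasses import dataclass

    @dataclass
    class CacheLine:
        tag: int
        last_use: int      # timestamp of the most recent access
        pending_refs: int  # in-flight requests that will reuse this line

    def choose_victim(lines, now, concurrency_weight=4):
        # Lower score = better eviction candidate: old lines with no pending
        # consumers go first; lines still needed by outstanding requests are kept.
        def score(line):
            age = now - line.last_use
            return -age + concurrency_weight * line.pending_refs
        return min(lines, key=score)

    if __name__ == "__main__":
        lines = [CacheLine(0xA, last_use=90, pending_refs=3),
                 CacheLine(0xB, last_use=10, pending_refs=0),
                 CacheLine(0xC, last_use=95, pending_refs=0)]
        victim = choose_victim(lines, now=100)
        print(hex(victim.tag))  # 0xb: the oldest line with no pending consumers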
2. Vision Transformers (ViTs) have achieved state-of-the-art performance on various vision tasks. However, ViTs’ self-attention module is still arguably a major bottleneck, limiting their achievable hardware efficiency and broader deployment on resource-constrained platforms. Meanwhile, existing accelerators dedicated to NLP Transformers are not optimal for ViTs, because ViTs differ substantially from Transformers for natural language processing (NLP) tasks: ViTs have a relatively fixed number of input tokens, whose attention maps can be pruned by up to 90% even with fixed sparse patterns without severely hurting model accuracy (e.g., <=1.5% accuracy drop at a 90% pruning ratio), whereas NLP Transformers must handle input sequences of varying numbers of tokens and rely on on-the-fly prediction of dynamic sparse attention patterns for each input to achieve decent sparsity (e.g., >=50%). To this end, we propose a dedicated algorithm and accelerator co-design framework, dubbed ViTCoD, for accelerating ViTs. On the algorithm level, ViTCoD prunes and polarizes the attention maps to have either denser or sparser fixed patterns, regularizing the workload into two levels without hurting accuracy; this largely reduces the attention computations while leaving room for alleviating the remaining dominant data movements. On top of that, we integrate a lightweight, learnable auto-encoder module that trades the dominant high-cost data movements for lower-cost computations. On the hardware level, we develop a dedicated accelerator that simultaneously coordinates the enforced denser and sparser workloads for boosted hardware utilization, while integrating on-chip encoder and decoder engines to leverage ViTCoD’s algorithm pipeline for much reduced data movements. Extensive experiments and ablation studies validate that ViTCoD largely reduces the dominant data movement costs, achieving speedups of up to 235.3×, 142.9×, 86.0×, 10.1×, and 6.8× over general computing platforms (CPUs, EdgeGPUs, GPUs) and prior-art Transformer accelerators (SpAtten and Sanger), respectively, under an attention sparsity of 90%. Our code implementation is available at https://github.com/GATECH-EIC/ViTCoD.
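As a rough algorithm-level illustration of "prune and polarize" (the parameters, function name, and column-grouping heuristic below are assumptions for illustration, not ViTCoD's released code), one could fix a per-row sparsity and then separate heavily attended columns from rarely attended ones:

    # Illustrative sketch: prune an attention map to a fixed per-row sparsity,
    # then polarize columns into a "denser" group (attended by many rows) and a
    # "sparser" group, which an accelerator could schedule as two regular
    # workloads. Parameters are hypothetical.
    import numpy as np

    def prune_and_polarize(attn, keep_ratio=0.1, dense_col_fraction=0.2):
        n, m = attn.shape
        k = max(1, int(keep_ratio * m))
        # Keep the top-k entries per row (fixed per-row sparsity).
        mask = np.zeros_like(attn, dtype=bool)
        topk = np.argpartition(attn, -k, axis=1)[:, -k:]
        mask[np.arange(n)[:, None], topk] = True
        # Polarize: the columns hit by the most rows form the denser workload.
        hits = mask.sum(axis=0)
        n_dense = max(1, int(dense_col_fraction * m))
        dense_cols = np.argsort(hits)[-n_dense:]
        sparse_cols = np.setdiff1d(np.arange(m), dense_cols)
        return mask, dense_cols, sparse_cols

    if __name__ == "__main__":
        rng = np.random.default_rng(0)
        attn = rng.random((197, 197))  # ViT-like token count
        mask, dense_cols, sparse_cols = prune_and_polarize(attn)
        print(mask.mean(), len(dense_cols), len(sparse_cols))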
3. It is appealing but challenging to achieve real-time deep neural network (DNN) inference on mobile devices, because even powerful modern mobile devices are considered “resource-constrained” when executing large-scale DNNs. This necessitates sparse model inference via weight pruning, i.e., DNN weight sparsity, and it is desirable to design a new DNN weight sparsity scheme that facilitates real-time inference on mobile devices while preserving high sparse-model accuracy. This paper designs GRIM, a novel mobile inference acceleration framework that is General to both convolutional neural networks (CNNs) and recurrent neural networks (RNNs) and that achieves Real-time execution and high accuracy, leveraging fine-grained structured sparse model Inference and compiler optimizations for Mobiles. We start by proposing a new fine-grained structured sparsity scheme through Block-based Column-Row (BCR) pruning. Based on this new fine-grained structured sparsity, our GRIM framework consists of two parts: (a) compiler optimization and code generation for real-time mobile inference; and (b) BCR pruning optimizations for determining pruning hyperparameters and performing weight pruning. We compare GRIM with Alibaba MNN, TVM, TensorFlow-Lite, a sparse implementation based on CSR, PatDNN, and ESE (a representative FPGA inference acceleration framework for RNNs), and achieve up to 14.08× speedup.
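A minimal sketch of block-based column pruning in the spirit of BCR pruning follows; the block size, pruning ratio, and use of an L2-norm criterion are assumptions for illustration rather than GRIM's actual procedure (BCR pruning also prunes rows within blocks, which is omitted here).

    # Hedged sketch: split the weight matrix into column blocks and, inside each
    # block, zero out the whole columns with the lowest L2 norm, producing a
    # fine-grained structured sparsity pattern. Parameters are illustrative.
    import numpy as np

    def bcr_column_prune(W, block_cols=16, prune_ratio=0.5):
        W = W.copy()
        for start in range(0, W.shape[1], block_cols):
            block = W[:, start:start + block_cols]   # view into W
            norms = np.linalg.norm(block, axis=0)
            n_prune = int(prune_ratio * block.shape[1])
            if n_prune:
                prune_idx = np.argsort(norms)[:n_prune]
                block[:, prune_idx] = 0.0            # prune whole columns in the block
        return W

    if __name__ == "__main__":
        rng = np.random.default_rng(0)
        W = rng.standard_normal((128, 256))
        Wp = bcr_column_prune(W)
        print("sparsity:", np.mean(Wp == 0.0))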
4. We consider a sparse matrix-matrix multiplication (SpGEMM) setting where one matrix is square and the other is tall and skinny. This special variant, TS-SpGEMM, has important applications in multi-source breadth-first search, influence maximization, sparse graph embedding, and algebraic multigrid solvers. Unfortunately, popular distributed algorithms such as sparse SUMMA deliver suboptimal performance for TS-SpGEMM. To address this limitation, we develop a novel distributed-memory algorithm tailored to TS-SpGEMM. Our approach employs customized 1D partitioning for all matrices involved and leverages sparsity-aware tiling for efficient data transfers. In addition, it minimizes communication overhead by incorporating both local and remote computations. On average, our TS-SpGEMM algorithm attains 5× performance gains over 2D and 3D SUMMA. Furthermore, we use our algorithm to implement multi-source breadth-first search and sparse graph embedding algorithms and demonstrate their scalability up to 512 nodes (65,536 cores) on NERSC Perlmutter.
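One ingredient of sparsity-aware tiling can be sketched as follows: under 1D row partitioning, a process only needs the rows of the tall-skinny matrix whose indices actually appear as column indices in its local piece of the square matrix. The partition sizes and function name below are illustrative assumptions, not the paper's code.

    # Minimal sketch of the sparsity-aware communication idea: with 1D row
    # partitioning, request from each rank only the rows of B whose indices show
    # up as nonzero column indices in the local slice of A.
    import numpy as np
    import scipy.sparse as sp

    def rows_of_B_needed(A_local_csr, row_range_per_rank):
        needed = np.unique(A_local_csr.indices)  # column indices that carry nonzeros
        requests = {}
        for rank, (lo, hi) in enumerate(row_range_per_rank):
            rows = needed[(needed >= lo) & (needed < hi)]
            if rows.size:
                requests[rank] = rows
        return requests

    if __name__ == "__main__":
        A_local = sp.random(1000, 8000, density=0.001, format="csr", random_state=0)
        ranges = [(r * 2000, (r + 1) * 2000) for r in range(4)]  # 4 hypothetical ranks
        reqs = rows_of_B_needed(A_local, ranges)
        print({rank: len(rows) for rank, rows in reqs.items()})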
5. An important sparse tensor computation is sparse-tensor-dense-matrix multiplication (SpTM), which is used in tensor decomposition and its applications. SpTM is a multi-dimensional analog of sparse-matrix-dense-matrix multiplication (SpMM). In this article, we employ a hierarchical tensor data layout that can unfold a multidimensional tensor into a 2D matrix, making it possible to compute SpTM using SpMM kernel implementations for GPUs. We compare two SpMM-based implementations against the state-of-the-art PASTA sparse tensor contraction implementation: (1) SpMM with the hierarchical tensor data layout; and (2) unfolding followed by an invocation of cuSPARSE’s SpMM. Results show that SpMM can outperform PASTA 70.9% of the time, but none of the three approaches is best overall. We therefore use a decision tree classifier to identify the best-performing sparse tensor contraction kernel based on precomputed properties of the sparse tensor.
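The unfolding idea can be sketched in a few lines: flattening the leading modes of a sparse COO tensor into a single row index yields an ordinary sparse matrix, so a stock SpMM routine performs the contraction along the last mode. The shapes and routine below are illustrative; this is not the paper's GPU kernel or its hierarchical layout.

    # Hedged sketch of the unfolding idea: contract a sparse COO tensor with a
    # dense matrix along its last mode by flattening the leading modes into a
    # row index, so an ordinary SpMM routine does the work.
    import numpy as np
    import scipy.sparse as sp

    def sptm_last_mode(coords, vals, shape, U):
        """Y[i, j, r] = sum_k X[i, j, k] * U[k, r] for a 3-D COO tensor X."""
        I, J, K = shape
        rows = coords[0] * J + coords[1]   # flatten modes (i, j) -> one row index
        X_unfold = sp.coo_matrix((vals, (rows, coords[2])), shape=(I * J, K)).tocsr()
        Y_flat = X_unfold @ U              # plain SpMM on the unfolded tensor
        return Y_flat.reshape(I, J, U.shape[1])

    if __name__ == "__main__":
        rng = np.random.default_rng(0)
        I, J, K, R = 30, 40, 50, 8
        nnz = 500
        coords = np.vstack([rng.integers(0, d, nnz) for d in (I, J, K)])
        vals = rng.standard_normal(nnz)
        U = rng.standard_normal((K, R))
        Y = sptm_last_mode(coords, vals, (I, J, K), U)
        print(Y.shape)  # (30, 40, 8)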