A Distributed Matrix-Block-Vector Multiplication in Presence of System Performance Variability

Ma, Yuchen; Stathopoulos, Andreas; Ren, Bin

doi:10.1145/3774934.3786453

Citation Details

A Distributed Matrix-Block-Vector Multiplication in Presence of System Performance Variability

Distributed matrix-block-vector multiplication (Matvec) algorithm is a critical component of many applications, but can be computationally challenging for dense matrices of dimension \(O(10^6\text{--}10^7)\) and blocks of \(O(10\text{--}100)\) vectors. We present performance analysis, implementation, and optimization of our \pname{} library for Matvec under the effect of system variability. Our modeling shows that 1D pipelining Matvec is as efficient as 2D algorithms at small to medium clusters, which are sufficient for these problem sizes. We develop a performance tracing framework and a simulator that reveal pipeline bubbles caused by modest \textasciitilde{}5\% system variability. To tolerate such variability, our \pname{} library, which combines on-the-fly kernel matrix generation and Matvec, integrates four optimizations: inter-process data preloading, unconventional static thread scheduling, cache-aware tiling, and multi-version unrolling. In our benchmarks on \(O(10^5)\) Matvec problems, \pname{} achieves up to 1.85× speedup over COSMA and 17× over ScaLAPACK. For \(O(10^6)\) problems, where COSMA and ScaLAPACK exceed memory capacity, \pname{} maintains linear strong scaling and achieves peak performance of 75\%~FMA~Flop/s. Its static scheduling policy has a 2.27× speedup compared to the conventional work-stealing dynamic scheduler, and is predicted to withstand up to 108\% performance variability under exponential distributed variability simulation. more »

Award ID(s):: 2008557

PAR ID:: 10658757

Author(s) / Creator(s):: Ma, Yuchen; Stathopoulos, Andreas; Ren, Bin

Publisher / Repository:: Proceedings of the 31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming}{January 31 -- February 4, 2026

Date Published:: 2026-01-31

ISBN:: 979-8-4007-2310-0

Format(s):: Medium: X

Location:: Sydney, NSW, Australia

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Conference Paper:
https://doi.org/10.1145/3774934.3786453

More Like this