End-to-End LU Factorization of Large Matrices on GPUs

Xia, Yang; Jiang, Peng; Agrawal, Gagan; Ramnath, Rajiv

doi:10.1145/3572848.3577486

Citation Details

End-to-End LU Factorization of Large Matrices on GPUs

LU factorization for sparse matrices is an important computing step for many engineering and scientific problems such as circuit simulation. There have been many efforts toward parallelizing and scaling this algorithm, which include the recent efforts targeting the GPUs. However, it is still challenging to deploy a complete sparse LU factorization workflow on a GPU due to high memory requirements and data dependencies. In this paper, we propose the first complete GPU solution for sparse LU factorization. To achieve this goal, we propose an out-of-core implementation of the symbolic execution phase, thus removing the bottleneck due to large intermediate data structures. Next, we propose a dynamic parallelism implementation of Kahn's algorithm for topological sort on the GPUs. Finally, for the numeric factorization phase, we increase the parallelism degree by removing the memory limits for large matrices as compared to the existing implementation approaches. Experimental results show that compared with an implementation modified from GLU 3.0, our out-of-core version achieves speedups of 1.13--32.65X. Further, our out-of-core implementation achieves a speedup of 1.2--2.2 over an optimized unified memory implementation on the GPU. Finally, we show that the optimizations we introduce for numeric factorization turn out to be effective. more »

Award ID(s):: 2146873

PAR ID:: 10417475

Author(s) / Creator(s):: Xia, Yang; Jiang, Peng; Agrawal, Gagan; Ramnath, Rajiv

Date Published:: 2023-02-21

Journal Name:: PPoPP '23: Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming

Page Range / eLocation ID:: 288 to 300

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3572848.3577486

More Like this