Title: Accelerating DNN Inference with GraphBLAS and the GPU
This work addresses the 2019 Sparse Deep Neural Network Graph Challenge with an implementation based on the GraphBLAS programming model. We demonstrate our solution with GraphBLAST, a GraphBLAS implementation on the GPU, and compare it to SuiteSparse, a GraphBLAS implementation on the CPU. The GraphBLAST implementation is 1.94× faster than SuiteSparse; the primary opportunity to increase performance on the GPU is a higher-performance sparse-matrix-times-sparse-matrix (SpGEMM) kernel.
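The inference step in this challenge is a repeated sparse matrix product followed by a bias, a ReLU, and a clamp, which is why SpGEMM dominates the runtime. Below is a minimal sketch of that loop, not the paper's code, using SciPy sparse matrices to stand in for a GraphBLAS backend; the function name sparse_dnn_inference and the inputs Ws (layer weight matrices), bs (per-layer bias scalars), Y0 (input feature matrix), and ymax (clamp value) are illustrative assumptions.

```python
# Minimal sketch of Graph Challenge-style sparse DNN inference,
# Y <- clamp(ReLU(Y @ W + b)), with SciPy SpGEMM standing in for GraphBLAS.
import numpy as np
import scipy.sparse as sp

def sparse_dnn_inference(Y0, Ws, bs, ymax=32.0):
    """Y0: sparse feature matrix; Ws: list of sparse weight matrices;
    bs: list of per-layer bias scalars (negative in the challenge data)."""
    Y = Y0.tocsr()
    for W, b in zip(Ws, bs):
        Z = Y @ W                            # SpGEMM: sparse features x sparse weights
        Z.data += b                          # bias only touches stored entries; with a
                                             # negative bias, structural zeros stay zero
                                             # after ReLU anyway
        Z.data = np.clip(Z.data, 0.0, ymax)  # ReLU plus the challenge's upper clamp
        Z.eliminate_zeros()                  # keep the feature matrix sparse
        Y = Z
    return Y
```

A GraphBLAS version expresses the same loop as a masked matrix-matrix multiply plus an elementwise apply for the bias, ReLU, and clamp, so its speed is governed largely by the SpGEMM kernel, consistent with the observation in the abstract.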
Award ID(s): 1629657, 1740333
NSF-PAR ID: 10171725
Journal Name: Proceedings of the IEEE High Performance Extreme Computing Conference
Sponsoring Org: National Science Foundation
More Like this
  1. High-performance implementations of graph algorithms are challenging to develop on new parallel hardware such as GPUs for three reasons: (1) the difficulty of devising graph building blocks, (2) load imbalance on parallel hardware, and (3) the low arithmetic intensity of graph problems. To address some of these challenges, GraphBLAS is an ongoing effort by the graph analytics community to propose building blocks based on sparse linear algebra, which allow graph algorithms to be expressed in a performant, succinct, composable, and portable manner. In this paper, we examine the performance challenges of a linear-algebra-based approach to building graph frameworks and describe new design principles for overcoming these bottlenecks. Among the new design principles is exploiting input sparsity, which allows users to write graph algorithms without specifying push or pull direction. Exploiting output sparsity allows users to tell the backend which values of the output in a single vectorized computation they do not want computed (a small sketch of this masking idea appears after this list). Load balancing is an important feature for distributing work evenly amongst parallel workers, and we describe the load-balancing features needed to handle graphs with different characteristics. The design principles described in this paper have been implemented in GraphBLAST, the first open-source high-performance linear-algebra-based graph framework on NVIDIA GPUs. The results show that on a single GPU, GraphBLAST has on average at least an order of magnitude speedup over the previous GraphBLAS implementations SuiteSparse and GBTL, comparable performance to the fastest GPU hardwired primitives and shared-memory graph frameworks Ligra and Gunrock, and better performance than any other GPU graph framework, while offering a simpler and more concise programming model.
  2. We design and implement parallel graph coloring algorithms on the GPU using two different abstractions—one data-centric (Gunrock), the other linear-algebra-based (GraphBLAS). We analyze the impact of variations of a baseline independent-set algorithm on quality and runtime. We study how optimizations such as hashing, avoiding atomics, and a max-min heuristic affect performance. Our Gunrock graph coloring implementation has a peak 2x speed-up and a geomean speed-up of 1.3x, and produces 1.6x more colors than previous hardwired state-of-the-art implementations on real-world datasets. Our GraphBLAS implementation of Luby’s algorithm produces 1.9x fewer colors than the previous state-of-the-art parallel implementation at the cost of 3x extra runtime, and 1.014x fewer colors than a greedy, sequential algorithm with a geomean speed-up of 2.6x (a sketch of the independent-set coloring scheme appears after this list).
  3. With the proliferation of low-cost sensors and the Internet of Things, the rate of producing data far exceeds the compute and storage capabilities of today’s infrastructure. Much of this data takes the form of time series, and in response, there has been increasing interest in the creation of time series archives in the last decade, along with the development and deployment of novel analysis methods to process the data. The general strategy has been to apply a plurality of similarity search mechanisms to various subsets and subsequences of time series data in order to identify repeated patterns and anomalies; however, the computational demands of these approaches render them incompatible with today’s power-constrained embedded CPUs. To address this challenge, we present FA-LAMP, an FPGA-accelerated implementation of the Learned Approximate Matrix Profile (LAMP) algorithm, which predicts the correlation between streaming data sampled in real-time and a representative time series dataset used for training. FA-LAMP lends itself as a real-time solution for time series analysis problems such as classification. We present the implementation of FA-LAMP on both edge- and cloud-based prototypes. On the edge devices, FA-LAMP integrates accelerated computation as close as possible to IoT sensors, thereby eliminating the need to transmit and store data in the cloud for later analysis. On the cloud-based accelerators, FA-LAMP can execute multiple LAMP models on the same board, allowing simultaneous processing of incoming data from multiple data sources across a network. LAMP employs a Convolutional Neural Network (CNN) for prediction. This work investigates the challenges and limitations of deploying CNNs on FPGAs using the Xilinx Deep Learning Processor Unit (DPU) and the Vitis AI development environment. We expose several technical limitations of the DPU, while providing a mechanism to overcome them by attaching custom IP block accelerators to the architecture. We evaluate FA-LAMP using a low-cost Xilinx Ultra96-V2 FPGA as well as a cloud-based Xilinx Alveo U280 accelerator card and measure their performance against a prototypical LAMP deployment running on a Raspberry Pi 3, an Edge TPU, a GPU, a desktop CPU, and a server-class CPU. In the edge scenario, the Ultra96-V2 FPGA improved performance and energy consumption compared to the Raspberry Pi; in the cloud scenario, the server CPU and GPU outperformed the Alveo U280 accelerator card, while the desktop CPU achieved comparable performance; however, the Alveo card offered an order of magnitude lower energy consumption compared to the other four platforms. Our implementation is publicly available at https://github.com/aminiok1/lamp-alveo.
  4. Recent developments in the computational automated design of electromagnetic devices, otherwise known as inverse design, have significantly enhanced the design process for nanophotonic systems. Inverse design can both reduce design time considerably and lead to high-performance, nonintuitive structures that would otherwise have been impossible to develop manually. Despite the successes enjoyed by structure optimization techniques, most approaches leverage electromagnetic solvers that require significant computational resources and suffer from slow convergence and numerical dispersion. Recently, a fast simulation and boundary-based inverse design approach based on boundary integral equations was demonstrated for two-dimensional nanophotonic problems. In this work, we introduce a new full-wave three-dimensional simulation and boundary-based optimization framework for nanophotonic devices also based on boundary integral methods, which achieves high accuracy even at coarse mesh discretizations while only requiring modest computational resources. The approach has been further accelerated by leveraging GPU computing, a sparse block-diagonal preconditioning strategy, and a matrix-free implementation of the discrete adjoint method. As a demonstration, we optimize three different devices: a 1:2 1550 nm power splitter and two nonadiabatic mode-preserving waveguide tapers. To the best of our knowledge, the tapers, which span 40 wavelengths in the silicon material, are the largest silicon photonic waveguiding devices to have been optimized using a full-wave 3D solution of Maxwell’s equations.
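As referenced in item 1 above, the "exploit output sparsity" design principle amounts to passing a mask into a vectorized operation so that unwanted outputs are never computed. The sketch below is not GraphBLAST code; it illustrates the semantics of one masked BFS step with SciPy, where the mask (the complement of visited) is applied after the multiply because plain SciPy has no mask argument. The names bfs_step, A_csr, frontier, and visited are illustrative.

```python
# Illustrative sketch of one BFS step as a masked sparse matrix-vector product.
# In a GraphBLAS backend the mask (~visited) is supplied to the multiply itself,
# so masked-out outputs are never computed; here it is applied afterwards.
import numpy as np
import scipy.sparse as sp

def bfs_step(A_csr, frontier, visited):
    """A_csr: adjacency matrix; frontier, visited: boolean vectors of length n."""
    reached = (A_csr.T @ frontier.astype(np.float64)) > 0  # push vs. pull is hidden from the user
    next_frontier = reached & ~visited                      # output mask: skip visited vertices
    return next_frontier, visited | next_frontier

# Example on a 4-vertex path graph 0-1-2-3, starting BFS from vertex 0.
A = sp.csr_matrix(np.array([[0, 1, 0, 0],
                            [1, 0, 1, 0],
                            [0, 1, 0, 1],
                            [0, 0, 1, 0]], dtype=np.float64))
frontier = np.array([True, False, False, False])
visited = frontier.copy()
frontier, visited = bfs_step(A, frontier, visited)   # frontier is now {1}
```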
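The GraphBLAS coloring in item 2 is based on Luby-style independent sets: in each round every uncolored vertex draws a random priority, the vertices that beat all of their uncolored neighbors form an independent set, and that set receives the next color. The following is an illustrative CPU sketch with NumPy/SciPy, not the authors' GPU implementation, and it omits the hashing, atomics-avoidance, and max-min optimizations mentioned in the abstract; the name luby_coloring and the assumption that A_csr is a symmetric adjacency matrix are mine.

```python
# Illustrative Luby-style independent-set coloring (not the authors' GPU code).
# Each round: draw random priorities, take the vertices that are local maxima
# among uncolored neighbors as an independent set, and give them the next color.
import numpy as np
import scipy.sparse as sp

def luby_coloring(A_csr, seed=0):
    """A_csr: symmetric (undirected) adjacency matrix without self-loops."""
    rng = np.random.default_rng(seed)
    n = A_csr.shape[0]
    coo = A_csr.tocoo()
    u, v = coo.row, coo.col
    colors = np.full(n, -1)              # -1 marks an uncolored vertex
    color = 0
    while (colors == -1).any():
        active = colors == -1
        r = rng.random(n)                # fresh random priorities each round
        edge_active = active[u] & active[v]
        # endpoint u "loses" if an active neighbor v has a higher priority
        # (ties broken by vertex index, so exactly one endpoint of a tie loses)
        u_loses = edge_active & ((r[v] > r[u]) | ((r[v] == r[u]) & (v > u)))
        losers = np.zeros(n, dtype=bool)
        losers[u[u_loses]] = True
        winners = active & ~losers       # an independent set of local maxima
        colors[winners] = color
        color += 1
    return colors
```

The max-min heuristic mentioned in the abstract presumably extends this by also coloring the local minima in each round, roughly halving the number of rounds needed.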