FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Computing Applications on GPUs

Zhang, Boyuan; Tian, Jiannan; Di, Sheng; Yu, Xiaodong; Feng, Yunhe; Liang, Xin; Tao, Dingwen; Cappello, Franck

doi:10.1145/3588195.3592994

Citation Details

FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Computing Applications on GPUs

Today’s large-scale scientific applications running on high-performance computing (HPC) systems generate vast data volumes. Thus, data compression is becoming a critical technique to mitigate the storage burden and data-movement cost. However, existing lossy compressors for scientific data cannot achieve a high compression ratio and throughput simultaneously, hindering their adoption in many applications requiring fast compression, such as in-memory compression. To this end, in this work, we develop a fast and high-ratio error-bounded lossy compressor on GPUs for scientific data (called FZ-GPU). Specifically, we first design a new compression pipeline that consists of fully parallelized quantization, bitshuffle, and our newly designed fast encoding. Then, we propose a series of deep architectural optimizations for each kernel in the pipeline to take full advantage of CUDA architectures. We propose a warp-level optimization to avoid data conflicts for bit-wise operations in bitshuffle, maximize shared memory utilization, and eliminate unnecessary data movements by fusing different compression kernels. Finally, we evaluate FZ-GPU on two NVIDIA GPUs (i.e., A100 and RTX A4000) using six representative scientific datasets from SDRBench. Results on the A100 GPU show that FZ-GPU achieves an average speedup of 4.2× over cuSZ and an average speedup of 37.0× over a multi-threaded CPU implementation of our algorithm under the same error bound. FZ-GPU also achieves an average speedup of 2.3× and an average compression ratio improvement of 2.0× over cuZFP under the same data distortion. more »

Award ID(s):: 2312673 2232120 2303820 2034169 2303064 2042084 2247080 2104024

PAR ID:: 10408699

Author(s) / Creator(s):: Zhang, Boyuan; Tian, Jiannan; Di, Sheng; Yu, Xiaodong; Feng, Yunhe; Liang, Xin; Tao, Dingwen; Cappello, Franck

Publisher / Repository:: ACM

Date Published:: 2023-06-16

Journal Name:: The 32nd ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC 2023)

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Conference Paper:
https://doi.org/10.1145/3588195.3592994

More Like this