NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Multifacets of lossy compression for scientific data in the Joint-Laboratory of Extreme Scale Computing

https://doi.org/10.1016/j.future.2024.05.022

Cappello, Franck; Acosta, Mario; Agullo, Emmanuel; Anzt, Hartwig; Calhoun, Jon; Di, Sheng; Giraud, Luc; Grützmacher, Thomas; Jin, Sian; Sano, Kentaro; et al (February 2025, Future Generation Computer Systems)

Full Text Available
SZOps: Scalar Operations for Error-bounded Lossy Compressor for Scientific Data

https://doi.org/10.1109/SCW63240.2024.00042

Agarwal, Tripti; Di, Sheng; Huang, Jiajun; Huang, Yafan; Gopalakrishnan, Ganesh; Underwood, Robert; Zhao, Kai; Liang, Xin; Li, Guanpeng; Cappello, Franck (November 2024, IEEE)

Error-bounded lossy compression has been a critical technique to significantly reduce the sheer amounts of simulation datasets for high-performance computing (HPC) scientific applications while effectively controlling the data distortion based on user-specified error bound. In many real-world use cases, users must perform computational operations on the compressed data. However, none of the existing error-bounded lossy compressors support operations, inevitably resulting in undesired decompression costs. In this paper, we propose a novel error-bounded lossy compressor (called SZOps), which supports not only error-bounding features but efficient computations (including negation, scalar addition, scalar multiplication, mean, variance, etc.) on the compressed data without the complete decompression step, which is the first attempt to the best of our knowledge. We develop several optimization strategies to maximize the overall compression ratio and execution performance. We evaluate SZOps compared to other state-of-the-art lossy compressors based on multiple real-world scientific application datasets.
more » « less
Full Text Available
hZCCL: Accelerating Collective Communication with Co-Designed Homomorphic Compression

https://doi.org/10.1109/SC41406.2024.00110

Huang, Jiajun; Di, Sheng; Yu, Xiaodong; Zhai, Yujia; Liu, Jinyang; Jian, Zizhe; Liang, Xin; Zhao, Kai; Lu, Xiaoyi; Chen, Zizhong; et al (November 2024, IEEE)

Full Text Available
CUSZP2: A GPU Lossy Compressor with Extreme Throughput and Optimized Compression Ratio

https://doi.org/10.1109/SC41406.2024.00021

Huang, Yafan; Di, Sheng; Li, Guanpeng; Cappello, Franck (November 2024, IEEE)

Full Text Available
CUSZ-i: High-Ratio Scientific Lossy Compression on GPUs with Optimized Multi-Level Interpolation

https://doi.org/10.1109/SC41406.2024.00019

Liu, Jinyang; Tian, Jiannan; Wu, Shixun; Di, Sheng; Zhang, Boyuan; Underwood, Robert; Huang, Yafan; Huang, Jiajun; Zhao, Kai; Li, Guanpeng; et al (November 2024, IEEE)

Full Text Available
Significantly Improving Fixed-Ratio Compression Framework for Resource-limited Applications

https://doi.org/10.1145/3673038.3673092

Nguyen, Tri; Rahman, Md Hasanur; Di, Sheng; Becchi, Michela (August 2024, ACM)

Scientific simulations running on HPC facilities generate massive amount of data, putting significant pressure onto supercomputers’ storage capacity and network bandwidth. To alleviate this problem, there has been a rich body of work on reducing data volumes via error-controlled lossy compression. However, fixed-ratio compression is not very well-supported, not allowing users to appropriately allocate memory/storage space or know the data transfer time over the network in advance. To address this problem, recent ratio-controlled frameworks, such as FXRZ, have incorporated methods to predict required error bound settings to reach a user-specified compression ratio. However, these approaches fail to achieve fixed-ratio compression in an accurate, efficient and scalable fashion on diverse datasets and compression algorithms. This work proposes an efficient, scalable, ratio-controlled lossy compression framework (CAROL). At the core of CAROL are four optimization strategies that allow for improving the prediction accuracy and runtime efficiency over state-of-the-art solutions. First, CAROL uses surrogate-based compression ratio estimation to generate training data. Second, it includes a novel calibration method to improve prediction accuracy across a variety of compressors. Third, it leverages Bayesian optimization to allow for efficient training and incremental model refinement. Forth, it uses GPU acceleration to speed up prediction. We evaluate CAROL on four compression algorithms and six scientific datasets. On average, when compared to the state-of-the-art FXRZ framework, CAROL achieves 4 × speedup in setup time and 36 × speedup in inference time, while maintaining less than 1% difference in estimation accuracy.
more » « less
Full Text Available
CereSZ: Enabling and Scaling Error-bounded Lossy Compression on Cerebras CS-2

https://doi.org/10.1145/3625549.3658691

Song, Shihui; Huang, Yafan; Jiang, Peng; Yu, Xiaodong; Zheng, Weijian; Di, Sheng; Cao, Qinglei; Feng, Yunhe; Xie, Zhen; Cappello, Franck (June 2024, ACM)

Full Text Available
A Portable, Fast, DCT-based Compressor for AI Accelerators

https://doi.org/10.1145/3625549.3658662

Shah, Milan; Yu, Xiaodong; Di, Sheng; Becchi, Michela; Cappello, Franck (June 2024, ACM)

Full Text Available
gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters

https://doi.org/10.1145/3650200.3656636

Huang, Jiajun; Di, Sheng; Yu, Xiaodong; Zhai, Yujia; Liu, Jinyang; Huang, Yafan; Raffenetti, Ken; Zhou, Hui; Zhao, Kai; Lu, Xiaoyi; et al (May 2024, ACM)

Full Text Available
An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression

https://doi.org/10.1109/IPDPS57955.2024.00072

Huang, Jiajun; Di, Sheng; Yu, Xiaodong; Zhai, Yujia; Zhang, Zhaorui; Liu, Jinyang; Lu, Xiaoyi; Raffenetti, Ken; Zhou, Hui; Zhao, Kai; et al (May 2024, IEEE)

Full Text Available

« Prev Next »

Search for: All records