Search for: All records

Creators/Authors contains: "Cappello, Franck"

« Prev Next »

Total Resources

49

Resource Type
Conference Paper

43

Conference Proceeding

0

Dataset

0

Journal Article

6

Workshop Report

0

Availability
Full Text / Resource Available

39

Citation Only

10

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Towards Efficient I/O Pipelines using Accumulated Compression

Maurya, Avinash ; Nicolae, Bogdan ; Rafique, M. Mustafa ; Cappello, Franck ( December 2023 , IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC))

Free, publicly-accessible full text available December 20, 2024
Towards Efficient I/O Pipelines using Accumulated Compression

Maurya, Avinash ; Nicolae, Bogdan ; Rafique, M. Mustafa ; Cappello, Franck ( December 2023 , IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC))

Free, publicly-accessible full text available December 18, 2024
AMRIC: A Novel In Situ Lossy Compression Framework for Efficient I/O in Adaptive Mesh Refinement Applications

https://doi.org/10.1145/3581784.3613212

Wang, Daoce ; Pulido, Jesus ; Grosset, Pascal ; Tian, Jiannan ; Jin, Sian ; Tang, Houjun ; Sexton, Jean ; Di, Sheng ; Zhao, Kai ; Fang, Bo ; et al ( November 2023 , ACM)
GPU-Enabled Asynchronous Multi-level Checkpoint Caching and Prefetching

https://doi.org/10.1145/3588195.3592987

Maurya, Avinash ; Rafique, M. Mustafa ; Tonellot, Thierry ; AlSalem, Hussain J. ; Cappello, Franck ; Nicolae, Bogdan ( August 2023 , ACM)

Free, publicly-accessible full text available August 7, 2024
Lightweight Huffman Coding for Efficient GPU Compression

https://doi.org/10.1145/3577193.3593736

Shah, Milan ; Yu, Xiaodong ; Di, Sheng ; Becchi, Michela ; Cappello, Franck ( June 2023 , ICS '23: Proceedings of the 37th International Conference on Supercomputing)

Free, publicly-accessible full text available June 21, 2024
FAZ: A flexible auto-tuned modular error-bounded compression framework for scientific data

https://doi.org/10.1145/3577193.3593721

Liu, Jinyang ; Di, Sheng ; Zhao, Kai ; Liang, Xin ; Chen, Zizhong ; Cappello, Franck ( June 2023 , ICS '23: Proceedings of the 37th International Conference on Supercomputing)

Free, publicly-accessible full text available June 21, 2024
GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs

https://doi.org/10.1145/3577193.3593706

Zhang, Boyuan ; Tian, Jiannan ; Di, Sheng ; Yu, Xiaodong ; Swany, Martin ; Tao, Dingwen ; Cappello, Franck ( June 2023 , ICS '23: Proceedings of the 37th International Conference on Supercomputing)

Free, publicly-accessible full text available June 21, 2024
GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs

Zhang, Boyuan ; Tian, Jiannan ; Di, Sheng ; Yu, Xiaodong ; Swany, Martin ; Tao, Dingwen ; Cappello, Franck ( June 2023 , The 37th ACM International Conference on Supercomputing (ICS 2023))

Free, publicly-accessible full text available June 21, 2024
FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Computing Applications on GPUs

https://doi.org/10.1145/3588195.3592994

Zhang, Boyuan ; Tian, Jiannan ; Di, Sheng ; Yu, Xiaodong ; Feng, Yunhe ; Liang, Xin ; Tao, Dingwen ; Cappello, Franck ( June 2023 , The 32nd ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC 2023))

Today’s large-scale scientific applications running on high-performance computing (HPC) systems generate vast data volumes. Thus, data compression is becoming a critical technique to mitigate the storage burden and data-movement cost. However, existing lossy compressors for scientific data cannot achieve a high compression ratio and throughput simultaneously, hindering their adoption in many applications requiring fast compression, such as in-memory compression. To this end, in this work, we develop a fast and high-ratio error-bounded lossy compressor on GPUs for scientific data (called FZ-GPU). Specifically, we first design a new compression pipeline that consists of fully parallelized quantization, bitshuffle, and our newly designed fast encoding. Then, we propose a series of deep architectural optimizations for each kernel in the pipeline to take full advantage of CUDA architectures. We propose a warp-level optimization to avoid data conflicts for bit-wise operations in bitshuffle, maximize shared memory utilization, and eliminate unnecessary data movements by fusing different compression kernels. Finally, we evaluate FZ-GPU on two NVIDIA GPUs (i.e., A100 and RTX A4000) using six representative scientific datasets from SDRBench. Results on the A100 GPU show that FZ-GPU achieves an average speedup of 4.2× over cuSZ and an average speedup of 37.0× over a multi-threaded CPU implementation of our algorithm under the same error bound. FZ-GPU also achieves an average speedup of 2.3× and an average compression ratio improvement of 2.0× over cuZFP under the same data distortion.
more » « less
Free, publicly-accessible full text available June 16, 2024
GPU-Accelerated Error-Bounded Compression Framework for Quantum Circuit Simulations

https://doi.org/10.1109/IPDPS54959.2023.00081

Shah, Milan ; Yu, Xiaodong ; Di, Sheng ; Lykov, Danylo ; Alexeev, Yuri ; Becchi, Michela ; Cappello, Franck ( May 2023 , 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS))

Quantum circuit simulations enable researchers to develop quantum algorithms without the need for a physical quantum computer. Quantum computing simulators, however, all suffer from significant memory footprint requirements, which prevents large circuits from being simulated on classical super-computers. In this paper, we explore different lossy compression strategies to substantially shrink quantum circuit tensors in the QTensor package (a state-of-the-art tensor network quantum circuit simulator) while ensuring the reconstructed data satisfy the user-needed fidelity.Our contribution is fourfold. (1) We propose a series of optimized pre- and post-processing steps to boost the compression ratio of tensors with a very limited performance overhead. (2) We characterize the impact of lossy decompressed data on quantum circuit simulation results, and leverage the analysis to ensure the fidelity of reconstructed data. (3) We propose a configurable compression framework for GPU based on cuSZ and cuSZx, two state-of-the-art GPU-accelerated lossy compressors, to address different use-cases: either prioritizing compression ratios or prioritizing compression speed. (4) We perform a comprehensive evaluation by running 9 state-of-the-art compressors on an NVIDIA A100 GPU based on QTensor-generated tensors of varying sizes. When prioritizing compression ratio, our results show that our strategies can increase the compression ratio nearly 10 times compared to using only cuSZ. When prioritizing throughput, we can perform compression at the comparable speed as cuSZx while achieving 3-4× higher compression ratios. Decompressed tensors can be used in QTensor circuit simulation to yield a final energy result within 1-5% of the true energy value.
more » « less
Free, publicly-accessible full text available May 1, 2024

« Prev Next »