Title: Prediction for distributional outcomes in high-performance computing input/output variability
Abstract: Although high-performance computing (HPC) systems have been scaled to meet the exponentially growing demand for scientific computing, HPC performance variability remains a major challenge in computer science. Statistically, performance variability can be characterized by a distribution. Predicting performance variability is a critical step in HPC performance variability management. In this article, we propose a new framework to predict performance distributions. The proposed framework is a modified Gaussian process that can predict the distribution function of the input/output (I/O) throughput under a specific HPC system configuration. We also impose a monotonic constraint so that the predicted function is nondecreasing, which is a property of the cumulative distribution function. Additionally, the proposed model can incorporate both quantitative and qualitative input variables. We predict the HPC I/O distribution using the proposed method for the IOzone variability data. Data analysis results show that our framework generates accurate predictions and outperforms existing methods. We also show how the predicted functional output can be used to generate predictions for a scalar summary of the performance distribution, such as the mean, standard deviation, and quantiles. Our prediction results can further be used for HPC system variability monitoring and optimization. This article has online supplementary materials.
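The construction above is specific to the paper, but the general idea, regress empirical distribution-function values on (configuration, throughput level) with a Gaussian process and then force the predicted curve to be nondecreasing, can be pictured with a minimal sketch. Everything below is an illustrative assumption: the toy data stand in for IOzone measurements, the isotonic-regression projection stands in for the paper's monotonic constraint, and handling of qualitative inputs (e.g., via a categorical kernel) is omitted.

```python
# Minimal sketch, not the paper's estimator: fit a GP to empirical CDF values
# indexed by (configuration, throughput level), then project the prediction
# onto nondecreasing functions so it behaves like a CDF.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF
from sklearn.isotonic import IsotonicRegression

rng = np.random.default_rng(0)

# Toy data: 20 configurations (two numeric factors), 50 throughput replicates each.
configs = rng.uniform(0.0, 1.0, size=(20, 2))
samples = [rng.normal(100 + 50 * c[0], 10 + 20 * c[1], size=50) for c in configs]

y_grid = np.linspace(50.0, 250.0, 30)
X, F = [], []
for c, s in zip(configs, samples):
    for y in y_grid:
        X.append(np.r_[c, y])        # feature = (config, throughput level)
        F.append(np.mean(s <= y))    # target  = empirical CDF value
X, F = np.asarray(X), np.asarray(F)

gp = GaussianProcessRegressor(kernel=RBF(length_scale=[0.3, 0.3, 50.0]), alpha=1e-3)
gp.fit(X, F)

# Predict the CDF at a new configuration and enforce monotonicity.
x_new = np.array([0.6, 0.4])
raw = gp.predict(np.column_stack([np.tile(x_new, (len(y_grid), 1)), y_grid]))
cdf = IsotonicRegression(y_min=0.0, y_max=1.0).fit_transform(y_grid, raw)

# Scalar summaries (median, other quantiles) follow directly from the predicted CDF.
idx = min(int(np.searchsorted(cdf, 0.5)), len(y_grid) - 1)
print("predicted median throughput:", y_grid[idx])
```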
Award ID(s): 1838271
PAR ID: 10487097
Author(s) / Creator(s):
Publisher / Repository: Oxford University Press
Date Published:
Journal Name: Journal of the Royal Statistical Society Series C: Applied Statistics
Volume: 73
Issue: 3
ISSN: 0035-9254
Format(s): Medium: X
Size(s): p. 561-580
Sponsoring Org: National Science Foundation
More Like this
  1. In high-performance computing (HPC), modern supercomputers typically provide exclusive computing resources to user applications. Nevertheless, the interconnect network is shared among co-running workloads for both inter-node communication and cross-node I/O access, leading to inevitable network interference. In this study, we develop MFNetSim, a multi-fidelity modeling framework that enables simultaneous simulation of multiple traffic types over the interconnect network, including inter-process communication and I/O traffic. By combining different levels of abstraction, MFNetSim can efficiently co-model the communication and I/O traffic occurring on HPC systems equipped with flash-based storage. We conduct simulation studies of hybrid workloads composed of traditional HPC applications and emerging ML applications on a 1,056-node Dragonfly system with various configurations. Our analysis provides various observations regarding how network interference affects communication and I/O traffic.
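As a caricature of the multi-fidelity idea in the abstract above (not MFNetSim's actual models), one can spend detailed queueing computation only on the traffic class of interest and fall back to a coarse analytic delay for the rest; all constants and formulas below are illustrative assumptions.

```python
# Hedged sketch of multi-fidelity link modeling: detailed queueing delay for
# I/O traffic, coarse analytic estimate for MPI traffic.
from dataclasses import dataclass

LINK_BW = 25e9      # bytes/s, assumed link bandwidth
BASE_LAT = 1e-6     # seconds, assumed per-hop latency

@dataclass
class Flow:
    kind: str        # "mpi" or "io"
    size: int        # bytes
    hops: int

def low_fidelity_delay(flow: Flow) -> float:
    # Coarse estimate: serialization plus per-hop latency, ignoring contention.
    return flow.size / LINK_BW + flow.hops * BASE_LAT

def high_fidelity_delay(flow: Flow, queued_bytes: int) -> float:
    # Detailed estimate: also wait behind bytes already queued on the link.
    return (queued_bytes + flow.size) / LINK_BW + flow.hops * BASE_LAT

def delay(flow: Flow, queued_bytes: int) -> float:
    # Multi-fidelity dispatch: spend modeling effort only on I/O traffic.
    if flow.kind == "io":
        return high_fidelity_delay(flow, queued_bytes)
    return low_fidelity_delay(flow)

print(delay(Flow("io", 4 << 20, hops=3), queued_bytes=16 << 20))
print(delay(Flow("mpi", 64 << 10, hops=3), queued_bytes=16 << 20))
```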
  2. Deep Neural Networks (DNNs) have been applied as an effective machine learning algorithm to tackle problems in different domains. However, training sophisticated DNN models can stretch from days into weeks, presenting substantial obstacles for research on large-scale DNN architectures. Distributed Deep Learning (DDL) helps accelerate DNN training by distributing training workloads across multiple computation accelerators, for example, graphics processing units (GPUs). Despite the considerable amount of research directed toward enhancing DDL training, the influence of data loading on GPU utilization and overall training efficacy remains relatively overlooked. Optimizing data loading is non-trivial for DDL applications, which need intensive central processing unit (CPU) and input/output (I/O) resources to process enormous volumes of training data. When multiple DDL applications are deployed on a system (e.g., a cloud or high-performance computing (HPC) system), the lack of a practical and efficient data-loader allocation technique causes GPU idleness and degrades training throughput. Therefore, our work first investigates the impact of data loading on global training throughput. We then propose a throughput prediction model to predict the maximum throughput of an individual DDL training application. Leveraging the predicted results, A-Dloader dynamically allocates CPU and I/O resources to concurrently running DDL applications and uses data-loader allocation as a knob to reduce GPU idle intervals and thus improve overall training throughput. We implement and evaluate A-Dloader in a DDL framework for a series of DDL applications that arrive and complete over time. Our experimental results show that A-Dloader achieves a 28.9% throughput improvement and a 10% makespan improvement compared with allocating resources evenly across applications.
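A minimal sketch of the allocation idea in the abstract above, assuming a per-job throughput predictor is available: hand out data-loader CPU workers one at a time to whichever running DDL job gains the most predicted throughput. The saturating-throughput model and the greedy loop are illustrative assumptions, not A-Dloader's actual algorithm.

```python
# Hedged sketch: split a fixed pool of data-loader CPU workers across
# concurrent DDL jobs using a predicted throughput curve per job.

def predicted_throughput(job: dict, workers: int) -> float:
    # Toy model: throughput scales with workers until it saturates at the
    # job's predicted maximum throughput.
    return min(workers * job["samples_per_worker"], job["max_throughput"])

def allocate(jobs: list[dict], total_workers: int) -> dict[str, int]:
    alloc = {j["name"]: 0 for j in jobs}
    for _ in range(total_workers):
        # Greedy: give the next worker to the job with the largest marginal gain.
        best = max(
            jobs,
            key=lambda j: predicted_throughput(j, alloc[j["name"]] + 1)
                          - predicted_throughput(j, alloc[j["name"]]),
        )
        alloc[best["name"]] += 1
    return alloc

jobs = [
    {"name": "resnet50", "samples_per_worker": 300.0, "max_throughput": 2400.0},
    {"name": "bert",     "samples_per_worker": 120.0, "max_throughput":  600.0},
]
print(allocate(jobs, total_workers=16))
```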
  3. Parallel file systems (PFSes) and parallel I/O libraries have been the backbone of high-performance computing (HPC) infrastructures for decades. However, their crash-consistency bugs have not been extensively studied, and the corresponding bug-finding or testing tools are lacking. In this paper, we first conduct a thorough bug study on popular PFSes, such as BeeGFS and OrangeFS, with a cross-stack approach that covers the HPC I/O library, the PFS, and interactions with local file systems. The study results drive our design of a scalable testing framework named PFSCHECK. PFSCHECK is easy to use and automatically generates test cases that can trigger potential crash-consistency bugs, tracing the essential file operations with low performance overhead. PFSCHECK scales to large HPC clusters, as it can exploit parallelism to facilitate the verification of persistent storage states.
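One way to picture this style of testing (a simplified stand-in, not PFSCHECK itself): record the file operations a workload issues, treat every prefix of that trace as a possible crash point, replay it against a model of persistent state, and check an application-level invariant. The paths, operations, and invariant below are made-up illustrations.

```python
# Hedged sketch of prefix-based crash-consistency checking. A real tool traces
# actual PFS and I/O-library calls and inspects on-disk state; here the
# "persistent state" is just a dict, an illustrative assumption.

def replay(prefix):
    state = {}
    for op, path, arg in prefix:
        if op == "create":
            state[path] = b""
        elif op == "append":
            state[path] = state.get(path, b"") + arg
        elif op == "rename":
            state[arg] = state.pop(path)    # arg holds the new name
    return state

def invariant_holds(state):
    # Application-level invariant: if job.log exists, it is complete.
    log = state.get("job.log")
    return log is None or log == b"step1step2"

# A workload that writes the log in place (no atomic rename) is vulnerable.
trace = [
    ("create", "job.log", None),
    ("append", "job.log", b"step1"),
    ("append", "job.log", b"step2"),
]

# Every prefix of the trace is a possible crash point.
for i in range(len(trace) + 1):
    if not invariant_holds(replay(trace[:i])):
        print(f"potential crash-consistency bug after {i} operation(s)")
```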
  4. Exascale computing enables unprecedented, detailed, and coupled scientific simulations which generate data on the order of tens of petabytes. Due to these large data volumes, lossy compressors become indispensable, as they achieve better compression ratios and runtime performance than lossless compressors. Moreover, as high-performance computing (HPC) systems grow larger, they draw power on the scale of tens of megawatts. Data motion is expensive in both time and energy. Therefore, optimizing compressor and data I/O power usage is an important step in reducing energy consumption to meet sustainable computing goals and stay within limited power budgets. In this paper, we explore efficient power consumption gains for the SZ and ZFP lossy compressors and data writing on a cloud HPC system while varying the CPU frequency, scientific data sets, and system architecture. Using this power consumption data, we construct a power model for lossy compression and present a tuning methodology that reduces the energy overhead of lossy compressors and data writing on HPC systems by 14.3% on average. We apply our model and find 6.5 kJ, or 13%, of savings on average for 512 GB of I/O. Utilizing our model therefore results in more energy-efficient lossy data compression and I/O.
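The tuning idea, choosing the CPU frequency that minimizes modeled energy (power times time) across the compression and write stages, can be pictured with a toy model; the scaling laws and constants below are illustrative assumptions, not the paper's measured power model. (As a sanity check on the abstract's numbers, 6.5 kJ at 13% implies a baseline of roughly 50 kJ for the 512 GB write.)

```python
# Hedged sketch: pick a CPU frequency minimizing compress+write energy.
# Power/time scaling and constants are toy assumptions for illustration.

FREQS_GHZ = [1.2, 1.6, 2.0, 2.4, 2.8]
F_BASE = 2.4  # GHz, frequency at which baseline times/powers are assumed measured

def stage_energy(freq, base_time_s, base_power_w, cpu_bound):
    # CPU-bound stages (compression) speed up with frequency; the I/O-bound
    # write stage does not. Power has a static part plus a dynamic part.
    time_s = base_time_s * (F_BASE / freq if cpu_bound else 1.0)
    power_w = base_power_w * (0.6 + 0.4 * (freq / F_BASE) ** 2)
    return power_w * time_s

def total_energy(freq):
    compress = stage_energy(freq, base_time_s=60.0, base_power_w=400.0, cpu_bound=True)
    write = stage_energy(freq, base_time_s=90.0, base_power_w=300.0, cpu_bound=False)
    return compress + write

best = min(FREQS_GHZ, key=total_energy)
baseline, tuned = total_energy(F_BASE), total_energy(best)
print(f"best frequency {best} GHz: {baseline - tuned:.0f} J saved "
      f"({100 * (1 - tuned / baseline):.1f}% of {baseline / 1000:.0f} kJ)")
```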
  5. Parallel File Systems (PFSs) are frequently deployed on leadership High Performance Computing (HPC) systems to ensure efficient I/O, persistent storage, and scalable performance. Emerging Deep Learning (DL) applications impose new I/O and storage requirements on HPC systems, with batched input of small random files. This mandates that PFSs provide commensurate features to meet the needs of DL applications. BeeGFS is a recently emerging PFS that has attracted attention from both research and industry because of its performance, scalability, and ease of use. While emphasizing a systematic performance analysis of BeeGFS, in this paper we present the architectural and system features of BeeGFS and perform an experimental evaluation using cutting-edge I/O, metadata, and DL application benchmarks. In particular, we utilize the AlexNet and ResNet-50 models for classification of the ImageNet dataset using the Livermore Big Artificial Neural Network Toolkit (LBANN) and an ImageNet data reader pipeline atop TensorFlow and Horovod. Through extensive performance characterization of BeeGFS, our study provides useful documentation on how to leverage BeeGFS for emerging DL applications.
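The I/O pattern that makes such DL workloads demanding for a PFS, each worker reading its shard of many small files in random order, batch by batch, can be sketched with the standard library alone; the directory name, shard logic, and batch size below are illustrative assumptions, and a real pipeline would use LBANN's or TensorFlow/Horovod's data readers.

```python
# Hedged sketch of the DL I/O pattern described above: each worker reads its
# shard of a large set of small files in random order, one batch at a time.
import random
from pathlib import Path

def shard(files, rank, world_size):
    # Round-robin sharding of the global file list across workers.
    return [f for i, f in enumerate(files) if i % world_size == rank]

def batches(files, batch_size, seed=0):
    order = files[:]
    random.Random(seed).shuffle(order)        # random order each epoch
    for i in range(0, len(order), batch_size):
        yield [Path(f).read_bytes() for f in order[i:i + batch_size]]

if __name__ == "__main__":
    all_files = sorted(str(p) for p in Path("imagenet_shards").glob("*.jpg"))
    my_files = shard(all_files, rank=0, world_size=4)
    for batch in batches(my_files, batch_size=64):
        pass  # feed the decoded batch to the training step
```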