skip to main content


Title: Modeling I/O performance variability in high-performance computing systems using mixture distributions
Award ID(s):
1838271
NSF-PAR ID:
10169775
Author(s) / Creator(s):
; ; ; ; ; ; ; ;
Date Published:
Journal Name:
Journal of Parallel and Distributed Computing
Volume:
139
Issue:
C
ISSN:
0743-7315
Page Range / eLocation ID:
87 to 98
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The performance of compute hardware varies: software run repeatedly on the same server (or a different server with supposedly identical parts) can produce performance results that differ with each execution. This variation has important effects on the reproducibility of systems research and ability to quantitatively compare the performance of different systems. It also has implications for commercial computing, where agreements are often made conditioned on meeting specific performance targets. Over a period of 10 months, we conducted a large-scale study capturing nearly 900,000 data points from 835 servers. We examine this data from two perspectives: that of a service provider wishing to offer a consistent environment, and that of a systems researcher who must understand how variability impacts experimental results. From this examination, we draw a number of lessons about the types and magnitudes of performance variability and the effects on confidence in experiment results. We also create a statistical model that can be used to understand how representative an individual server is of the general population. The full dataset and our analysis tools are publicly available, and we have built a system to interactively explore the data and make recommendations for experiment parameters based on statistical analysis of historical data. 
    more » « less
  2. In this paper, we study the impacts of latency variation versus latency mean on application runtime, library performance, and packet delivery. Our contributions include the design and implementation of a network latency injector that is suitable for most QLogic and Mellanox InfiniBand cards. We fit statistical distributions of latency mean and variation to varying levels of network contention for a range of parallel application workloads. We use the statistical distributions to characterize the latency variation impacts to application degradation. The level of application degradation caused by variation in network latency depends on application characteristics, and can be significant. Observed degradation varies from no degradation for applications without communicating processes to 3.5 times slower for communication-intensive parallel applications. We support our results with statistical analysis of our experimental observations. For communication-intensive high performance computing applications, we show statistically significant evidence that changes in performance are more highly correlated with changes of variation in network latency than with changes of mean network latency alone. 
    more » « less
  3. Synopsis

    To understand the complexities of morphological evolution, we must understand the relationships between genes, morphology, performance, and fitness in complex traits. Genomicists have made tremendous progress in finding the genetic basis of many phenotypes, including a myriad of morphological characters. Similarly, field biologists have greatly advanced our understanding of the relationship between performance and fitness in natural populations. However, the connection from morphology to performance has primarily been studied at the interspecific level, meaning that in most cases we lack a mechanistic understanding of how evolutionarily relevant variation among individuals affects organismal performance. Therefore, functional morphologists need methods that will allow for the analysis of fine-grained intraspecific variation in order to close the path from genes to fitness. We suggest three methodological areas that we believe are well suited for this research program and provide examples of how each can be applied within fish model systems to build our understanding of microevolutionary processes. Specifically, we believe that structural equation modeling, biological robotics, and simultaneous multi-modal functional data acquisition will open up fruitful collaborations among biomechanists, evolutionary biologists, and field biologists. It is only through the combined efforts of all three fields that we will understand the connection between evolution (acting at the level of genes) and natural selection (acting on fitness).

     
    more » « less