Recent advances in GPU-based manycore accelerators provide the opportunity to process large-scale graphs efficiently on chip. However, real-world graphs exhibit a diverse range of topology and connectivity patterns (e.g., degree distributions), which makes the design of input-agnostic hardware architectures challenging. Network-on-Chip (NoC)-based architectures offer a way to overcome this challenge, as the architectural topology can be used to approximately model the traffic patterns expected to emerge from graph application workloads. In this paper, we first study the mix of long- and short-range on-chip traffic patterns generated by graph workloads, and then use the findings to adapt the design of an optimal NoC-based architecture. In particular, by leveraging emerging three-dimensional (3D) integration technology, we propose the design of a small-world NoC (SWNoC)-enabled manycore GPU architecture, where the placement of the links connecting the streaming multiprocessors (SMs) and the memory controllers (MCs) follows a power-law distribution. The proposed 3D manycore GPU architecture outperforms its traditional planar (2D) counterparts in both performance and energy consumption. Moreover, by adopting a joint performance-thermal optimization strategy, we address the thermal concerns of a 3D design without noticeably compromising the achievable performance. The 3D integration technology is also leveraged to incorporate Near-Data Processing (NDP), complementing the performance benefits introduced by the SWNoC architecture. As graph applications are inherently memory intensive, off-chip data movement incurs latency and energy overheads in the presence of external DRAM. In conventional GPU architectures, where the main memory layer is not integrated with the logic, this off-chip data movement negatively impacts overall performance and energy consumption. We demonstrate that NDP significantly reduces the overheads associated with the frequent and irregular memory accesses of graph-based applications. The proposed SWNoC-enabled NDP framework, which integrates 3D memory (such as Micron's HMC) with a massive number of GPU cores, achieves a 29.5% performance improvement and 30.03% lower energy consumption on average compared to a conventional planar mesh-based design with external DRAM.
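The power-law link placement at the heart of the SWNoC idea can be illustrated in a few lines. The sketch below is a minimal model, not the paper's method: it assumes an N x N grid of routers per layer and inserts long-range links with probability proportional to hop distance raised to -alpha; `alpha` and `num_links` are hypothetical parameters chosen for illustration.

```python
import numpy as np

# Minimal sketch of power-law link placement for a small-world NoC.
# Assumption: routers sit on an n x n grid; the probability of adding a
# long-range link between routers i and j decays as d(i, j)**(-alpha),
# where d is the Manhattan (hop) distance. `alpha` and `num_links` are
# illustrative parameters, not values from the paper.
def small_world_links(n=8, alpha=2.0, num_links=32, seed=0):
    rng = np.random.default_rng(seed)
    coords = [(x, y) for x in range(n) for y in range(n)]
    pairs, weights = [], []
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            d = abs(coords[i][0] - coords[j][0]) + abs(coords[i][1] - coords[j][1])
            if d > 1:                      # the grid already connects neighbors
                pairs.append((i, j))
                weights.append(d ** -alpha)
    weights = np.array(weights) / sum(weights)
    idx = rng.choice(len(pairs), size=num_links, replace=False, p=weights)
    return [pairs[k] for k in idx]

links = small_world_links()
print(f"placed {len(links)} long-range links, e.g. {links[:3]}")
```

The small-world property comes from the shape of the weights: most inserted links stay short (large weight at small d), while a few long shortcuts survive to cut the network diameter.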
Measuring Network Latency Variation Impacts to High Performance Computing Application Performance
In this paper, we study the impacts of latency variation, as distinct from latency mean, on application runtime, library performance, and packet delivery. Our contributions include the design and implementation of a network latency injector suitable for most QLogic and Mellanox InfiniBand cards. We fit statistical distributions of latency mean and variation to varying levels of network contention for a range of parallel application workloads, and we use these distributions to characterize how latency variation degrades application performance. The degradation caused by variation in network latency depends on application characteristics and can be significant: observed slowdowns range from none, for applications without communicating processes, to a factor of 3.5 for communication-intensive parallel applications. We support our results with statistical analysis of our experimental observations. For communication-intensive high-performance computing applications, we show statistically significant evidence that changes in performance correlate more strongly with changes in the variation of network latency than with changes in mean network latency alone.
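To make the final claim concrete, here is a toy sketch of the correlation analysis: given per-run measurements of mean latency, latency variation, and application runtime, compare how strongly runtime correlates with each. The synthetic data and coefficients below are assumptions for illustration only, not the paper's measurements.

```python
import numpy as np
from scipy import stats

# Toy sketch: does runtime track latency *variation* more closely than
# latency *mean*? The synthetic data stands in for per-run measurements;
# in the paper these come from the InfiniBand latency injector under
# varying contention levels.
rng = np.random.default_rng(1)
n = 50
mean_latency = rng.uniform(1.0, 10.0, n)   # microseconds (illustrative)
latency_var = rng.uniform(0.1, 5.0, n)
# Assume a communication-bound app whose runtime is driven mostly by variation.
runtime = 100 + 0.5 * mean_latency + 8.0 * latency_var + rng.normal(0, 2, n)

r_mean, p_mean = stats.pearsonr(mean_latency, runtime)
r_var, p_var = stats.pearsonr(latency_var, runtime)
print(f"runtime vs mean latency:      r={r_mean:.2f} (p={p_mean:.3g})")
print(f"runtime vs latency variation: r={r_var:.2f} (p={p_var:.3g})")
```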
- NSF-PAR ID: 10060492
- Date Published:
- Journal Name: ICPE '18 Proceedings of the 2018 ACM/SPEC International Conference on Performance Engineering
- Page Range / eLocation ID: 68 to 79
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like This
Performance variation deriving from hardware and software sources is common in modern scientific and data-intensive computing systems, and synchronization in parallel and distributed programs often exacerbates its impact at scale. The decentralized and emergent effects of such variation are, unfortunately, also difficult to systematically measure, analyze, and predict: modeling assumptions stringent enough to make analysis tractable frequently cannot be guaranteed at meaningful application scales, and longitudinal methods at such scales can require the capture and manipulation of impractically large amounts of data. This paper describes a new, scalable, and statistically robust approach for effective modeling, measurement, and analysis of large-scale performance variation in HPC systems. Our approach avoids the need to reason about complex distributions of runtimes among large numbers of individual application processes by focusing instead on the maximum length of distributed workload intervals. We describe this approach and its implementation in MPI, which makes it applicable to a diverse set of HPC workloads. We also present evaluations of these techniques for quantifying and predicting performance variation, carried out on large-scale computing systems, and discuss the strengths and limitations of the underlying modeling assumptions.
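A minimal sketch of the measurement primitive behind the max-interval idea, written with mpi4py rather than the paper's own MPI implementation: each rank times its local workload interval between synchronization points, and a single max-reduction recovers the quantity being modeled. The dummy workload is an assumption.

```python
from mpi4py import MPI
import time, random

# Sketch of max-interval measurement: each rank times its own work
# interval, and the modeled quantity is the maximum across ranks,
# i.e. the interval that gates collective progress. The sleep is a
# stand-in for real work, not the paper's benchmark.
comm = MPI.COMM_WORLD

def timed_interval():
    comm.Barrier()                             # common start of the interval
    t0 = MPI.Wtime()
    time.sleep(random.uniform(0.01, 0.02))     # stand-in workload
    return MPI.Wtime() - t0

local = timed_interval()
longest = comm.allreduce(local, op=MPI.MAX)    # slowest rank's interval
if comm.rank == 0:
    print(f"max interval across {comm.size} ranks: {longest * 1e3:.2f} ms")
```

Focusing on this single maximum, rather than the full joint distribution of per-rank runtimes, is what keeps the modeling tractable at scale.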
Recently, wireless communication technologies, such as Wireless Local Area Networks (WLANs), have gained increasing popularity in industrial control systems (ICSs) due to their low cost and ease of deployment, but the communication delays associated with these technologies make them unsuitable for critical real-time and safety applications. To address concerns about network-induced delays and bring the advantages of wireless technologies into modern ICSs, wireless network infrastructure based on the Parallel Redundancy Protocol (PRP) has been proposed. Although application-specific simulations and measurements have shown that wireless network infrastructure based on PRP can be a viable solution for critical applications with stringent delay-performance constraints, little has been done to devise an analytical framework that facilitates the adoption of wireless PRP infrastructure in diverse ICSs. Leveraging deterministic network calculus (DNC) theory, we analytically derive worst-case bounds on network-induced delays for critical ICS applications. We show that the problem of worst-case delay bounding for a wireless PRP network can be solved by performing network-calculus-based analysis on its non-feedforward traffic pattern. We derive closed-form expressions for worst-case delays, which have not been reported previously and which allow ICS architects and designers to compute worst-case delay bounds for ICS tasks in their respective application domains. Our analytical results not only provide insights into the impacts of network-induced delays on latency-critical tasks but also allow ICS architects and operators to assess whether a proper wireless PRP network infrastructure can be adopted into their systems.
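As a hedged illustration of the kind of closed-form result DNC yields (this is the standard single-server bound, not the paper's PRP-specific expressions): a flow with token-bucket arrival curve crossing a server that offers a rate-latency service curve has a worst-case delay bounded by

```latex
D_{\max} \;\le\; T + \frac{\sigma}{R},
\qquad \alpha(t) = \sigma + \rho\, t,
\quad \beta(t) = R\,(t - T)^{+},
\quad R \ge \rho .
```

The paper's contribution is deriving bounds of this closed-form style for the non-feedforward traffic pattern of a wireless PRP network, where the standard single-server result does not directly apply.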
Recent work has shown that lightweight virtualization such as Docker containers can be used in HPC to package applications with their runtime environments. In many respects, applications in containers perform similarly to native applications. Other work has shown that containers can have adverse effects on the latency variation of communications with the enclosed application. This latency variation may impact the performance of some HPC workloads, especially those dependent on synchronization between processes. In this work, we measure the latency characteristics of messages between Docker containers and then compare those measurements to the performance of real-world applications. Our specific goals are to: measure the changes in mean and variation of latency with Docker containers, study how this affects the synchronization time of MPI processes, and measure the impact of these factors on real-world applications such as the NAS Parallel Benchmarks (NPB).
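A minimal sketch of the kind of latency probe such a study relies on, assuming mpi4py and two ranks that may be placed in separate containers; the message size and iteration count are illustrative, not the paper's configuration.

```python
from mpi4py import MPI
import numpy as np

# Ping-pong latency probe between two ranks; when the ranks live in
# separate Docker containers, the mean and std of these samples are
# what the container networking layer can perturb. Sizes and counts
# are illustrative assumptions.
comm = MPI.COMM_WORLD
assert comm.size >= 2, "run with at least 2 ranks"
msg = bytearray(8)
samples = []

for _ in range(1000):
    comm.Barrier()
    t0 = MPI.Wtime()
    if comm.rank == 0:
        comm.Send(msg, dest=1); comm.Recv(msg, source=1)
    elif comm.rank == 1:
        comm.Recv(msg, source=0); comm.Send(msg, dest=0)
    samples.append((MPI.Wtime() - t0) / 2)     # one-way estimate

if comm.rank == 0:
    lat = np.array(samples[100:]) * 1e6        # drop warm-up, convert to us
    print(f"latency mean={lat.mean():.2f} us  std={lat.std():.2f} us")
```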
Resource disaggregation is a new architecture for data centers in which resources like memory and storage are decoupled from the CPU, managed independently, and connected through a high-speed network. Recent work has shown that although disaggregated data centers (DDCs) provide operational benefits, applications running on DDCs experience degraded performance due to extra network latency between the CPU and their working sets in main memory. DBMSs are an interesting case study for DDCs for two main reasons: (1) DBMSs normally process data-intensive workloads and require data movement between different resource components; and (2) disaggregation drastically changes the assumption that DBMSs can rely on their own internal resource management. We take the first step to thoroughly evaluate the query execution performance of production DBMSs in disaggregated data centers. We evaluate two popular open-source DBMSs (MonetDB and PostgreSQL) and test their performance with the TPC-H benchmark in a recently released operating system for resource disaggregation. We evaluate these DBMSs with various configurations and compare their performance with that of single-machine Linux with the same hardware resources. Our results confirm that significant performance degradation does occur, but, perhaps surprisingly, we also find settings in which the degradation is minor or where DDCs actually improve performance.
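As a sketch of the timing loop such an evaluation implies (not the paper's harness): run a TPC-H-style query against PostgreSQL and record wall-clock execution times. The connection string and the simplified Q6-style query below are hypothetical assumptions.

```python
import time
import psycopg2  # assumed driver; any DB-API client works similarly

# Sketch of a query-timing loop: execute a TPC-H-style query repeatedly
# and record wall-clock time. The DSN and the simplified Q6-style query
# are illustrative assumptions, not the paper's setup.
DSN = "dbname=tpch user=postgres host=localhost"  # hypothetical
QUERY = """
    SELECT SUM(l_extendedprice * l_discount) AS revenue
    FROM lineitem
    WHERE l_shipdate >= DATE '1994-01-01'
      AND l_shipdate <  DATE '1995-01-01'
      AND l_discount BETWEEN 0.05 AND 0.07
      AND l_quantity < 24;
"""

with psycopg2.connect(DSN) as conn, conn.cursor() as cur:
    times = []
    for _ in range(5):                  # first run warms the cache
        t0 = time.perf_counter()
        cur.execute(QUERY)
        cur.fetchall()
        times.append(time.perf_counter() - t0)
    print(f"best of 5: {min(times):.3f}s  median: {sorted(times)[2]:.3f}s")
```

Running the same loop against single-machine Linux and against a disaggregated OS with identical hardware is what isolates the latency cost of remote memory.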