MFNetSim: A Multi-Fidelity Network Simulation Framework for Multi-Traffic Modeling of Dragonfly Systems

Wang, Xin; Brown, Kevin A; Ross, Robert B; Carothers, Christopher D; Lan, Zhiling

doi:10.1145/3729424

Citation Details

This content will become publicly available on September 12, 2026

MFNetSim: A Multi-Fidelity Network Simulation Framework for Multi-Traffic Modeling of Dragonfly Systems

In high-performance computing (HPC), modern supercomputers typically provide exclusive computing resources to user applications. Nevertheless, the interconnect network is a shared resource for both inter-node communication and across-node I/O access, among co-running workloads, leading to inevitable network interference. In this study, we develop MFNetSim, a multi-fidelity modeling framework that enables simulation of multi-traffic simultaneously over the interconnect network, including inter-process communication and I/O traffic. By combining different levels of abstraction, MFNetSim can efficiently co-model the communication and I/O traffic occurring on HPC systems equipped with flash-based storage. We conduct simulation studies of hybrid workloads composed of traditional HPC applications and emerging ML applications on a 1,056-node Dragonfly system with various configurations. Our analysis provides various observations regarding how network interference affects communication and I/O traffic. more »

Award ID(s):: 2413597 2402901

PAR ID:: 10616977

Author(s) / Creator(s):: Wang, Xin; Brown, Kevin A; Ross, Robert B; Carothers, Christopher D; Lan, Zhiling

Publisher / Repository:: ACM

Date Published:: 2025-09-12

Journal Name:: ACM Transactions on Modeling and Computer Simulation

ISSN:: 1049-3301

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on September 12, 2026
Journal Article:
https://doi.org/10.1145/3729424

More Like this