Topology-custom UGAL routing on dragonfly

Rahman, Md Shafayat; Bhowmik, Saptarshi; Ryasnianskiy, Yevgeniy; Yuan, Xin; Lang, Michael

doi:10.1145/3295500.3356208

Citation Details

The Dragonfly network has been deployed in current-generation supercomputers and will be used in next-generation supercomputers. Universal Globally Adaptive Load-balance routing (UGAL) is the state-of-the-art routing scheme for Dragonfly. In this work, we show that the performance of conventional UGAL can be further improved on many practical Dragonfly networks, especially those with a small number of groups, by customizing the paths used in UGAL for each topology. We develop a scheme to compute the custom sets of paths for each topology and compare the performance of our topology-custom UGAL routing (T-UGAL) with conventional UGAL. Our evaluation with different UGAL variations and different topologies demonstrates that by customizing the routes, T-UGAL offers significant improvements over UGAL on many practical Dragonfly networks, in terms of both latency when the network is under low load and throughput when the network is under high load.
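For readers unfamiliar with UGAL, the sketch below illustrates the kind of per-packet decision the abstract refers to: the router compares an estimated delay (queue occupancy times hop count) for the minimal path against that of a non-minimal candidate and takes the cheaper one. This is a simplified illustration, not the authors' implementation. The names `Path`, `ugal_choose`, and `bias` are hypothetical, and real UGAL variants typically sample one or a few random candidates rather than scanning a list. The `candidates` argument is where a topology-specific path set would plug in; how T-UGAL computes that set is described in the paper and is not reproduced here.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Path:
    """Hypothetical path summary used only for this illustration."""
    hops: int       # number of links the path traverses
    queue_len: int  # occupancy of the first-hop output queue (local congestion estimate)


def estimated_delay(path: Path) -> int:
    # Common UGAL heuristic: queue occupancy multiplied by hop count.
    return path.hops * path.queue_len


def ugal_choose(minimal: Path, candidates: Sequence[Path], bias: int = 0) -> Path:
    """Return the minimal path unless a non-minimal candidate looks cheaper.

    `bias` is an assumed threshold that favors the minimal path.
    """
    if not candidates:
        return minimal
    best = min(candidates, key=estimated_delay)
    if estimated_delay(minimal) <= estimated_delay(best) + bias:
        return minimal
    return best


if __name__ == "__main__":
    congested_min = Path(hops=3, queue_len=10)
    detours = [Path(hops=6, queue_len=2), Path(hops=5, queue_len=8)]
    print(ugal_choose(congested_min, detours))
    print(ugal_choose(Path(hops=3, queue_len=0), detours))
```

Under low load the minimal path's queue is short, so it wins the comparison. Under high load a longer but less congested path can be selected, which is why the quality of the candidate set matters.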

Award ID(s): 1822737; 1738912

PAR ID: 10162956

Author(s) / Creator(s): Rahman, Md Shafayat; Bhowmik, Saptarshi; Ryasnianskiy, Yevgeniy; Yuan, Xin; Lang, Michael

Date Published: 2019-11-17

Journal Name: SC '19: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis

Page Range / eLocation ID: 1 to 15

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1145/3295500.3356208
