NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Achieving the Performance of Global Adaptive Routing using Local Information on Dragonfly through Deep Learning

Chaulagain, Ram Sharan; Liza, Fatema Tabassum; Chunduri, Sudheer; Yuan, Xin; Lang, Michael (November 2020, ACM/IEEE SC tech poster)
null (Ed.)
he Universal Globally Adaptive Load-balance Routing (UGAL) with global information, referred as UGAL-G, represents an ideal form of adaptive routing on Dragonfly. UGAL-G is impractical to implement, however, since the global information cannot be maintained accurately. Practical adaptive routing schemes, such as UGAL with local information (UGAL-L), performs noticeably worse than UGAL-G. In this work, we investigate a machine learning approach for routing on Dragonfly. Specifically, we develop a machine learning-based routing scheme, called UGAL-ML, that is capable of making routing decisions like UGAL-G based only on the information local to each router. Our preliminary evaluation indicates that UGAL-ML can achieve comparable performance to UGAL-G for some traffic patterns.
more » « less
Full Text Available
Widespread introgression across a phylogeny of 155 Drosophila genomes

https://doi.org/10.1016/j.cub.2021.10.052

Suvorov, Anton; Kim, Bernard Y.; Wang, Jeremy; Armstrong, Ellie E.; Peede, David; D’Agostino, Emmanuel R.R.; Price, Donald K.; Waddell, Peter J.; Lang, Michael; Courtier-Orgogozo, Virginie; et al (January 2022, Current Biology)

Full Text Available
Topology-custom UGAL routing on dragonfly

https://doi.org/10.1145/3295500.3356208

Rahman, Md Shafayat; Bhowmik, Saptarshi; Ryasnianskiy, Yevgeniy; Yuan, Xin; Lang, Michael (November 2019, SC '19: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis)

The Dragonfly network has been deployed in the current generation supercomputers and will be used in the next generation supercomputers. The Universal Globally Adaptive Load-balance routing (UGAL) is the state-of-the-art routing scheme for Dragonfly. In this work, we show that the performance of the conventional UGAL can be further improved on many practical Dragonfly networks, especially the ones with a small number of groups, by customizing the paths used in UGAL for each topology. We develop a scheme to compute the custom sets of paths for each topology and compare the performance of our topology-custom UGAL routing (T-UGAL) with conventional UGAL. Our evaluation with different UGAL variations and different topologies demonstrates that by customizing the routes, T-UGAL offers significant improvements over UGAL on many practical Dragonfly networks in terms of both latency when the network is under low load and throughput when the network is under high load.
more » « less
Full Text Available
TCASM: An asynchronous shared memory interface for high-performance application composition

https://doi.org/10.1016/j.parco.2017.01.003

Otstott, Douglas; Ionkov, Latchesar; Lang, Michael; Zhao, Ming (April 2017, Parallel Computing)

Full Text Available

Search for: All records