

Title: HTCondor data movement at 100 Gbps
HTCondor is a major workload management system used in distributed high throughput computing (dHTC) environments, e.g., the Open Science Grid. One of the distinguishing features of HTCondor is its native support for data movement, allowing it to operate without a shared filesystem. Coupling data handling and compute scheduling is both convenient for users and allows for significant infrastructure flexibility, but it does introduce some limitations. The default HTCondor data transfer mechanism routes both the input and output data through the submission node, making it a potential bottleneck. In this document we show that, by using a node equipped with a 100 Gbps network interface card (NIC), HTCondor can serve data at up to 90 Gbps, which is sufficient for most current use cases, as it would saturate the border network links of most research universities at the time of writing.
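To make the mechanism above concrete, here is a minimal sketch of a file-transfer job using the htcondor Python bindings (our illustration, not from the paper; the executable and file names are placeholders):

```python
import htcondor  # HTCondor Python bindings

# Minimal job description: HTCondor itself stages the input file to the
# execute node and brings outputs back, so no shared filesystem is needed.
# All file names here are placeholders.
job = htcondor.Submit({
    "executable": "process.sh",
    "transfer_input_files": "input.dat",   # staged through the submission node
    "should_transfer_files": "YES",
    "when_to_transfer_output": "ON_EXIT",  # outputs routed back the same way
    "output": "job.out",
    "error": "job.err",
    "log": "job.log",
})

schedd = htcondor.Schedd()
schedd.submit(job, count=1)  # queue one instance of the job
```

Because both directions of this traffic pass through the submission node, that node's NIC is the natural place to measure the 90 Gbps figure reported above.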
Award ID(s): 2030508
NSF-PAR ID: 10357989
Author(s) / Creator(s): ; ; ;
Date Published:
Journal Name: 2021 IEEE 17th International Conference on eScience (eScience)
Issue: September 2021
Page Range / eLocation ID: 239 to 240
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Ensuring high availability (HA) for software-based networks is a critical design feature that will help the adoption of software-based network functions (NFs) in production networks. It is important for NFs to avoid outages and maintain mission-critical operations. However, HA support for NFs on the critical data path can result in unacceptable performance degradation. We present REINFORCE, an integrated framework to support efficient resiliency for NFs and NF service chains. REINFORCE includes timely failure detection and consistent failover mechanisms. REINFORCE replicates state to standby NFs (local and remote) while enforcing correctness. It minimizes the number of state transfers by exploiting the concept of external synchrony, and leverages opportunistic batching and multi-buffering to optimize performance. Experimental results show that, even at line-rate packet processing (10 Gbps), REINFORCE achieves chain-level failover across servers in a LAN (or within the same node) within 10 ms (100 μs), incurring less than 10% (1%) performance overhead, and adds an average latency of only ~400 μs (5 μs), with a worst-case latency of less than 1 ms (10 μs).
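As a rough illustration of the external-synchrony idea described above (our sketch, not REINFORCE code): state updates are buffered and replicated to the standby in one batch just before the corresponding outputs become externally visible.

```python
class Standby:
    """Stand-in for a local or remote standby NF instance."""
    def __init__(self):
        self.state = []

    def apply(self, updates):
        self.state.extend(updates)  # catch up in a single batched transfer


class ExternalSyncReplicator:
    """Toy model of external synchrony: state changes are buffered and only
    replicated to the standby before output becomes externally visible."""
    def __init__(self, standby):
        self.standby = standby
        self.pending_updates = []   # state deltas not yet replicated
        self.held_packets = []      # outputs held until state is durable

    def on_packet(self, pkt, state_delta):
        # Internal processing: record the delta and hold the output packet.
        self.pending_updates.append(state_delta)
        self.held_packets.append(pkt)

    def release_outputs(self):
        # One batched state transfer covers many packets (opportunistic
        # batching); only then may the held packets leave the system.
        self.standby.apply(self.pending_updates)
        self.pending_updates = []
        released, self.held_packets = self.held_packets, []
        return released
```

The point of the sketch is the ordering: replication happens once per output batch, not once per packet.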
  2. State machine replication (SMR) is a core mechanism for building highly available and consistent systems. In this paper, we propose Waverunner, a new approach to accelerating SMR using FPGA-based SmartNICs. Our approach does not implement the entire SMR system in hardware; instead, it is a hybrid software/hardware system. We make the observation that, despite the complexity of SMR, the most common routine, data replication, is actually simple. The complex parts (leader election, failure recovery, etc.) are rarely used in modern datacenters, where failures are only occasional. These complex routines are not performance critical; their software implementations are fast enough and do not need acceleration. Therefore, our system uses FPGA assistance to accelerate data replication and leaves the rest to the traditional software implementation of SMR. Our Waverunner approach is beneficial in both the common and the rare case. In the common case, the system runs at the speed of the network, with a 99th percentile latency of 1.8 µs achieved without batching on minimum-size packets at network line rate (85.5 Gbps in our evaluation). In rare cases, to handle uncommon situations such as leader failure and failure recovery, the system uses traditional software to guarantee correctness, which is much easier to develop and maintain than hardware-based implementations. Overall, our experience confirms Waverunner as an effective and practical solution for hardware-accelerated SMR, achieving most of the benefits of hardware acceleration with minimal added complexity and implementation effort.
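A schematic sketch of the hybrid split described above (ours; Waverunner's actual fast path is an FPGA, modeled here by a stub):

```python
from enum import Enum, auto

class Mode(Enum):
    FAST = auto()   # stable leader: replication offloaded (FPGA in Waverunner)
    SLOW = auto()   # election / recovery: handled entirely in software

class FastPathStub:
    """Stand-in for the hardware offload: simply acknowledges appends."""
    def append_and_ack(self, entry):
        return ("ack", entry)

class SlowPathStub:
    """Stand-in for the traditional software SMR implementation."""
    def replicate(self, entry):
        return ("ack-sw", entry)

    def run_recovery(self):
        pass  # leader election / log repair would happen here

class HybridSMR:
    """Schematic hybrid node: common-case replication on the fast path,
    rare-case control logic (election, recovery) on the software path."""
    def __init__(self, fast_path, slow_path):
        self.fast_path = fast_path
        self.slow_path = slow_path
        self.mode = Mode.FAST

    def replicate(self, entry):
        if self.mode is Mode.FAST:
            return self.fast_path.append_and_ack(entry)  # hot loop, no complex logic
        return self.slow_path.replicate(entry)           # rare, correctness-first path

    def on_failure_detected(self):
        self.mode = Mode.SLOW         # drop to software on leader failure
        self.slow_path.run_recovery()
        self.mode = Mode.FAST         # resume offloaded replication
```

The division of labor mirrors the abstract: the hot path is kept trivially simple, and everything complex stays in easily maintained software.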
  3. With the ever-increasing size of training models and datasets, network communication has emerged as a major bottleneck in distributed deep learning training. To address this challenge, we propose an optical distributed deep learning (ODDL) architecture. ODDL utilizes a fast yet scalable all-optical network architecture to accelerate distributed training. One of the key features of the architecture is its flow-based transmit scheduling with fast reconfiguration. This allows ODDL to dynamically allocate dedicated optical paths for each traffic stream, resulting in low network latency and high network utilization. Additionally, ODDL provides physically isolated and tailored network resources for training tasks by reconfiguring the optical switch using LCoS-WSS technology. The ODDL topology also uses tunable transceivers to adapt to time-varying traffic patterns. To achieve accurate and fine-grained scheduling of optical circuits, we propose an efficient distributed control scheme that incurs minimal delay overhead. Our evaluation on real-world traces showcases ODDL's remarkable performance. When implemented with 1024 nodes and 100 Gbps bandwidth, ODDL accelerates VGG19 training by 1.6× and 1.7× compared to conventional fat-tree electrical networks and photonic SiP-Ring architectures, respectively. We further build a four-node testbed, and our experiments show that ODDL achieves training times comparable to those of an ideal electrical switching network.
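A greedy toy version of flow-based path allocation (ours, not ODDL's actual scheduler; the wavelength-slot model is a simplification):

```python
class OpticalScheduler:
    """Toy flow-based scheduler: grant each traffic stream a dedicated
    optical path, modeled as one wavelength slot per (src, dst) pair."""
    def __init__(self, wavelengths_per_link):
        self.capacity = wavelengths_per_link
        self.in_use = {}  # (src, dst) -> number of active flows

    def request_path(self, src, dst):
        used = self.in_use.get((src, dst), 0)
        if used < self.capacity:
            # "Reconfigure" the switch: dedicate one more slot to this pair.
            self.in_use[(src, dst)] = used + 1
            return True
        return False  # no free slot: the flow must wait

    def release_path(self, src, dst):
        self.in_use[(src, dst)] -= 1  # flow finished; the slot is reusable

# Example: two slots per link means a third concurrent flow must wait.
sched = OpticalScheduler(wavelengths_per_link=2)
assert sched.request_path("node0", "node1")
assert sched.request_path("node0", "node1")
assert not sched.request_path("node0", "node1")
```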
  4. Biscarat, C.; Campana, S.; Hegner, B.; Roiser, S.; Rovelli, C.I.; Stewart, G.A. (Eds.)
    CMS is tackling the exploitation of CPU resources at HPC centers where compute nodes do not have network connectivity to the Internet. Pilot agents and payload jobs need to interact with external services from the compute nodes: access to the application software (CernVM-FS) and conditions data (Frontier), management of input and output data files (data management services), and job management (HTCondor). Finding an alternative route to these services is challenging. Seamless integration into the CMS production system without causing any operational overhead is a key goal. The case of the Barcelona Supercomputing Center (BSC), in Spain, is particularly challenging due to its especially restrictive network setup. In this paper we describe the solutions developed within CMS to overcome these restrictions and to integrate this resource in production. Singularity containers with application software releases are built and pre-placed in the HPC facility's shared file system, together with conditions data files. HTCondor has been extended to relay communications between running pilot jobs and HTCondor daemons through the HPC shared file system. This operation mode also allows piping input and output data files through the HPC file system. Results, issues encountered during the integration process, and remaining concerns are discussed.
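A bare-bones sketch of relaying messages over a shared filesystem (ours; the directory layout and file naming are invented for illustration): the isolated node drops request files, and a gateway with outside connectivity polls for them and writes responses.

```python
import json
import time
import uuid
from pathlib import Path

RELAY_DIR = Path("/shared/relay")  # hypothetical directory on the HPC shared FS

def send_request(payload, timeout=300.0, poll=2.0):
    """Send a message from a network-isolated compute node and wait for a
    gateway process (with outside connectivity) to write the reply."""
    msg_id = uuid.uuid4().hex
    resp = RELAY_DIR / f"{msg_id}.resp"
    # The gateway polls for *.req files, forwards them, and writes *.resp.
    (RELAY_DIR / f"{msg_id}.req").write_text(json.dumps(payload))
    deadline = time.time() + timeout
    while time.time() < deadline:
        if resp.exists():
            return json.loads(resp.read_text())
        time.sleep(poll)  # the shared filesystem is the only channel
    raise TimeoutError(f"no relay response for message {msg_id}")
```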
  5. Causal structure learning (CSL) refers to the estimation of causal graphs from data. Causal versions of tools such as ROC curves play a prominent role in empirical assessment of CSL methods and performance is often compared with “random” baselines (such as the diagonal in an ROC analysis). However, such baselines do not take account of constraints arising from the graph context and hence may represent a “low bar”. In this paper, motivated by examples in systems biology, we focus on assessment of CSL methods for multivariate data where part of the graph structure is known via interventional experiments. For this setting, we put forward a new class of baselines called graph-based predictors (GBPs). In contrast to the “random” baseline, GBPs leverage the known graph structure, exploiting simple graph properties to provide improved baselines against which to compare CSL methods. We discuss GBPs in general and provide a detailed study in the context of transitively closed graphs, introducing two conceptually simple baselines for this setting, the observed in-degree predictor (OIP) and the transitivity-assuming predictor (TAP). While the former is straightforward to compute, for the latter we propose several simulation strategies. Moreover, we study and compare the proposed predictors theoretically, including a result showing that the OIP outperforms in expectation the “random” baseline on a subclass of latent network models featuring positive correlation among edge probabilities. Using both simulated and real biological data, we show that the proposed GBPs outperform random baselines in practice, often substantially. Some GBPs even outperform standard CSL methods (whilst being computationally cheap in practice). Our results provide a new way to assess CSL methods for interventional data.
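For intuition, a small sketch of the observed in-degree predictor (our reading of the one-line description above; names are ours): each candidate edge (u, v) is scored by the in-degree of v among the already-known edges.

```python
def oip_scores(known_edges, nodes):
    """Observed in-degree predictor (sketch): score a candidate edge (u, v)
    by the in-degree of v in the known part of the graph."""
    in_degree = {v: 0 for v in nodes}
    for _, target in known_edges:
        in_degree[target] += 1
    # A node that already receives many edges makes any new incoming
    # edge more plausible a priori, so it gets a higher score.
    return {(u, v): in_degree[v]
            for u in nodes for v in nodes if u != v}

# Tiny example: both known edges point into "c", so candidates into "c"
# score 2 while candidates into "a" or "b" score 0.
scores = oip_scores({("a", "c"), ("b", "c")}, ["a", "b", "c"])
assert scores[("a", "c")] == 2 and scores[("c", "a")] == 0
```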