Many scientific applications are expressed as high-throughput workflows that consist of large graphs of data assets and tasks to be executed on large parallel and distributed systems. A challenge in executing these workflows is managing data: both datasets and software must be efficiently distributed to cluster nodes; intermediate data must be conveyed between tasks; output data must be delivered to its destination. Scaling problems result when these actions are performed in an uncoordinated manner on a shared filesystem. To address this problem, we introduce TaskVine: a system for exploiting the aggregate local storage and network capacity of a large cluster. TaskVine tracks the lifetime of data in a workflow, from archival sources to final outputs, making use of local storage to distribute and re-use data wherever possible. We describe the architecture and novel capabilities of TaskVine, and demonstrate its use with applications in genomics, high energy physics, molecular dynamics, and machine learning.
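As a concrete illustration of this model, a minimal TaskVine driver in Python might look like the sketch below. It follows the published ndcctools.taskvine examples; the file names are hypothetical, and the exact signatures should be checked against the TaskVine documentation for the installed version.

```python
# A minimal sketch following the published ndcctools.taskvine examples;
# file names are hypothetical.
import ndcctools.taskvine as vine

m = vine.Manager(9123)              # listen for workers on port 9123
print(f"manager listening on port {m.port}")

# Declare data once: TaskVine caches declared files on worker-local
# storage and re-uses them across tasks instead of re-reading them
# from a shared filesystem.
data = m.declare_file("input.dat")  # hypothetical input dataset
out = m.declare_file("result.txt")

t = vine.Task("wc -l input.dat > result.txt")
t.add_input(data, "input.dat")
t.add_output(out, "result.txt")
m.submit(t)

while not m.empty():
    t = m.wait(5)
    if t:
        print(f"task {t.id} exited with status {t.exit_code}")
```

Because files are declared to the manager rather than opened ad hoc, TaskVine knows each file's producers and consumers and can keep it on worker-local disks for as long as the workflow needs it.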
Maximizing Data Utility for HPC Python Workflow Execution
Large-scale HPC workflows are increasingly implemented in dynamic languages such as Python, which allow for more rapid development than traditional techniques. However, the cost of executing Python applications at scale is often dominated by the distribution of common datasets and complex software dependencies. As the application scales up, data distribution becomes a limiting factor that prevents scaling beyond a few hundred nodes. To address this problem, we present the integration of Parsl (a Python-native parallel programming library) with TaskVine (a data-intensive workflow execution engine). Instead of relying on a shared filesystem to provide data to tasks on demand, Parsl is able to express advance data needs to TaskVine, which then performs efficient data distribution at runtime. This combination provides a performance speedup of 1.48x over the typical method of on-demand paging from the shared filesystem, along with an average task speedup of 1.79x with 2048 tasks on 256 nodes.
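In practice, this integration is exposed through Parsl's TaskVine executor. A minimal sketch, assuming parsl.executors.taskvine.TaskVineExecutor with default options, might look like the following; verify names against the installed Parsl version.

```python
# A minimal sketch of running Parsl apps on TaskVine workers; executor
# options are left at defaults and should be tuned for a real cluster.
import parsl
from parsl import python_app
from parsl.config import Config
from parsl.executors.taskvine import TaskVineExecutor

# Each @python_app call becomes a TaskVine task; the serialized function
# and its Python environment are distributed and cached on worker-local
# storage rather than paged on demand from the shared filesystem.
parsl.load(Config(executors=[TaskVineExecutor(label="taskvine")]))

@python_app
def simulate(seed):
    import random
    random.seed(seed)
    return sum(random.random() for _ in range(10_000))

# (On a cluster, TaskVine workers are typically started separately,
# e.g. with vine_worker; details depend on the deployment.)
futures = [simulate(i) for i in range(2048)]
print(max(f.result() for f in futures))
```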
- Award ID(s): 1931348
- PAR ID: 10567833
- Publisher / Repository: ACM
- Date Published:
- ISBN: 9798400707858
- Page Range / eLocation ID: 637 to 640
- Format(s): Medium: X
- Location: Denver, CO, USA
- Sponsoring Org: National Science Foundation
More Like this
- High energy physics experiments produce petabytes of data annually that must be reduced to gain insight into the laws of nature. Early-stage reduction executes long-running high-throughput workflows across thousands of nodes spanning multiple facilities to produce shared datasets. Later stages are typically written by individuals or small groups and must be refined and re-run many times for correctness. Reducing iteration times of later stages is key to accelerating discovery. We demonstrate our experience reshaping late-stage analysis applications on thousands of nodes. It is not enough merely to increase scale: it is necessary to make changes throughout the stack, including storage systems, data management, task scheduling, and application design. We demonstrate these changes when applied to two analysis applications built on open source data analysis frameworks (Coffea, Dask, TaskVine). We evaluate the performance of the applications on opportunistic campus clusters, showing effective scaling up to 7200 cores, thus producing significant speedup.
- Modern scientific workflows need to mix several different computing modalities: self-contained computational tasks, data-intensive transformations, and serverless function calls. To date, these modalities have required distinct system architectures with different scheduling objectives and constraints. In this paper, we describe how TaskVine, a new workflow execution platform, combines these modalities into an execution platform with shared abstractions. We demonstrate results of the system executing a machine learning workflow with combined standalone tasks and serverless functions. (A hedged sketch of this mixed-modality usage appears after this list.)
- Distributed data management systems often operate on “elastic” clusters that can scale up or down on demand. These systems face numerous challenges, including data fragmentation, replication, and cluster sizing. Unfortunately, these challenges have traditionally been treated independently, leaving administrators with little insight on how the interplay of these decisions affects query performance. This paper introduces NashDB, an adaptive data distribution framework that relies on an economic model to automatically balance the supply and demand of data fragments, replicas, and cluster nodes. NashDB adapts its decisions to query priorities and shifting workloads, while avoiding underutilized cluster nodes and redundant replicas. This paper introduces and evaluates NashDB’s model, as well as a suite of optimization techniques designed to efficiently identify data distribution schemes that match workload demands and transition the system to this new scheme with minimum data transfer overhead. Experimentally, we show that NashDB is often Pareto dominant compared to other solutions.
- Parallel filesystems (PFSs) are one of the most critical high-availability components of High Performance Computing (HPC) systems. Most HPC workloads depend on the availability of a POSIX-compliant parallel filesystem that provides a globally consistent view of data to all compute nodes of an HPC system. Because of this central role, failure or performance degradation events in the PFS can impact every user of an HPC resource. There is typically insufficient information available to users, and even to many HPC staff, to identify the causes of these PFS events, impeding the implementation of timely and targeted remedies to PFS issues. The relevant information is distributed across PFS servers; however, access to these servers is highly restricted due to the sensitive role they play in the operations of an HPC system. Additionally, the information is challenging to aggregate and interpret, relegating diagnosis and treatment of PFS issues to a select few experts with privileged system access. To democratize this information, we are developing an open-source and user-facing Parallel FileSystem TRacing and Analysis SErvice (PFSTRASE) that analyzes the requisite data to establish causal relationships between PFS activity and events detrimental to stability and performance. We are implementing the service for the open-source Lustre filesystem, which is the most commonly used PFS at large-scale HPC sites. Server loads for specific PFS I/O operations (IOPs) will be measured and aggregated by the service to automatically estimate an effective load generated by every client, job, and user. (A toy sketch of this aggregation idea appears after this list.) The infrastructure provides a real-time, user-accessible text-based interface and a publicly accessible web interface displaying both real-time and historical data.
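For the TaskVine mixed-modality abstract above, a minimal sketch of combining a standalone task with serverless function calls in one manager might look like the following. The call names follow the TaskVine serverless documentation (create_library_from_functions, install_library, FunctionCall), but treat the exact signatures as assumptions to be checked against the installed ndcctools version.

```python
# A hedged sketch of mixing execution modalities under one TaskVine manager.
import ndcctools.taskvine as vine

m = vine.Manager(9123)

# Modality 1: a self-contained command-line task.
m.submit(vine.Task("echo preprocessing done"))

# Modality 2: serverless calls into a library of Python functions that
# is installed once per worker and then invoked many times cheaply.
def predict(x):
    return 2 * x + 1  # stand-in for a model-inference step

lib = m.create_library_from_functions("ml_library", predict)
m.install_library(lib)

for x in range(10):
    m.submit(vine.FunctionCall("ml_library", "predict", x))

while not m.empty():
    t = m.wait(5)
    if t:
        print(t.output)  # stdout for the command task, return value for the calls
```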
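And for the PFSTRASE abstract above, the per-client effective-load estimate could be illustrated by the toy aggregation below. The sampled counts and cost weights are invented for the example; this is not PFSTRASE's actual code.

```python
# A toy illustration of estimating an effective per-client load from
# per-operation server counts: weight each IOP type by an assumed
# server-side cost and sum over observations.
from collections import defaultdict

# (client/job/user, operation, count) as might be read from server counters
samples = [
    ("node17/job42/alice", "open", 1200),
    ("node17/job42/alice", "read", 90000),
    ("node03/job07/bob", "write", 45000),
    ("node03/job07/bob", "open", 300),
]

# assumed relative server-side cost of each I/O operation type
op_cost = {"open": 5.0, "read": 1.0, "write": 2.0}

# effective load = sum over operations of (cost * observed count)
load = defaultdict(float)
for client, op, count in samples:
    load[client] += op_cost[op] * count

for client, score in sorted(load.items(), key=lambda kv: -kv[1]):
    print(f"{client}: effective load {score:,.0f}")
```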