Poster: Minimizing Data Movement Using Distant Futures

Sly-Delgado, Barry; Thain, Douglas

Citation Details

Scientific workflows execute a series of tasks where each task may consume data as an input and produce data as an output. Within these workflows, tasks often produce intermediate results that may serve as inputs to subsequent tasks within the workflow. These results can vary in size and may need to be transported to another worker node. Data movement can become the primary bottleneck for many scientific workflows thus minimizing the cost of data movement can provide a significant performance benefit for a given workflow. Distant futures enable transfers between worker nodes, eliminating the need for intermediate results to pass through a centralized manager for future tasks invocations. Additionally, asynchronous transfers enable increased concurrency by preventing the blocking of task invocations. This poster shows the performance benefit received from the implementation of distant futures within a workflow that produces numerous intermediate results. more »

Award ID(s):: 1931348

PAR ID:: 10567840

Author(s) / Creator(s):: Sly-Delgado, Barry; Thain, Douglas

Publisher / Repository:: SC23: The International Conference for High Performance Computing, Networking, Storage, and Analysis

Date Published:: 2023-11-12

Format(s):: Medium: X

Institution:: SC23: The International Conference for High Performance Computing, Networking, Storage, and Analysis

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Posted Content:
The DOI is not currently available.

More Like this