This content will become publicly available on November 17, 2025

Title: Reshaping High Energy Physics Applications for Near-Interactive Execution Using TaskVine
High energy physics experiments produce petabytes of data annually that must be reduced to gain insight into the laws of nature. Early-stage reduction executes long-running high-throughput workflows across thousands of nodes spanning multiple facilities to produce shared datasets. Later stages are typically written by individuals or small groups and must be refined and re-run many times for correctness. Reducing the iteration time of these later stages is key to accelerating discovery. We describe our experience reshaping late-stage analysis applications to run on thousands of nodes. It is not enough merely to increase scale: changes are needed throughout the stack, including storage systems, data management, task scheduling, and application design. We demonstrate these changes in two analysis applications built on open-source data analysis frameworks (Coffea, Dask, TaskVine) and evaluate their performance on opportunistic campus clusters, showing effective scaling up to 7,200 cores and significant speedup.
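The applications described above drive TaskVine through its Python API (part of the cctools suite). As a minimal sketch of that programming model, and not the paper's actual analysis code, the following submits independent tasks to a TaskVine manager; the port number, command strings, and chunk count are placeholders, and attribute names may differ slightly between cctools versions.

```python
# Minimal TaskVine manager loop: submit independent tasks and wait for
# results. In the paper's applications, Coffea/Dask task graphs are
# dispatched through TaskVine rather than a hand-written loop like this.
import ndcctools.taskvine as vine

manager = vine.Manager(port=9123)              # workers connect to this port
print(f"TaskVine manager listening on port {manager.port}")

# One task per (hypothetical) input chunk.
for i in range(10):
    task = vine.Task(f"python analyze_chunk.py --chunk {i}")
    manager.submit(task)

# Workers can join at any time, e.g. from an opportunistic campus cluster.
completed = 0
while not manager.empty():
    task = manager.wait(5)                     # poll with a 5-second timeout
    if task:
        completed += 1
        print(f"completed {completed} tasks, last exit code {task.exit_code}")
```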
Award ID(s): 1931348
PAR ID: 10567688
Publisher / Repository: IEEE
ISBN: 979-8-3503-5291-7
Page Range / eLocation ID: 1 to 13
Location: Atlanta, GA, USA
Sponsoring Org: National Science Foundation
More Like this
  1. Roth, A (Ed.)
    It is well understood that a system built from individually fair components may not itself be individually fair. In this work, we investigate individual fairness under pipeline composition. Pipelines differ from ordinary sequential or repeated composition in that individuals may drop out at any stage, and classification in subsequent stages may depend on the remaining “cohort” of individuals. As an example, a company might hire a team for a new project and at a later point promote the highest performer on the team. Unlike other repeated classification settings, where the degree of unfairness degrades gracefully over multiple fair steps, the degree of unfairness in pipelines can be arbitrary, even in a pipeline with just two stages. Guided by a panoply of real-world examples, we provide a rigorous framework for evaluating different types of fairness guarantees for pipelines. We show that naïve auditing is unable to uncover systematic unfairness and that, in order to ensure fairness, some form of dependence must exist between the design of algorithms at different stages in the pipeline. Finally, we provide constructions that permit flexibility at later stages, meaning that there is no need to lock in the entire pipeline at the time that the early stage is constructed. 
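    As a toy illustration of the cohort dependence described above (not the paper's formal framework), the sketch below runs the hire-then-promote pipeline on two hypothetical worlds that differ only in one candidate's score by 0.02. Two nearly identical candidates receive maximally different stage-two outcomes, and which of them is promoted flips because of a change to someone else, behavior that auditing either stage alone would not reveal. All names and numbers are hypothetical.

```python
# Toy two-stage pipeline: stage 1 hires everyone above a threshold; stage 2
# promotes the single best performer among those who remain. The stage-2
# decision depends on the surviving cohort, so bob's outcome can flip on a
# tiny change to alice's score even though bob himself is unchanged.
def pipeline(scores, hire_threshold=0.5):
    hired = {name: s for name, s in scores.items() if s >= hire_threshold}  # stage 1
    promoted = max(hired, key=hired.get) if hired else None                 # stage 2
    return sorted(hired), promoted

world_1 = {"alice": 0.90, "bob": 0.89, "carol": 0.40}
world_2 = {"alice": 0.88, "bob": 0.89, "carol": 0.40}  # only alice moved, by 0.02

for label, scores in [("world 1", world_1), ("world 2", world_2)]:
    hired, promoted = pipeline(scores)
    print(f"{label}: hired={hired}, promoted={promoted}")
# world 1: hired=['alice', 'bob'], promoted=alice
# world 2: hired=['alice', 'bob'], promoted=bob
```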
  2. An increasing number of distributed applications operate by dispatching function invocations across the nodes of a distributed system. To operate correctly, the code and data dependencies of each function must be distributed along with the invocations. When translating applications to run on large-scale distributed systems, managing these dependencies becomes challenging: delivery must scale to thousands of nodes, the dependencies must be consistent across the system, and the method must be usable by an unprivileged developer. As a solution, this paper presents PONCHO, a lightweight Python-based toolkit that allows users to discover, package, and deploy dependencies as an integral part of distributed applications. PONCHO encapsulates a set of commands to be executed within an environment, providing a lightweight way to create and manage environments that improves both the portability and the reproducibility of scientific applications. We evaluate PONCHO with real-world applications in physics, computational chemistry, and hyperparameter optimization, observing the challenges that arise when creating and distributing an environment and measuring the overheads that result.
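    The packaging step described above can be sketched as follows. This is a rough illustration, not PONCHO's documented interface: the JSON spec keys and the `poncho_package_create` command name are assumptions written from memory and may differ between cctools versions, and all dependency and file names are hypothetical.

```python
# Write a (hypothetical) environment spec, then ask PONCHO to package it as
# a relocatable tarball that can be shipped along with function invocations.
# Spec schema and command name are assumptions; check the cctools/PONCHO
# documentation for the exact form in your installed version.
import json
import subprocess

spec = {
    "conda": {
        "channels": ["conda-forge"],
        "dependencies": ["python=3.10", "numpy", "scipy"],  # hypothetical deps
    },
    "pip": ["uproot"],                                       # hypothetical dep
}

with open("environment.json", "w") as f:
    json.dump(spec, f, indent=2)

# Package the environment so that workers can unpack it next to each task.
subprocess.run(
    ["poncho_package_create", "environment.json", "environment.tar.gz"],
    check=True,
)
```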
  3. Growth control is essential to establish organism size, so organisms must have mechanisms to both sense and adjust growth. Studies of single cells have revealed that size homeostasis can be achieved using distinct control methods: Sizer, Timer, and Adder. In multicellular organisms, mechanisms that regulate body size must not only control single-cell growth but also integrate it across organs and tissues during development to generate adult size and shape. To investigate body size and growth control in metazoans, we can leverage the roundworm Caenorhabditis elegans as a scalable and tractable model. We collected precise growth measurements of thousands of individuals throughout larval development, measured feeding behavior to pinpoint larval-stage transitions, and quantified changes in animal size and shape during development with high accuracy. We find differences in the growth of animal length and width during larval transitions. Using a combination of quantitative measurements and mathematical modeling, we present two physical mechanisms by which C. elegans can control growth. First, constraints on cuticle stretch generate mechanical signals through which animals sense body size and initiate larval-stage transitions. Second, mechanical control of food intake drives growth rate within larval stages, while regulatory mechanisms influence growth between stages. These results suggest how physical constraints control developmental timing and growth rate in C. elegans.
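    The Sizer, Timer, and Adder rules named above are the standard single-cell size-control models: divide on reaching a fixed size, after a fixed time, or after adding a fixed size increment, respectively. The toy simulation below, with purely hypothetical parameters and no connection to the paper's C. elegans measurements, shows how each rule handles a birth-size perturbation.

```python
# Toy size-homeostasis simulation: a cell grows exponentially, divides in
# half, and the control rule decides when division happens. Parameters are
# hypothetical; this is not the paper's C. elegans growth model.
import math

RATE = 1.0          # exponential growth rate (arbitrary units)
TARGET = 2.0        # intended size at division (ideal birth size is 1.0)
GENERATIONS = 6

def next_birth_size(birth_size, rule):
    """Return the daughter's birth size under the given control rule."""
    if rule == "timer":      # grow for a fixed time, whatever the size
        fixed_time = math.log(2) / RATE          # tuned so the ideal cell doubles
        division_size = birth_size * math.exp(RATE * fixed_time)
    elif rule == "sizer":    # grow until a fixed division size is reached
        division_size = TARGET
    else:                    # "adder": grow until a fixed increment is added
        division_size = birth_size + TARGET / 2
    return division_size / 2

for rule in ("timer", "sizer", "adder"):
    size = 1.3               # perturbed birth size (ideal would be 1.0)
    trace = []
    for _ in range(GENERATIONS):
        size = next_birth_size(size, rule)
        trace.append(round(size, 3))
    print(f"{rule:>5}: birth sizes over generations -> {trace}")
# The timer keeps the perturbation, the adder halves it each generation,
# and the sizer removes it in a single generation.
```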
  4.
    Building information modeling (BIM) provides a novel way of managing information across all lifecycle phases of a building project, facilitating processes such as architectural design, structural analysis, and construction management. Industry Foundation Classes (IFC) is an open standard for information exchange between different BIM applications in the architecture, engineering, and construction (AEC) domain. It represents project information in an interoperable way, covering the geometric, material, and other physical and functional information needed to analyze and manage a project. Structural analysis simulates the structural performance of a building under different types of loads to make sure the structure is safe. The information needed for structural analysis mainly comprises geometric, material, and load information; it comes from the architectural design and the selected analysis scenarios, and it should be represented in an interoperable way to allow transfer between different phases and different stakeholders. Missing information is a critical problem in the interoperable use of BIM: it can cause misunderstandings between stakeholders, erroneous structural analysis results, and misleading information feeding into later construction processes. In this paper, the authors analyze the use of IFC at three stages of structural analysis, namely the intrinsic modeling stage, the extrinsic modeling stage, and the analysis stage. The authors compared IFC files at these three stages with the original BIM software text files in terms of information coverage and identified cases of missing information. This is the first systematic investigation of BIM interoperability at the detailed work stages of structural analysis, and it provides insight into how BIM usage should be improved in this domain.
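    The coverage comparison described above can be approximated programmatically. The sketch below is a minimal illustration: it assumes the open-source ifcopenshell library, which the paper does not name, and hypothetical file names, and simply counts entity types in the IFC exports from two work stages so that missing categories (loads, material assignments, and so on) stand out.

```python
# Compare information coverage of two IFC exports (e.g. the intrinsic design
# model vs. the model handed to structural analysis) by counting entity
# types in each. Assumes the ifcopenshell library and hypothetical files.
from collections import Counter
import ifcopenshell

def entity_counts(path):
    model = ifcopenshell.open(path)
    # by_type with a supertype returns instances of all its subtypes;
    # IfcRoot covers objects, relationships, and property definitions.
    return Counter(entity.is_a() for entity in model.by_type("IfcRoot"))

design = entity_counts("architectural_design.ifc")    # hypothetical file
analysis = entity_counts("structural_analysis.ifc")   # hypothetical file

# Entity types present in the design export but absent from the analysis
# export are candidates for information missing between stages.
for entity_type in sorted(design):
    if entity_type not in analysis:
        print(f"missing in analysis export: {entity_type} "
              f"({design[entity_type]} instances in design)")
```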
  5. Streaming computations often exhibit substantial data parallelism that makes them well-suited to SIMD architectures. However, many such computations also exhibit irregularity, in the form of data-dependent, dynamic data rates, that makes efficient SIMD execution challenging. One aspect of this challenge is the need to schedule execution of a computation realized as a pipeline of stages connected by finite queues. A scheduler must both ensure high SIMD occupancy by gathering queued items into vectors and minimize costs associated with switching execution between stages. In this work, we present the AFIE (Active Full, Inactive Empty) scheduling policy for irregular streaming applications on SIMD processors. AFIE provably groups inputs to each stage of a pipeline into a minimal number of SIMD vectors while incurring a bounded number of switches relative to the best possible policy. These results apply even though irregularity forbids a priori knowledge of how many outputs will be generated from each input to each stage. We have implemented AFIE as an extension to the MERCATOR system for building irregular streaming applications on NVIDIA GPUs. We describe how the AFIE scheduler simplifies MERCATOR’s runtime code and empirically measure the new scheduler’s improved performance on irregular streaming applications. 
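    The tension the abstract describes, filling SIMD vectors versus limiting stage switches, can be made concrete with a toy model. The sketch below is not the AFIE policy from the paper; it is a simplified greedy scheduler over per-stage queues, with hypothetical stages and data-dependent output counts, that reports vector occupancy and the number of stage switches so the trade-off is visible.

```python
# Toy model of scheduling an irregular streaming pipeline on a SIMD machine.
# Each step runs one stage on a vector of up to W queued items; each item
# emits a data-dependent number of outputs to the next stage's queue. The
# greedy policy (run the longest queue) is NOT the AFIE policy from the
# paper, just an illustration of the occupancy / switching trade-off.
import random
from collections import deque

W = 8                                           # SIMD vector width
random.seed(0)

def stage_fn(item):
    """Hypothetical irregular stage: each input yields 0-2 outputs."""
    return [item] * random.randint(0, 2)

queues = [deque(range(64)), deque(), deque()]   # stage 0 seeded with inputs
switches = vectors = occupied_lanes = 0
current = None

while any(queues):
    # Greedy choice: run the stage with the most queued items.
    stage = max(range(len(queues)), key=lambda s: len(queues[s]))
    if stage != current:
        switches += 1
        current = stage
    batch = [queues[stage].popleft() for _ in range(min(W, len(queues[stage])))]
    vectors += 1
    occupied_lanes += len(batch)
    if stage + 1 < len(queues):                 # the last stage discards outputs
        for item in batch:
            queues[stage + 1].extend(stage_fn(item))

print(f"vectors fired: {vectors}, stage switches: {switches}, "
      f"mean SIMD occupancy: {occupied_lanes / (vectors * W):.2%}")
```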