IcedTea: Efficient and Responsive Time-Travel Debugging in Dataflow Systems

Ni, Shengquan; Huang, Yicong; Wang, Zuozhi; Li, Chen

doi:10.14778/3712221.3712251

Citation Details

This content will become publicly available on September 1, 2026

IcedTea: Efficient and Responsive Time-Travel Debugging in Dataflow Systems

Dataflow systems have an increasing need to support a wide range of tasks in data-centric applications using latest techniques such as machine learning. These tasks often involve custom functions with complex internal states. Consequently, users need enhanced debugging support to understand runtime behaviors and investigate internal states of dataflows. Traditional forward debuggers allow users to follow the chronological order of operations in an execution. Therefore, a user cannot easily identify a past runtime behavior after an unexpected result is produced. In this paper, we present a novel time-travel debugging paradigm called IcedTea, which supports reverse debugging. In particular, in a dataflow's execution, which is inherently distributed across multiple operators, the user can periodically interact with the job and retrieve the global states of the operators. After the execution, the system allows the user to roll back the dataflow state to any past interactions. The user can use step instructions to repeat the past execution to understand how data was processed in the original execution. We give a full specification of this powerful paradigm, study how to reduce its runtime overhead and develop techniques to support debugging instructions responsively. Our experiments on real-world datasets and workflows show that IcedTea can support responsive time-travel debugging with low time and space overhead. more »

Award ID(s):: 2200274 2106859

PAR ID:: 10636057

Author(s) / Creator(s):: Ni, Shengquan; Huang, Yicong; Wang, Zuozhi; Li, Chen

Publisher / Repository:: VLDB

Date Published:: 2025-09-01

Journal Name:: Proceedings of the VLDB Endowment

Volume:: 18

Issue:: 3

ISSN:: 2150-8097

Page Range / eLocation ID:: 902 to 914

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on September 1, 2026
Journal Article:
https://doi.org/10.14778/3712221.3712251

More Like this