PRUNE: A preserving run environment for reproducible scientific computing

Ivie, Peter; Thain, Douglas

doi:10.1109/eScience.2016.7870886

Computing as a whole suffers from a crisis of reproducibility. Programs executed in one context are astonishingly hard to reproduce in another context, resulting in wasted effort by people and general distrust of results produced by computer. The root of the problem lies in the fact that every program has implicit dependencies on data and execution environment which are rarely understood by the end user. To address this problem, we present PRUNE, the Preserving Run Environment. In PRUNE, every task to be executed is wrapped in a functional interface and coupled with a strictly defined environment. The task is then executed by PRUNE rather than the user to ensure reproducibility. As a scientific workflow evolves in PRUNE, a growing but immutable tree of derived data is created. The provenance of every item in the system can be precisely described, facilitating sharing and modification between collaborating researchers, along with efficient management of limited storage space. We present the user interface and the initial prototype of PRUNE, and demonstrate its application in matching records and comparing surnames in U.S. Censuses.
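The idea in the abstract (tasks wrapped in a functional interface, tied to a declared environment, producing immutable derived data with recorded provenance) can be illustrated with a small sketch. This is not PRUNE's actual interface: the `Store` class, its `put`, `get`, and `run` methods, and the `env` string are all hypothetical names chosen for illustration, and identifying a function by its name alone is a deliberate simplification.

```python
import hashlib
import json


class Store:
    """Toy content-addressed store (hypothetical, not PRUNE's real API)."""

    def __init__(self):
        self._items = {}       # content key -> bytes; entries are never overwritten
        self._results = {}     # task id -> content key of the task's output
        self.provenance = {}   # content key -> description of the first task that produced it

    def put(self, data: bytes) -> str:
        """Store raw data under the SHA-256 of its content and return that key."""
        key = hashlib.sha256(data).hexdigest()
        self._items.setdefault(key, data)
        return key

    def get(self, key: str) -> bytes:
        return self._items[key]

    def run(self, func, input_keys, env: str) -> str:
        """Run func on stored inputs; the store, not the caller, performs the call."""
        # Assumption: a function is identified by name only. A real system
        # would also need to capture the code and its environment precisely.
        task = {"function": func.__name__, "inputs": list(input_keys), "environment": env}
        task_id = hashlib.sha256(json.dumps(task, sort_keys=True).encode()).hexdigest()
        if task_id in self._results:
            # Same function, inputs, and environment: reuse the derived data.
            return self._results[task_id]
        output = func(*[self.get(k) for k in input_keys])
        out_key = self.put(output)
        self.provenance.setdefault(out_key, task)
        self._results[task_id] = out_key
        return out_key
```

Under these assumptions, calling `run` twice with the same function name, input keys, and environment string returns the same output key without re-executing the function, and `provenance` records which task first produced each derived item.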

Award ID(s): 1642409

PAR ID: 10047189

Author(s) / Creator(s): Ivie, Peter; Thain, Douglas

Date Published: 2016-10-01

Journal Name: IEEE Conference on e-Science

Page Range / eLocation ID: 61 to 70

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper: https://doi.org/10.1109/eScience.2016.7870886
