Compact, tamper-resistant archival of fine-grained provenance

Zheng, Nan; Ives, Zachary G.

doi:10.14778/3436905.3436909

Data provenance tools aim to facilitate reproducible data science and auditable data analyses by tracking the processes and inputs responsible for each result of an analysis. Fine-grained provenance further enables sophisticated reasoning about why individual output results appear or fail to appear. However, for reproducibility and auditing, we need a provenance archival system that is tamper-resistant and that efficiently stores provenance for computations performed over time (i.e., it compresses repeated results). We study this problem, developing solutions for storing fine-grained provenance in relational storage systems while both compressing and protecting it via cryptographic hashes. We experimentally validate our proposed solutions using both scientific and OLAP workloads.

Award ID(s): 1640813, 1910108, 1547360, 1763514

PAR ID: 10290599

Author(s) / Creator(s): Zheng, Nan; Ives, Zachary G.

Date Published: 2020-12-01

Journal Name: Proceedings of the VLDB Endowment

Volume: 14

Issue: 4

ISSN: 2150-8097

Page Range / eLocation ID: 485 to 497

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article: https://doi.org/10.14778/3436905.3436909