Heuristic and Cost-based Optimization for Diverse Provenance Tasks

Niu, Xing; Kapoor, Raghav; Glavic, Boris; Gawlick, Dieter; Liu, Zhen Hua; Krishnaswamy, Vasudha; Radhakrishnan, Venkatesh

doi:10.1109/TKDE.2018.2827074

Citation Details

Heuristic and Cost-based Optimization for Diverse Provenance Tasks

A well-established technique for capturing database provenance as annotations on data is to instrument queries to propagate such annotations. However, even sophisticated query optimizers often fail to produce efficient execution plans for instrumented queries. We develop provenance-aware optimization techniques to address this problem. Specifically, we study algebraic equivalences targeted at instrumented queries and alternative ways of instrumenting queries for provenance capture. Furthermore, we present an extensible heuristic and cost-based optimization framework utilizing these optimizations. Our experiments confirm that these optimizations are highly effective, improving performance by several orders of magnitude for diverse provenance tasks. more »

Award ID(s):: 1640864

PAR ID:: 10082096

Author(s) / Creator(s):: Niu, Xing; Kapoor, Raghav; Glavic, Boris; Gawlick, Dieter; Liu, Zhen Hua; Krishnaswamy, Vasudha; Radhakrishnan, Venkatesh

Date Published:: 2018-04-16

Journal Name:: IEEE Transactions on Knowledge and Data Engineering

ISSN:: 1041-4347

Page Range / eLocation ID:: 1 to 1

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1109/TKDE.2018.2827074

More Like this