Provenance-based data skipping

Niu, Xing; Glavic, Boris; Liu, Ziyu; Li, Pengyuan; Gawlick, Dieter; Krishnaswamy, Vasudha; Liu, Zhen Hua; Porobic, Danica

doi:10.14778/3494124.3494130

Database systems use static analysis to determine up front which data is needed to answer a query, and they use indexes and other physical design techniques to speed up access to that data. However, for important classes of queries, e.g., HAVING and top-k queries, it is impossible to determine up front what data is relevant. To overcome this limitation, we develop provenance-based data skipping (PBDS), a novel approach that generates provenance sketches to concisely encode what data is relevant for a query. Once a provenance sketch has been captured, it is used to speed up subsequent queries. PBDS can exploit physical design artifacts such as indexes and zone maps.
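The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy table, the range boundaries, and all function names (`fragment_of`, `capture_sketch`, `top_k_with_sketch`) are hypothetical, and it assumes that a sketch records which range fragments of a table contain rows in the provenance of a top-k query, and that the same query is rerun on unchanged data.

```python
from bisect import bisect_right

# Hypothetical toy table of (name, population) rows.
ROWS = [("a", 50), ("b", 120), ("c", 5000), ("d", 20000),
        ("e", 900), ("f", 15000), ("g", 70)]

# Assumed range partitioning on population:
# fragment 0: < 100, 1: 100..999, 2: 1000..9999, 3: >= 10000.
BOUNDS = [100, 1000, 10000]


def fragment_of(value, bounds):
    """Return the index of the range fragment that contains value."""
    return bisect_right(bounds, value)


def top_k(rows, k):
    """Top-k query: the k rows with the largest population."""
    return sorted(rows, key=lambda r: r[1], reverse=True)[:k]


def capture_sketch(rows, k, bounds):
    """Run the query once and record the fragments holding its provenance.

    For this simple top-k query the provenance is the result rows themselves.
    """
    return {fragment_of(r[1], bounds) for r in top_k(rows, k)}


def top_k_with_sketch(rows, k, bounds, sketch):
    """Answer the same query while skipping fragments not in the sketch."""
    relevant = [r for r in rows if fragment_of(r[1], bounds) in sketch]
    return top_k(relevant, k)


sketch = capture_sketch(ROWS, 2, BOUNDS)
result = top_k_with_sketch(ROWS, 2, BOUNDS, sketch)
```

Here the linear filter only stands in for the skipping step. In a database system the sketch would more plausibly be turned into range predicates that an index or zone map can evaluate without reading the skipped fragments.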

Award ID(s): 2107107; 1956123

PAR ID: 10358636

Author(s) / Creator(s): Niu, Xing; Glavic, Boris; Liu, Ziyu; Li, Pengyuan; Gawlick, Dieter; Krishnaswamy, Vasudha; Liu, Zhen Hua; Porobic, Danica

Date Published: 2021-11-01

Journal Name: Proceedings of the VLDB Endowment

Volume: 15

Issue: 3

ISSN: 2150-8097

Page Range / eLocation ID: 451 to 464

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article: https://doi.org/10.14778/3494124.3494130
