Join Size Bounds using l p -Norms on Degree Sequences

Abo_Khamis, Mahmoud; Nakos, Vasileios; Olteanu, Dan; Suciu, Dan

doi:10.1145/3651597

Citation Details

Join Size Bounds using l p -Norms on Degree Sequences

Estimating the output size of a query is a fundamental yet longstanding problem in database query processing. Traditional cardinality estimators used by database systems can routinely underestimate the true output size by orders of magnitude, which leads to significant system performance penalty. Recently, upper bounds have been proposed that are based on information inequalities and incorporate sizes and max-degrees from input relations, yet their main benefit is limited to cyclic queries, because they degenerate to rather trivial formulas on acyclic queries. We introduce a significant extension of the upper bounds, by incorporating l_p-norms of the degree sequences of join attributes. Our bounds are significantly lower than previously known bounds, even when applied to acyclic queries. These bounds are also based on information theory, they come with a matching query evaluation algorithm, are computable in exponential time in the query size, and are provably tight when all degrees are ''simple''. more »

Award ID(s):: 2314527 2312195 2109922

PAR ID:: 10518842

Author(s) / Creator(s):: Abo_Khamis, Mahmoud; Nakos, Vasileios; Olteanu, Dan; Suciu, Dan

Publisher / Repository:: ACM

Date Published:: 2024-05-10

Journal Name:: Proceedings of the ACM on Management of Data

Volume:: 2

Issue:: 2

ISSN:: 2836-6573

Page Range / eLocation ID:: 1 to 24

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1145/3651597

More Like this