SafeBound: A Practical System for Generating Cardinality Bounds

Deeds, Kyle B.; Suciu, Dan; Balazinska, Magdalena

doi:10.1145/3588907

This content will become publicly available on May 26, 2024

Recent work has reemphasized the importance of cardinality estimates for query optimization. While new techniques have continuously improved in accuracy over time, they still generally allow for underestimates, which often lead optimizers to make overly optimistic decisions. This can be very costly for expensive queries. An alternative approach to estimation is cardinality bounding, also called pessimistic cardinality estimation, where the cardinality estimator provides guaranteed upper bounds on the true cardinality. By never underestimating, this approach allows the optimizer to avoid potentially inefficient plans. However, existing pessimistic cardinality estimators are not yet practical: they use very limited statistics on the data and cannot handle predicates. In this paper, we introduce SafeBound, the first practical system for generating cardinality bounds. SafeBound builds on recent theoretical work that uses degree sequences on join attributes to compute cardinality bounds, extends this framework with predicates, introduces a practical compression method for the degree sequences, and implements an efficient inference algorithm. Across four workloads, SafeBound achieves up to 80% lower end-to-end runtimes than PostgreSQL and is on par with or better than state-of-the-art ML-based estimators and pessimistic cardinality estimators, by improving the runtime of the expensive queries. It also saves up to 500x in query planning time and uses up to 6.8x less space compared to state-of-the-art cardinality estimation methods.
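
To make the idea of bounding a join from degree sequences concrete, the following is a minimal sketch for a single two-table equijoin. It is not SafeBound's implementation: the function names and sample columns are hypothetical, and predicates, compression, and multi-way joins are left out. It relies only on the rearrangement inequality: pairing the i-th largest frequency on one side with the i-th largest on the other can never give a smaller sum than the true pairing by value.

```python
from collections import Counter


def degree_sequence(column):
    """Frequencies of each distinct join value, sorted in descending order."""
    return sorted(Counter(column).values(), reverse=True)


def join_size_upper_bound(deg_r, deg_s):
    """Upper bound on |R JOIN S| computed from the two degree sequences only.

    Pairs the i-th largest degrees of both sides. Ranks past the end of the
    shorter sequence contribute nothing, which zip() handles by truncating.
    """
    return sum(a * b for a, b in zip(deg_r, deg_s))


def true_join_size(col_r, col_s):
    """Exact equijoin size, used here only to check the bound."""
    freq_r, freq_s = Counter(col_r), Counter(col_s)
    return sum(n * freq_s[value] for value, n in freq_r.items())


# Hypothetical join columns of two relations R and S.
r_col = [1, 1, 1, 2, 2, 3]
s_col = [1, 2, 2, 2, 3, 3]

bound = join_size_upper_bound(degree_sequence(r_col), degree_sequence(s_col))
exact = true_join_size(r_col, s_col)
print(bound, exact)  # prints "14 11"
```

The bound never falls below the exact size, which is the guarantee a pessimistic estimator offers the optimizer; how tight it is depends on how closely the frequent values of the two columns coincide.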

Award ID(s): 1907997 2109922

NSF-PAR ID: 10428134

Author(s) / Creator(s): Deeds, Kyle B.; Suciu, Dan; Balazinska, Magdalena

Date Published: 2023-05-26

Journal Name: Proceedings of the ACM on Management of Data

Volume: 1

Issue: 1

ISSN: 2836-6573

Page Range / eLocation ID: 1 to 26

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Journal Article:
https://doi.org/10.1145/3588907
