FastPDB: Towards Bag-Probabilistic Queries at Interactive Speeds

Huber, Aaron; Kennedy, Oliver; Rudra, Atri; Zhao, Zhuoyue; Feng, Su; Glavic, Boris

doi:10.1145/3709691

Citation Details

This content will become publicly available on February 10, 2026

FastPDB: Towards Bag-Probabilistic Queries at Interactive Speeds

Probabilistic databases (PDBs) provide users with a principled way to query data that is incomplete or imprecise. In this work, we study computing expected multiplicities of query results over probabilistic databases under bag semantics which has PTIME data complexity. However, does this imply that bag probabilistic databases are practical? We strive to answer this question from both a theoretical as well as a systems perspective. We employ concepts from fine-grained complexity to demonstrate that exact bag probabilistic query processing is fundamentally less efficient than deterministic bag query evaluation, but that fast approximations are possible by sampling monomials from a circuit representation of a result tuple's lineage. A remaining issue, however, is that constructing such circuits, while in PTIME, can nonetheless have significant overhead. To avoid this cost, we utilize approximate query processing techniques to directly sample monomials without materializing lineage upfront. Our implementation inFastPDBprovides accurate anytime approximation of probabilistic query answers and scales to datasets orders of magnitude larger than competing methods. more »

Award ID(s):: 2420577 2420691

PAR ID:: 10618187

Author(s) / Creator(s):: Huber, Aaron; Kennedy, Oliver; Rudra, Atri; Zhao, Zhuoyue; Feng, Su; Glavic, Boris

Publisher / Repository:: SIGMOD 2025

Date Published:: 2025-02-10

Journal Name:: Proceedings of the ACM on Management of Data

Volume:: 3

Issue:: 1

ISSN:: 2836-6573

Page Range / eLocation ID:: 1 to 25

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on February 10, 2026
Journal Article:
https://doi.org/10.1145/3709691

More Like this