NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Efficient Algorithms for Cardinality Estimation and Conjunctive Query Evaluation With Simple Degree Constraints

https://doi.org/10.1145/3725233

Im, Sungjin; Moseley, Benjamin; Ngo, Hung; Pruhs, Kirk (June 2025, Proceedings of the ACM on Management of Data)

Cardinality estimation and conjunctive query evaluation are two of the most fundamental problems in database query processing. Recent work proposed, studied, and implemented a robust and practical information-theoretic cardinality estimation framework. In this framework, the estimator is the cardinality upper bound of a conjunctive query subject to ''degree-constraints'', which model a rich set of input data statistics. For general degree constraints, computing this bound is computationally hard. Researchers have naturally sought efficiently computable relaxed upper bounds that are as tight as possible. The polymatroid bound is the tightest among those relaxed upper bounds. While it is an open question whether the polymatroid bound can be computed in polynomial-time in general, it is known to be computable in polynomial-time for some classes of degree constraints. Our focus is on a common class of degree constraints called simple degree constraints. Researchers had not previously determined how to compute the polymatroid bound in polynomial time for this class of constraints. Our first main result is a polynomial time algorithm to compute the polymatroid bound given simple degree constraints. Our second main result is a polynomial-time algorithm to compute a ''proof sequence'' establishing this bound. This proof sequence can then be incorporated in the PANDA-framework to give a faster algorithm to evaluate a conjunctive query. In addition, we show computational limitations to extending our results to broader classes of degree constraints. Finally, our technique leads naturally to a new relaxed upper bound called theflow bound,which is computationally tractable.
more » « less
Free, publicly-accessible full text available June 9, 2026
Online Scheduling via Gradient Descent for Weighted Flow Time Minimization

https://doi.org/10.1137/1.9781611978322.128

Chen, Qingyun; Im, Sungjin; Petety, Aditya (January 2025, Society for Industrial and Applied Mathematics (SODA))

Free, publicly-accessible full text available January 1, 2026
Strategic Facility Location via Predictions

Chen, Qingyun; Im, Sungjin; Gravin, Nick (December 2024, WINE 2024: Conference on Web and Internet Economics)

Free, publicly-accessible full text available December 2, 2025
Binary Search with Distributional Predictions

Dinitz, Michael; Im, Sungjin; Lavastida, Thomas; Moseley, Benjamin; Niaparast, Aidin; Vassilvitskii, Sergei (December 2024, Open Review (NeurIPS))

Free, publicly-accessible full text available December 31, 2025
Binary Search with Distributional Predictions

Dinitz, Michael; Im, Sungjin; Lavastida, Thomas; Moseley, Benjamin; Niaparast, Aidin; Vassilvitskii, Sergei (December 2024, Advances in Neural Information Processing Systems 37 (NeurIPS 2024))

Free, publicly-accessible full text available December 15, 2025
Binary Search with Distributional Predictions.

Dinitz, Michael; Im, Sungjin; Lavastida, Thomas; Moseley, Benjamin; Niaparast, Aidin; Vassilvitskii, Sergei (December 2024, NeurIPS)

Free, publicly-accessible full text available December 10, 2025
Polynomial Time Convergence of the Iterative Evaluation of Datalogo Programs

https://doi.org/10.1145/3695839

Im, Sungjin; Moseley, Benjamin; Ngo, Hung Q; Pruhs, Kirk (November 2024, Proceedings of the ACM on Management of Data)

Datalog^ois an extension of Datalog that allows for aggregation and recursion over an arbitrary commutative semiring. Like Datalog, Datalogo programs can be evaluated via the natural iterative algorithm until a fixed point is reached. However unlike Datalog, the natural iterative evaluation of some Datalogo programs over some semirings may not converge. It is known that the commutative semirings for which the iterative evaluation of Datalogo programs is guaranteed to converge are exactly those semirings that are stable. Previously, the best known upper bound on the number of iterations until convergence over p-stable semirings is ∑i=1 ^n (p+2)ⁱ= Θ(pⁿ) steps, where n is (essentially) the output size. We establish that, in fact, the natural iterative evaluation of a Datalogo program over a p-stable semiring converges within a polynomial number of iterations. In particular our upper bound is O(σ p n²( n²lg Λ + lg σ)) where σ is the number of elements in the semiring present in either the input databases or the Datalogo program, and λ is the maximum number of terms in any product in the Datalogo program.
more » « less
Free, publicly-accessible full text available November 4, 2025
Online Load and Graph Balancing for Random Order Inputs

https://doi.org/10.1145/3626183.3659983

Im, Sungjin; Kumar, Ravi; Li, Shi; Petety, Aditya; Purohit, Manish (June 2024, ACM)

Full Text Available
On the Convergence Rate of Linear Datalogo over Stable Semirings

Im, Sungjin; Moseley, Ben; Ngo, Hung; Pruhs, Kirk (March 2024, International Conference on Database Theory)

Full Text Available
Data Exchange Markets via Utility Balancing

https://doi.org/10.1145/3589334.3645364

Bhaskara, Aditya; Gollapudi, Sreenivas; Im, Sungjin; Kollias, Kostas; Munagala, Kamesh; Sankar, Govind S (May 2024, ACM)

Full Text Available

« Prev Next »

Search for: All records