Learned Offline Query Planning via Bayesian Optimization

Tao, Jeffrey; Maus, Natalie; Jones, Haydn; Zeng, Yimeng; Gardner, Jacob R; Marcus, Ryan

doi:10.1145/3725316

This content will become publicly available on June 17, 2026

Analytics database workloads often contain queries that are executed repeatedly. Existing optimization techniques generally prioritize keeping optimization cost low, normally well below the time it takes to execute a single instance of a query. If a given query is going to be executed thousands of times, could it be worth investing significantly more optimization time? In contrast to traditional online query optimizers, we propose an offline query optimizer that searches a wide variety of plans and incorporates query execution as a primitive. Our offline query optimizer combines variational auto-encoders with Bayesian optimization to find optimized plans for a given query. We compare our technique to the optimal plans possible with PostgreSQL and recent RL-based systems over several datasets, and show that our technique finds faster query plans.

Award ID(s): 2400135

PAR ID: 10616236

Author(s) / Creator(s): Tao, Jeffrey; Maus, Natalie; Jones, Haydn; Zeng, Yimeng; Gardner, Jacob R; Marcus, Ryan

Publisher / Repository: Proc ACM

Date Published: 2025-06-17

Journal Name: Proceedings of the ACM on Management of Data

Volume: 3

Issue: 3

ISSN: 2836-6573

Page Range / eLocation ID: 1 to 29

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Journal Article:
https://doi.org/10.1145/3725316