InferDB: In-Database Machine Learning Inference Using Indexes

Salazar-Díaz, Ricardo; Glavic, Boris; Rabl, Tilmann

doi:10.14778/3659437.3659441

The performance of inference with machine learning (ML) models and its integration with analytical query processing have become critical bottlenecks for data analysis in many organizations. An ML inference pipeline typically consists of a preprocessing workflow followed by prediction with an ML model. Current approaches for in-database inference implement preprocessing operators and ML algorithms in the database either natively, by transpiling code to SQL, or by executing user-defined functions in guest languages such as Python. In this work, we present a radically different approach that approximates an end-to-end inference pipeline (preprocessing plus prediction) using a light-weight embedding that discretizes a carefully selected subset of the input features and an index that maps data points in the embedding space to aggregated predictions of an ML model. We replace a complex preprocessing workflow and model-based inference with a simple feature transformation and an index lookup. Our framework improves inference latency by several orders of magnitude while maintaining similar prediction accuracy compared to the pipeline it approximates.
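The idea described in the abstract (discretize a few selected features, then look up an aggregated prediction in an index) can be illustrated with a small sketch. This is not the authors' implementation: the function names (`embed`, `build_index`, `infer`), the hand-picked bin edges, the use of a majority vote as the aggregation, the fallback for unseen keys, and the toy `slow_pipeline` standing in for an expensive preprocessing-plus-model pipeline are all assumptions made for illustration.

```python
from bisect import bisect_right
from collections import Counter, defaultdict


def embed(row, features, edges):
    """Map a raw row to a tuple of bin ids over the selected features."""
    return tuple(bisect_right(edges[f], row[f]) for f in features)


def build_index(rows, predict, features, edges):
    """Aggregate pipeline predictions per embedding key.

    Assumption: aggregation is a majority vote; the fallback label for
    unseen keys is the most common prediction overall.
    """
    votes = defaultdict(Counter)
    for row in rows:
        votes[embed(row, features, edges)][predict(row)] += 1
    index = {key: c.most_common(1)[0][0] for key, c in votes.items()}
    overall = Counter()
    for c in votes.values():
        overall.update(c)
    default = overall.most_common(1)[0][0]
    return index, default


def infer(row, index, default, features, edges):
    """Replace the pipeline call with a feature transformation and a lookup."""
    return index.get(embed(row, features, edges), default)


def slow_pipeline(row):
    """Hypothetical stand-in for preprocessing plus model prediction."""
    return int(row[0] + 0.5 * row[2] > 5.0)


# Toy data: column 1 is constant, so only columns 0 and 2 are selected.
rows = [(x, 7.0, y) for x in range(10) for y in range(10)]
features = [0, 2]
edges = {0: [2.5, 5.0, 7.5], 2: [5.0]}  # hand-picked bin edges (assumption)

index, default = build_index(rows, slow_pipeline, features, edges)
agreement = sum(
    infer(r, index, default, features, edges) == slow_pipeline(r) for r in rows
) / len(rows)
```

In this sketch the index is an in-memory dictionary; in a database setting the same key-to-prediction mapping could be stored as an indexed table so that inference becomes a join or point lookup. The value of `agreement` gives a rough sense of how closely the lookup tracks the pipeline it approximates on the toy data.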

Award ID(s): 2420577, 2420691, 2107107, 1956123

PAR ID: 10544898

Author(s) / Creator(s): Salazar-Díaz, Ricardo; Glavic, Boris; Rabl, Tilmann

Publisher / Repository: VLDB Endowment

Date Published: 2024-04-01

Journal Name: Proceedings of the VLDB Endowment

Volume: 17

Issue: 8

ISSN: 2150-8097

Page Range / eLocation ID: 1830 to 1842

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
https://doi.org/10.14778/3659437.3659441
