This content will become publicly available on May 20, 2026

Title: BLEND: A Unified Data Discovery System. In ICDE 2025.
Most research on data discovery has so far focused on improving individual discovery operators such as join, correlation, or union discovery. However, in practice, a combination of these techniques and their corresponding indexes may be necessary to support arbitrary discovery tasks. We propose BLEND, a comprehensive data discovery system that supports existing operators and enables their flexible pipelining. BLEND is based on a set of lower-level operators that serve as fundamental building blocks for more complex and sophisticated user tasks. To reduce the execution runtime of discovery pipelines, we propose a unified index structure and a rule- and cost-based optimizer that rewrites SQL statements into low-level operators when possible. We show the superior flexibility and efficiency of our system compared to ad-hoc discovery pipelines and stand-alone solutions.
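To make the idea concrete, here is a minimal sketch of one such low-level building block: an overlap operator answering a join-discovery task over a unified inverted index. All operator and function names are illustrative assumptions for this sketch, not BLEND's actual API.

```python
from collections import defaultdict

# Unified inverted index: cell value -> set of (table, column) pairs.
index = defaultdict(set)

def add_column(table, column, values):
    for v in values:
        index[v].add((table, column))

def joinable(query_values, k=5):
    """Low-level overlap operator: rank stored columns by how many of the
    query column's values they contain, a building block of join discovery."""
    counts = defaultdict(int)
    for v in query_values:
        for col in index.get(v, ()):
            counts[col] += 1
    return sorted(counts.items(), key=lambda kv: -kv[1])[:k]

add_column("t1", "city",    ["berlin", "paris", "rome"])
add_column("t2", "capital", ["paris", "rome", "madrid"])

# Both stored columns share {'paris', 'rome'} with the query column.
print(joinable(["paris", "rome", "oslo"]))
```

In the system the abstract describes, operators of this kind would be the rewrite targets of the rule- and cost-based optimizer, which maps user-facing SQL onto them when possible.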
Award ID(s):
2325632 2107248
PAR ID:
10614698
Author(s) / Creator(s):
Publisher / Repository:
IEEE
Date Published:
Edition / Version:
2025
ISBN:
979-8-3315-3603-9
Page Range / eLocation ID:
737-750
Subject(s) / Keyword(s):
Data Management
Format(s):
Medium: X
Location:
IEEE International Conference on Data Engineering (ICDE)
Sponsoring Org:
National Science Foundation
More Like this
  1. Exploratory Data Analysis (EDA) is a crucial step in any data science project. However, existing Python libraries fall short of supporting data scientists in common EDA tasks for statistical modeling. Their API design is either too low-level, optimized for plotting rather than EDA, or too high-level, making it hard to specify fine-grained EDA tasks. In response, we propose DataPrep.EDA, a novel task-centric EDA system in Python. DataPrep.EDA allows data scientists to declaratively specify a wide range of EDA tasks at different granularities with a single function call. We identify a number of challenges in implementing DataPrep.EDA and propose effective solutions to improve the scalability, usability, and customizability of the system. In particular, we discuss lessons learned from using Dask to build the data processing pipelines for EDA tasks and describe our approaches to accelerating the pipelines. We conduct extensive experiments to compare DataPrep.EDA with Pandas-profiling, the state-of-the-art EDA system in Python. The experiments show that DataPrep.EDA significantly outperforms Pandas-profiling in terms of both speed and user experience. DataPrep.EDA is open-sourced as an EDA component of DataPrep: https://github.com/sfu-db/dataprep.
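For illustration, a short sketch of the library's documented task-centric calls; the CSV path and column names are placeholders, not part of the paper:

```python
import pandas as pd
from dataprep.eda import plot, plot_correlation, plot_missing, create_report

df = pd.read_csv("titanic.csv")  # placeholder path: any tabular dataset works

plot(df)                 # overview task: distributions of every column
plot(df, "Age")          # univariate task: analyze one column in depth
plot(df, "Age", "Fare")  # bivariate task: relationship between two columns
plot_correlation(df)     # correlation task: correlation matrices
plot_missing(df)         # missing-value task: impact of missing data
create_report(df)        # full profiling report in a single call
```

Each call corresponds to one EDA task rather than one chart, which is the task-centric API design the abstract contrasts with low-level plotting libraries.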
  2. Structured data, or data that adheres to a pre-defined schema, can suffer from fragmented context: information describing a single entity can be scattered across multiple datasets or tables tailored for specific business needs, with no explicit linking keys. Context enrichment, or rebuilding fragmented context using keyless joins, is an implicit or explicit step in machine learning (ML) pipelines over structured data sources. This process is tedious, domain-specific, and lacks support in now-prevalent no-code ML systems that let users create ML pipelines using just input data and high-level configuration files. In response, we propose Ember, a system that abstracts and automates keyless joins to generalize context enrichment. Our key insight is that Ember can enable a general keyless join operator by constructing an index populated with task-specific embeddings. Ember learns these embeddings by leveraging Transformer-based representation learning techniques. We describe our architectural principles and operators in developing Ember, and empirically demonstrate that Ember allows users to develop no-code context enrichment pipelines for five domains, including search, recommendation, and question answering, and can exceed alternatives by up to 39% in recall with as little as a single-line configuration change.
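A generic sketch of the embedding-plus-index pattern the abstract describes, not Ember's actual code: the model name, libraries, and example records below are assumptions chosen for illustration.

```python
# Keyless join sketch: embed records with a Transformer encoder, index one
# side, and join by nearest neighbor instead of an explicit key.
from sentence_transformers import SentenceTransformer  # assumed encoder choice
from sklearn.neighbors import NearestNeighbors

left  = ["Apple iPhone 13 128GB blue", "Dell XPS 13 laptop"]
right = ["iPhone 13 (128 GB, Blue) smartphone",
         "XPS 13 9310 notebook by Dell",
         "Samsung Galaxy S21"]

model = SentenceTransformer("all-MiniLM-L6-v2")  # generic pretrained model
left_vecs  = model.encode(left,  normalize_embeddings=True)
right_vecs = model.encode(right, normalize_embeddings=True)

# Index the right-hand records; cosine distance over normalized embeddings.
nn = NearestNeighbors(n_neighbors=1, metric="cosine").fit(right_vecs)
dist, idx = nn.kneighbors(left_vecs)

for l, d, i in zip(left, dist[:, 0], idx[:, 0]):
    print(f"{l!r} -> {right[i]!r} (cosine sim {1 - d:.2f})")
```

Ember additionally fine-tunes the embeddings per task; this sketch uses a fixed pretrained encoder to keep the core retrieval idea visible.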
  3. Surfacing and mitigating bias in ML pipelines is a complex topic, and there is a dire need for system-level support that empowers data scientists to debug these pipelines, control for bias, and improve data quality and representativeness. We propose fairDAGs, an open-source library that extracts directed acyclic graph (DAG) representations of the data flow in preprocessing pipelines for ML. The library then instruments the pipelines with tracing and visualization code to capture changes in data distributions and to identify distortions with respect to protected group membership as the data travels through the pipeline. We illustrate the utility of fairDAGs with experiments on publicly available ML pipelines.
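A toy sketch of the kind of instrumentation fairDAGs automates: snapshot the distribution of a protected attribute after each pipeline step to spot distortions. The function and column names are hypothetical, not the library's API.

```python
import pandas as pd

TRACE = []  # (step name, protected-group shares) recorded along the pipeline

def trace(df, step, attr="sex"):
    """Record the protected attribute's distribution after a pipeline step."""
    TRACE.append((step, df[attr].value_counts(normalize=True).to_dict()))
    return df

df = pd.DataFrame({"sex": ["F", "M", "M", "F", "M"],
                   "age": [25.0, None, 41.0, 33.0, 57.0]})

df = trace(df, "raw")
df = trace(df.dropna(), "dropna")             # missing-value handling can skew groups
df = trace(df[df["age"] < 50], "age_filter")  # so can filtering

for step, shares in TRACE:
    print(step, shares)  # compare shares across steps to detect drift
```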
  4. Data provenance tools capture the steps used to produce analyses. However, scientists must choose among workflow provenance systems, which allow arbitrary code but only track provenance at the granularity of files; provenance APIs, which provide tuple-level provenance but incur overhead in all computations; and database provenance tools, which track tuple-level provenance through relational operators and support optimization but cover only a limited subset of data science tasks. None of these solutions is well suited for tracing errors introduced during common ETL, record alignment, and matching tasks over data types such as strings, images, etc. Scientists need new capabilities to identify the sources of errors, to find why different code versions produce different results, and to identify which parameter values affect output. We propose PROVision, a provenance-driven troubleshooting tool that supports ETL and matching computations and traces the extraction of content within data objects. PROVision extends database-style provenance techniques to capture equivalences, support optimizations, and enable selective evaluation. We formalize our extensions, implement them in the PROVision system, and validate their effectiveness and scalability for common ETL and matching tasks.
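A toy sketch of tuple-level why-provenance carried through ETL and matching steps, the database-style bookkeeping that PROVision extends to extraction and matching. All names and operators are illustrative, not PROVision's implementation.

```python
# Each tuple carries the set of source-record ids it was derived from.
source = [("alice@EX.com", {"s1"}), ("bob@ex.com", {"s2"}), (None, {"s3"})]

def clean(tuples):
    """ETL step: normalize emails and drop nulls; provenance is preserved."""
    return [(v.lower(), prov) for v, prov in tuples if v is not None]

def match(left, right):
    """Matching step: an output tuple's provenance is the union of the
    provenance of the input tuples that produced it."""
    return [(lv, lp | rp) for lv, lp in left for rv, rp in right if lv == rv]

reference = [("alice@ex.com", {"r1"})]
matched = match(clean(source), reference)
print(matched)  # [('alice@ex.com', {'s1', 'r1'})] traces output to its sources
```

With this bookkeeping in place, an unexpected output tuple can be traced back to exactly the source records that produced it, which is the troubleshooting capability the abstract motivates.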