An Adaptive Benchmark for Modeling User Exploration of Large Datasets

Purich, Joanna; Wise, Anthony; Battle, Leilani

doi:10.1145/3709658

Citation Details

This content will become publicly available on February 10, 2026

In this paper, we present a new DBMS performance benchmark that can simulate user exploration with any specified dashboard design made of standard visualization and interaction components. The distinguishing feature of our SImulation-BAsed (or SIMBA) benchmark is its ability to model user analysis goals as a set of SQL queries to be generated through a valid sequence of user interactions, as well as measure the completion of analysis goals by testing for equivalence between the user's previous queries and their goal queries. In this way, the SIMBA benchmark can simulate how an analyst opportunistically searches for interesting insights at the beginning of an exploration session and eventually homes in on specific goals towards the end. To demonstrate the versatility of the SIMBA benchmark, we use it to test the performance of four DBMSs with six different dashboard specifications and compare our results with IDEBench. Our results show how goal-driven simulation can reveal gaps in DBMS performance that existing benchmarking methods miss, across a range of data exploration scenarios.

Award ID(s): 2141506

PAR ID: 10625374

Author(s) / Creator(s): Purich, Joanna; Wise, Anthony; Battle, Leilani

Publisher / Repository: ACM

Date Published: 2025-02-10

Journal Name: Proceedings of the ACM on Management of Data

Volume: 3

Issue: 1

ISSN: 2836-6573

Page Range / eLocation ID: 1 to 24

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Journal Article:
https://doi.org/10.1145/3709658
