On the Feasibility and Benefits of Extensive Evaluation

Hui, Yujie; Yu, Miao; Qi, Hao; Gan, Yifan; Li, Tianxi; Li, Yuke; Ren, Xueyuan; Ma, Sixiang; Lu, Xiaoyi; Wang, Yang

doi:10.1145/3677137

Citation Details

On the Feasibility and Benefits of Extensive Evaluation

Benchmark and system parameters often have a significant impact on performance evaluation, which raises a long-lasting question about which settings we should use. This paper studies the feasibility and benefits of extensive evaluation. A full extensive evaluation, which tests all possible settings, is usually too expensive. This work investigates whether it is possible to sample a subset of the settings and, upon them, generate observations that match those from a full extensive evaluation. Towards this goal, we have explored the incremental sampling approach, which starts by measuring a small subset of random settings, builds a prediction model on these samples using the popular ANOVA approach, adds more samples if the model is not accurate enough, and terminates otherwise. To summarize our findings: 1) Enhancing a research prototype to support extensive evaluation mostly involves changing hard-coded configurations, which does not take much effort. 2) Some systems are highly predictable, which means that they can achieve accurate predictions with a low sampling rate, but some systems are less predictable. 3) We have not found a method that can consistently outperform random sampling + ANOVA. Based on these findings, we provide recommendations to improve artifact predictability and strategies for selecting parameter values during evaluation. more »

Award ID(s):: 2118745 2333324

PAR ID:: 10611855

Author(s) / Creator(s):: Hui, Yujie; Yu, Miao; Qi, Hao; Gan, Yifan; Li, Tianxi; Li, Yuke; Ren, Xueyuan; Ma, Sixiang; Lu, Xiaoyi; Wang, Yang

Publisher / Repository:: ACM

Date Published:: 2024-10-01

Journal Name:: Proceedings of the ACM on Management of Data

Volume:: 2

Issue:: 4

ISSN:: 2836-6573

Page Range / eLocation ID:: 1 to 24

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1145/3677137

More Like this