Optimal Column Layout for Hybrid Workloads

Athanassoulis, Manos; Bøgh, Kenneth; Idreos, Stratos

doi:10.14778/3358701.3358707

Citation Details

Optimal Column Layout for Hybrid Workloads

Data-intensive analytical applications need to support both efficient reads and writes. However, what is usually a good data layout for an update-heavy workload, is not well-suited for a read-mostly one and vice versa. Modern analytical data systems rely on columnar layouts and employ delta stores to inject new data and updates. We show that for hybrid workloads we can achieve close to one order of magnitude better performance by tailoring the column layout design to the data and query workload. Our approach navigates the possible design space of the physical layout: it organizes each column’s data by determining the number of partitions, their corresponding sizes and ranges, and the amount of buffer space and how it is allocated. We frame these design decisions as an optimization problem that, given workload knowledge and performance requirements, provides an optimal physical layout for the workload at hand. To evaluate this work, we build an in-memory storage engine, Casper, and we show that it outperforms state-of-the-art data layouts of analytical systems for hybrid workloads. Casper delivers up to 2.32x higher throughput for update-intensive workloads and up to 2.14x higher throughput for hybrid workloads. We further show how to make data layout decisions robust to workload variation by carefully selecting the input of the optimization. more »

Award ID(s):: 1850202

PAR ID:: 10144830

Author(s) / Creator(s):: Athanassoulis, Manos; Bøgh, Kenneth; Idreos, Stratos

Publisher / Repository:: PVLDB

Date Published:: 2019-09-01

Journal Name:: Proceedings of the VLDB Endowment

Volume:: 12

Issue:: 13

ISSN:: 2150-8097

Page Range / eLocation ID:: 2393-2407

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.14778/3358701.3358707

More Like this