Search for: All records

Award ID contains: 1838248

« Prev Next »

Total Resources

12

Resource Type
Conference Paper

3

Conference Proceeding

0

Dataset

0

Journal Article

9

Workshop Report

0

Availability
Full Text / Resource Available

11

Citation Only

1

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Revisiting Runtime Dynamic Optimization for Join Queries in Big Data Management Systems

https://doi.org/10.1145/3604437.3604460

Pavlopoulou, Christina ; Carey, Michael J. ; Tsotras, Vassilis J. ( June 2023 , ACM SIGMOD Record)

Effective query optimization remains an open problem for Big Data Management Systems. In this work, we revisit an old idea, runtime dynamic optimization, and adapt it to a big data management system, AsterixDB. The approach runs in stages (re-optimization points), starting by first executing all predicates local to a single dataset. The intermediate result created by a stage is then used to re-optimize the remaining query. This re-optimization approach avoids inaccurate intermediate result cardinality estimates, thus leading to much better execution plans. While it introduces overhead for materializing intermediate results, experiments show that this overhead is relatively small and is an acceptable price to pay given the optimization benefits.
more » « less
Free, publicly-accessible full text available June 7, 2024
Multi-valued indexing in Apache AsterixDB (SI DOLAP 2022)

https://doi.org/10.1016/j.is.2022.102144

Galvizo, Glenn ; Carey, Michael J. ( January 2023 , Information Systems)

Full Text Available
DynaHash: Efficient Data Rebalancing in Apache AsterixDB

https://doi.org/10.1109/icde53745.2022.00041

Luo, Chen ; Carey, Michael J. ( May 2022 , Proc. ICDE Conf.)

Full Text Available
On Multi-Valued Indexing in AsterixDB

Galviso, G. ; Carey, M. ( March 2022 , Int’l. Workshop on Design, Optimization, Languages and Analytical Processing of Big Data (DOLAP 2022), co-located with EDBT 2022)
Stefanidis, K. ; Golab, L. (Ed.)
Secondary indexes in relational database systems are traditionally built under the assumption that one data record maps to one indexed value. Nowadays, particularly in NoSQL systems, single data records can hold collections of values that users want to access efficiently in an ad-hoc manner. Multi-valued indexes aim to give users the best of both worlds: (i) to keep a more natural data model of records with collections of values, and (ii) to reap the benefits of a secondary index. In this paper, we detail the steps taken to realize multi-valued indexes in AsterixDB, a Big Data management system with a structured query language operating over a collection of docu- ments. This includes (a) creating the specification language for such indexes, (b) illustrating data flows for bulk-loading and maintaining an index, and (c) discussing query plans to take advantage of multi-valued indexes for use in predicates with existential and universal quantification. We conclude with ex- periments that compare AsterixDB multi-valued indexes against similar indexes in MongoDB and Couchbase Query.
more » « less
Full Text Available
Design Trade-offs for a Robust Dynamic Hybrid Hash Join

https://doi.org/10.14778/3547305.3547327

Jahangiri, S. ; Carey, M. ; Freytag, C. ( January 2022 , Proceedings of the VLDB Endowment)

Full Text Available
Columnar Formats for Schemaless LSM-based Document Stores

https://doi.org/10.14778/3547305.3547314

Alkowaileet, W. ; Carey, M. ( January 2022 , Proceedings of the VLDB Endowment)

Full Text Available
Breaking down memory walls: adaptive memory management in LSM-based storage systems

https://doi.org/10.14778/3430915.3430916

Luo, Chen ; Carey, Michael J. ( November 2020 , Proceedings of the VLDB Endowment)

Log-Structured Merge-trees (LSM-trees) have been widely used in modern NoSQL systems. Due to their out-of-place update design, LSM-trees have introduced memory walls among the memory components of multiple LSM-trees and between the write memory and the buffer cache. Optimal memory allocation among these regions is non-trivial because it is highly workload-dependent. Existing LSM-tree implementations instead adopt static memory allocation schemes due to their simplicity and robustness, sacrificing performance. In this paper, we attempt to break down these memory walls in LSM-based storage systems. We first present a memory management architecture that enables adaptive memory management. We then present a partitioned memory component structure with new flush policies to better exploit the write memory to minimize the write cost. To break down the memory wall between the write memory and the buffer cache, we further introduce a memory tuner that tunes the memory allocation between these two regions. We have conducted extensive experiments in the context of Apache AsterixDB using the YCSB and TPC-C benchmarks and we present the results here.
more » « less
Full Text Available
Breaking Down Memory Walls in LSM-based Storage Systems

https://doi.org/10.1145/3318464.3384399

Luo, Chen ( May 2020 , Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data (SIGMOD’20))

Full Text Available
LSM-based storage techniques: a survey

https://doi.org/10.1007/s00778-019-00555-y

Luo, Chen ; Carey, Michael J. ( January 2020 , The VLDB Journal)

Full Text Available
On performance stability in LSM-based storage systems

https://doi.org/10.14778/3372716.3372719

Luo, Chen ; Carey, Michael J. ( December 2019 , Proceedings of the VLDB Endowment)

Full Text Available

« Prev Next »