Engineering a High-Performance GPU B-Tree

Awad, Muhammad A.; Ashkiani, Saman; Johnson, Rob; Farach-Colton, Martín; Owens, John D.

doi:10.1145/3293883.3295706

We engineer a GPU implementation of a B-Tree that supports concurrent queries (point, range, and successor) and updates (insertions and deletions). Our B-Tree outperforms the state of the art, namely a GPU log-structured merge tree (LSM) and a GPU sorted array. In particular, point and range queries are significantly faster than in a GPU LSM (the GPU LSM does not implement successor queries). Furthermore, B-Tree insertions are faster than LSM and sorted array insertions unless insertions come in batches of more than roughly 100k. Because we cache the upper levels of the tree, we achieve lookup throughput that exceeds the DRAM bandwidth of the GPU. We demonstrate that the key limiter of performance on a GPU is contention and describe the design choices that allow us to achieve this high performance.
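The operations named in the abstract (point, range, and successor queries, plus insertions and deletions) can be illustrated with a minimal single-threaded sketch. This is not the paper's GPU B-Tree and makes no attempt at concurrency or caching; it only shows one plausible reading of the query semantics on a sorted key set. The class name `SortedMap`, its method names, the inclusive range bounds, and the "strictly greater" definition of successor are all assumptions made for illustration.

```python
from bisect import bisect_left, bisect_right


class SortedMap:
    """CPU reference for query semantics only (hypothetical; not the paper's design)."""

    def __init__(self):
        self.keys = []   # keys kept in ascending order
        self.vals = {}   # key -> value

    def insert(self, key, value):
        # Add the key at its sorted position if it is new, then set the value.
        if key not in self.vals:
            self.keys.insert(bisect_left(self.keys, key), key)
        self.vals[key] = value

    def delete(self, key):
        # Remove the key if present; deleting a missing key is a no-op.
        if key in self.vals:
            del self.vals[key]
            self.keys.pop(bisect_left(self.keys, key))

    def point_query(self, key):
        # Return the value stored for key, or None if the key is absent.
        return self.vals.get(key)

    def range_query(self, lo, hi):
        # Return all (key, value) pairs with lo <= key <= hi (inclusive bounds assumed).
        i, j = bisect_left(self.keys, lo), bisect_right(self.keys, hi)
        return [(k, self.vals[k]) for k in self.keys[i:j]]

    def successor(self, key):
        # Return the smallest stored key strictly greater than key (assumed definition).
        i = bisect_right(self.keys, key)
        return self.keys[i] if i < len(self.keys) else None
```

A B-Tree answers the same three queries by descending from the root to a leaf and, for range and successor queries, continuing across neighboring leaves; the sorted list here stands in for that leaf level.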

Award ID(s): 1637442, 1745331

PAR ID: 10101116

Date Published: 2019-02-01

Journal Name: Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming

Page Range: 145 to 157

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Conference Paper: https://doi.org/10.1145/3293883.3295706
