Analysis-driven Engineering of Comparison-based Sorting Algorithms on GPUs

Karsin, Ben; Weichert, Volker; Casanova, Henri; Iacono, John; Sitchinava, Nodari

doi:10.1145/3205289.3205298

Citation Details

Analysis-driven Engineering of Comparison-based Sorting Algorithms on GPUs

We study the relationship between memory accesses, bank conflicts, thread multiplicity (also known as over-subscription) and instruction-level parallelism in comparison-based sort- ing algorithms for Graphics Processing Units (GPUs). We experimentally validate a proposed formula that relates these parameters with asymptotic analysis of the number of mem- ory accesses by an algorithm. Using this formula we analyze and compare several GPU sorting algorithms, identifying key performance bottlenecks in each one of them. Based on this analysis we propose a GPU-efficient multiway merge- sort algorithm, GPU-MMS, which minimizes or eliminates these bottlenecks and balances various limiting factors for specific hardware. We realize an implementation of GPU-MMS and compare it to sorting algorithm implementations in state-of-the-art GPU libraries on three GPU architectures. Despite these library implementations being highly optimized, we find that GPU-MMS outperforms them by an average of 21% for random integer inputs and 14% for random key-value pairs. more »

Award ID(s):: 1745331

PAR ID:: 10129687

Author(s) / Creator(s):: Karsin, Ben; Weichert, Volker; Casanova, Henri; Iacono, John; Sitchinava, Nodari

Date Published:: 2018-06-01

Journal Name:: Proceedings of the 2018 International Conference on Supercomputing (ICS)

Page Range / eLocation ID:: 86 to 95

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3205289.3205298

More Like this