Parallel Suffix Sorting for Large String Analytics

Zhihui Du; Sen Zhang; David Bader

doi:10.1007/978-3-031-30442-2_6

Citation Details

The suffix array is a fundamental data structure for efficient string analysis. It took about 26 years for sequential suffix array construction to achieve both O(n) time complexity and in-place sorting. In this paper, we develop the DLPI (D Limited Parallel Induce) algorithm, the first O(n/p) time parallel suffix array construction algorithm. The basic idea of DLPI has two aspects: dividing the O(n) size problem into p reduced sub-problems of size O(n/p), so that they can be handled on p processors in parallel; and developing an efficient parallel induced sorting method that produces the correct order across all the reduced sub-problems. A complete description of the algorithm shows how the proposed idea is implemented. Time and space complexity analysis, with proofs, establishes the correctness and efficiency of the proposed algorithm. The proposed DLPI algorithm can handle large strings with scalable performance.
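As a rough illustration of the terms used in the abstract, the sketch below builds a suffix array naively and then repeats the construction by splitting the suffix start positions into p chunks of about n/p positions each, sorting each chunk independently, and merging the results. This is not the DLPI algorithm: the chunks are sorted sequentially here, a plain k-way merge stands in for the paper's parallel induced sorting step, and comparing whole suffixes does not reach the stated complexity bounds. The function names `naive_suffix_array` and `chunked_suffix_array` are hypothetical and chosen only for this example.

```python
from heapq import merge


def naive_suffix_array(text):
    """Return the start positions of all suffixes of text in lexicographic order."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def chunked_suffix_array(text, p):
    """Sort suffix positions in up to p independent chunks, then merge the chunks.

    Assumption: a k-way merge replaces the induced sorting step of the paper.
    """
    n = len(text)
    size = max(1, -(-n // p))  # ceil(n / p), at least 1
    # Each chunk holds about n/p consecutive suffix start positions.
    chunks = [range(start, min(start + size, n)) for start in range(0, n, size)]
    # Each chunk could be sorted on its own processor; here they run one by one.
    sorted_chunks = [sorted(chunk, key=lambda i: text[i:]) for chunk in chunks]
    # Combine the locally sorted chunks into one globally sorted order.
    return list(merge(*sorted_chunks, key=lambda i: text[i:]))


if __name__ == "__main__":
    print(naive_suffix_array("banana"))
    print(chunked_suffix_array("banana", 3))
```

Both functions return the same ordering for any input; the chunked version only makes the divide-then-combine structure visible.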

Award ID(s): 2109988

PAR ID: 10385347

Author(s) / Creator(s): Zhihui Du; Sen Zhang; David Bader

Date Published: 2022-09-11

Journal Name: 14th International Conference on Parallel Processing and Applied Mathematics (PPAM)

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Conference Paper:
https://doi.org/10.1007/978-3-031-30442-2_6
