Distributed Computation of Persistent Homology from Partitioned Big Data

Malott, Nicholas O.; Verma, Rishi R.; Singh, Rohit P.; Wilsey, Philip A.

doi:10.1109/Cluster48925.2021.00050

Topological Data Analysis is a machine learning method that summarizes the topological features of a space. Persistent Homology (PH) identifies these topological features as they persist within a point cloud, where persistence is measured with respect to the connectedness of the point cloud at increasing distances. The utility of PH is apparent in several fields, including bioinformatics, network security, and object classification. However, the memory complexity of PH limits its application to relatively small point clouds for low-dimensional topological feature identification. For this reason, numerous approaches to optimize and approximate PH have been introduced to provide results over large point clouds. One solution, Partitioned Persistent Homology (PPH), has shown favorable approximation on a single node with significant performance improvement. However, the single-node approach is limited by the available system memory, leading to the need for a distributed approach that provides additional resources, especially memory. This paper studies a distributed version of PPH for use with large point clouds on a high-performance compute cluster. Experimental results of the distributed algorithm against previous studies are presented, along with the scalability of the distributed library.

Award ID(s): 1909096

PAR ID: 10297300

Author(s) / Creator(s): Malott, Nicholas O.; Verma, Rishi R.; Singh, Rohit P.; Wilsey, Philip A.

Date Published: 2021-09-01

Journal Name: IEEE International Conference on Cluster Computing

Page Range / eLocation ID: 344 to 354

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1109/Cluster48925.2021.00050
