Fast Stochastic Block Partitioning via Sampling

Wanye, Frank; Gleyzer, Vitaliy; Feng, Wu-chun

doi:10.1109/HPEC.2019.8916542

Community detection in graphs, also known as graph partitioning, is a well-studied NP-hard problem. Various heuristic approaches have been adopted to tackle this problem in polynomial time. One such approach, as outlined in the IEEE HPEC Graph Challenge, is Bayesian statistics-based stochastic block partitioning. This method delivers high-quality partitions in sub-quadratic runtime, but it fails to scale to very large graphs. In this paper, we present sampling as an avenue for speeding up the algorithm on large graphs. We first show that existing sampling techniques can preserve a graph’s community structure. We then show that sampling for stochastic block partitioning can be used to produce a speedup of between 2.18× and 7.26× for graph sizes between 5,000 and 50,000 vertices without a significant loss in the accuracy of community detection.
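
The abstract does not name the sampling techniques evaluated, so the following is only a minimal sketch of one common technique, uniform random vertex sampling with an induced subgraph, to illustrate how sampling shrinks the input a partitioner has to process. The function name `sample_induced_subgraph`, its parameters, and the toy graph are all hypothetical and are not taken from the paper.

```python
import random


def sample_induced_subgraph(edges, num_vertices, fraction, seed=0):
    """Uniform random vertex sampling (illustrative assumption, not the paper's method).

    Keeps roughly `fraction` of the vertices and only those edges whose
    two endpoints were both kept (the induced subgraph).
    """
    rng = random.Random(seed)
    k = max(1, int(num_vertices * fraction))
    kept = set(rng.sample(range(num_vertices), k))
    sub_edges = [(u, v) for (u, v) in edges if u in kept and v in kept]
    return kept, sub_edges


# Toy graph: two 4-vertex cliques (two "communities") joined by one bridge edge.
edges = [(u, v)
         for block in (range(0, 4), range(4, 8))
         for u in block for v in block if u < v]
edges.append((3, 4))

kept, sub_edges = sample_induced_subgraph(edges, num_vertices=8, fraction=0.5)
print(len(kept), "vertices kept;", len(sub_edges), "of", len(edges), "edges kept")
```

A partitioner run on `sub_edges` sees fewer vertices and edges than one run on `edges`, which is the source of any runtime saving; whether community structure survives depends on the sampling technique and sample size, which is the question the paper studies.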

Award ID(s): 1822080

PAR ID: 10188574

Author(s) / Creator(s): Wanye, Frank; Gleyzer, Vitaliy; Feng, Wu-chun

Date Published: 2019-09-01

Journal Name: IEEE High Performance Extreme Computing Conference

Page Range / eLocation ID: 1 to 7

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1109/HPEC.2019.8916542
