Maximal Directed Quasi-Clique Mining

Guo, Guimu; Yan, Da; Yuan, Lyuheng; Khalil, Jalal; Long, Cheng; Jiang, Zhe; Zhou, Yang

Citation Details

Quasi-cliques are a type of dense subgraphs that generalize the notion of cliques, important for applications such as community/module detection in various social and biological networks. However, the existing quasi-clique definition and algorithms are only applicable to undirected graphs. In this paper, we generalize the concept of quasi-cliques to directed graphs by proposing $$(\gamma_1, \gamma_2)$$-quasi-cliques which have density requirements in both inbound and outbound directions of each vertex in a quasi-clique subgraph. An efficient recursive algorithm is proposed to find maximal $$(\gamma_1, \gamma_2)$$-quasi-cliques which integrates many effective pruning rules that are validated by ablation studies. We also study the finding of top-$$k$$ large quasi-cliques directly by bootstrapping the search from more compact quasi-cliques, to scale the mining to larger networks. The algorithms are parallelized with effective load balancing, and we demonstrate that they can scale up effectively with the number of CPU cores. more »

Award ID(s):: 1755464

PAR ID:: 10331905

Author(s) / Creator(s):: Guo, Guimu; Yan, Da; Yuan, Lyuheng; Khalil, Jalal; Long, Cheng; Jiang, Zhe; Zhou, Yang

Date Published:: 2022-01-01

Journal Name:: Proceedings of the 38th IEEE International Conference on Data Engineering (ICDE)

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this