Community Detection with Secondary Latent Variables

Esmaeili, Mohammad; Nosratinia, Aria

doi:10.1109/ISIT44484.2020.9174105

Community detection refers to recovering a latent label on which the distribution of the observed graph depends. Recent work has also investigated the impact of additionally knowing, at each vertex, the value of another variable that is correlated with the vertex label (side information), under the assumption that the side information is independent of the graph edges conditioned on the label. This work extends the scope of community detection in two ways. First, we consider side information that does not form a Markov chain with the label and the graph, and analyze the detection threshold of semidefinite programming given knowledge of this side information, which is a non-label latent variable on which the graph edges also depend. Second, we consider, in addition to the vertex labels, a second latent variable that is unknown both in realization and in distribution, and investigate the performance of semidefinite programming community detection as a function of the (unknown) composition of this nuisance latent variable. In both cases, it is shown that semidefinite programming can achieve exact recovery down to the optimal (information-theoretic) threshold.

Award ID(s): 1711689

PAR ID: 10189021

Author(s) / Creator(s): Esmaeili, Mohammad; Nosratinia, Aria

Date Published: 2020-06-01

Journal Name: International Symposium on Information Theory

Page Range / eLocation ID: 1355 to 1360

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper: https://doi.org/10.1109/ISIT44484.2020.9174105
