CM-GCN: A Distributed Framework for Graph Convolutional Networks using Cohesive Mini-batches

Zhao, Guoyi; Zhou, Tian; Gao, Lixin

doi:10.1109/BigData52589.2021.9671931

Citation Details

CM-GCN: A Distributed Framework for Graph Convolutional Networks using Cohesive Mini-batches

Graph convolutional network (GCN) has been shown effective in many applications with graph structures. However, training a large-scale GCN is still challenging due to the high computation cost that grows with the size of the graph. In this paper, we propose CM-GCN, a distributed GCN framework using cohesive mini-batches to accelerate large-scale GCN training. The cohesive mini-batches group nodes that are tightly connected in the graph. As a result, CM-GCN can reduce the computation required to train a GCN. We propose a computation cost function to quantify the computation required for mini-batches. By exploring the submodular property of the computation cost function, we develop an efficient algorithm to partition nodes into tightly coupled mini-batches. Based on the computation cost function, we evenly distribute the workloads of mini-batches to workers. We design asynchronous computations between GCN layers to further eliminating the waiting among workers. We implement a CM-GCN framework and evaluate its performance with graphs that contain millions of nodes. Our evaluation shows that CM-GCN can achieve up to 3X speedup without compromising the training accuracy. more »

Award ID(s):: 1908536

PAR ID:: 10356562

Author(s) / Creator(s):: Zhao, Guoyi; Zhou, Tian; Gao, Lixin

Date Published:: 2021-12-15

Journal Name:: IEEE International Conference on Big Data (Big Data)

Page Range / eLocation ID:: 153 to 163

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/BigData52589.2021.9671931

More Like this