FetchSGD: Communication-Efficient Federated Learning with Sketching

Rothchild, Daniel; Panda, Ashwinee; Ullah, Enayat; Ivkin, Nikita; Stoica, Ion; Braverman, Vladimir; Gonzalez, Joseph; Arora, Raman

Citation Details

Existing approaches to federated learning suffer from a communication bottleneck as well as convergence issues due to sparse client participation. In this paper we introduce a novel algorithm, called FetchSGD, to overcome these challenges. FetchSGD compresses model updates using a Count Sketch, and then takes advantage of the merge-ability of sketches to combine model updates from many workers. A key insight in the design of FetchSGD is that, because the Count Sketch is linear, momentum and error accumulation can both be carried out within the sketch. This allows the algorithm to move momentum and error accumulation from clients to the central aggregator, overcoming the challenges of sparse client participation while still achieving high compression rates and good convergence. We prove that FetchSGD has favorable convergence guarantees, and we demonstrate its empirical effectiveness by training two residual networks and a transformer model. more »

Award ID(s):: 1943251

PAR ID:: 10213718

Author(s) / Creator(s):: Rothchild, Daniel; Panda, Ashwinee; Ullah, Enayat; Ivkin, Nikita; Stoica, Ion; Braverman, Vladimir; Gonzalez, Joseph; Arora, Raman

Date Published:: 2020-07-01

Journal Name:: International Conference on Machine Learning

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this