Distributed Framework for Accelerating Training of Deep Learning Models through Prioritization

Zhou, Tian; Gao, Lixin

doi:10.1109/IC2E52221.2021.00035

Citation Details

Distributed Framework for Accelerating Training of Deep Learning Models through Prioritization

Machine learning models such as deep neural networks have been shown to be successful in solving a wide range of problems. Training such a model typically requires stochastic gradient descent, and the process is time-consuming and expensive in terms of computing resources. In this paper, we propose a distributed framework that supports the prioritized execution of the gradient computation. Our proposed distributed framework identifies important data points through computing or estimating the priority for each data point. We evaluate the proposed distributed framework with several machine learning models including multi-layer perceptron (MLP) and convolutional neural networks (CNN). Our experimental results show that prioritized SGD accelerates the training of machine learning models by as much as 1.6X over that of the mini-batch SGD. Further, the distributed framework scales linearly with the number of workers. more »

Award ID(s):: 1908536

PAR ID:: 10356565

Author(s) / Creator(s):: Zhou, Tian; Gao, Lixin

Date Published:: 2021-10-01

Journal Name:: IEEE International Conference on Cloud Engineering (IC2E)

Page Range / eLocation ID:: 201 to 209

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/IC2E52221.2021.00035

More Like this