MeDNN: A distributed mobile system with enhanced partition and deployment for large-scale DNNs

Mao, Jiachen; Yang, Zhongda; Wen, Wei; Wu, Chunpeng; Song, Linghao; Nixon, Kent W.; Chen, Xiang; Li, Hai; Chen, Yiran

doi:10.1109/ICCAD.2017.8203852

Citation Details

MeDNN: A distributed mobile system with enhanced partition and deployment for large-scale DNNs

Deep Neural Networks (DNNs) are pervasively used in a significant number of applications and platforms. To enhance the execution efficiency of large-scale DNNs, previous attempts focus mainly on client-server paradigms, relying on powerful external infrastructure, or model compression, with complicated pre-processing phases. Though effective, these methods overlook the optimization of DNNs on distributed mobile devices. In this work, we design and implement MeDNN, a local distributed mobile computing system with enhanced partitioning and deployment tailored for large-scale DNNs. In MeDNN, we first propose Greedy Two Dimensional Partition (GTDP), which can adaptively partition DNN models onto several mobile devices w.r.t. individual resource constraints. We also propose Structured Model Compact Deployment (SMCD), a mobile-friendly compression scheme which utilizes a structured sparsity pruning technique to further accelerate DNN execution. Experimental results show that, GTDP can accelerate the original DNN execution time by 1.86 – 2.44⇥ with 2 – 4 worker nodes. By utilizing SMCD, 26.5% of additional computing time and 14.2% of extra communication time are saved, on average, with negligible effect on the model accuracy. more »

Award ID(s):: 1717657 1725456

PAR ID:: 10063490

Author(s) / Creator(s):: Mao, Jiachen; Yang, Zhongda; Wen, Wei; Wu, Chunpeng; Song, Linghao; Nixon, Kent W.; Chen, Xiang; Li, Hai; Chen, Yiran

Date Published:: 2017-11-01

Journal Name:: IEEE/ACM International Conference on Computer Aided Design

Page Range / eLocation ID:: 751 to 756

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ICCAD.2017.8203852

More Like this