BPS: Batching, Pipelining, Surgeon of Continuous Deep Inference on Collaborative Edge Intelligence

Hou, Xueyu; Guan, Yongjie; Choi, Nakjung; Han, Tao

doi:10.1109/TCC.2024.3399616

Citation Details

BPS: Batching, Pipelining, Surgeon of Continuous Deep Inference on Collaborative Edge Intelligence

Users on edge generate deep inference requests continuously over time. Mobile/edge devices located near users can undertake the computation of inference locally for users, e.g., the embedded edge device on an autonomous vehicle. Due to limited computing resources on one mobile/edge device, it may be challenging to process the inference requests from users with high throughput. An attractive solution is to (partially) offload the computation to a remote device in the network. In this paper, we examine the existing inference execution solutions across local and remote devices and propose an adaptive scheduler, a BPS scheduler, for continuous deep inference on collaborative edge intelligence. By leveraging data parallel, neurosurgeon, reinforcement learning techniques, BPS can boost the overall inference performance by up to 8.2× over the baseline schedulers. A lightweight compressor, FF, specialized in compressing intermediate output data for neurosurgeon, is proposed and integrated into the BPS scheduler. FF exploits the operating character of convolutional layers and utilizes efficient approximation algorithms. Compared to existing compression methods, FF achieves up to 86.9% lower accuracy loss and up to 83.6% lower latency overhead. more »

Award ID(s):: 2147623 2006604

PAR ID:: 10555234

Author(s) / Creator(s):: Hou, Xueyu; Guan, Yongjie; Choi, Nakjung; Han, Tao

Publisher / Repository:: IEEE Transactions on Cloud Computing

Date Published:: 2024-07-01

Journal Name:: IEEE Transactions on Cloud Computing

Volume:: 12

Issue:: 3

ISSN:: 2372-0018

Page Range / eLocation ID:: 830 to 843

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1109/TCC.2024.3399616

More Like this