Collaborative Inference in Resource-Constrained Edge Networks: Challenges and Opportunities

Ng, Nathan; Souza, Abel; Diggavi, Suhas; Suri, Niranjan; Abdelzaher, Tarek; Towsley, Don; Shenoy, Prashant

doi:10.1109/MILCOM61039.2024.10773876

Citation Details

Collaborative Inference in Resource-Constrained Edge Networks: Challenges and Opportunities

Many IoT applications have increasingly adopted machine learning (ML) techniques, such as classification and detection, to enhance automation and decision-making processes. With advances in hardware accelerators such as Nvidia’s Jetson embedded GPUs, the computational capabilities of end devices, particularly for ML inference workloads, have significantly improved in recent years. These advances have opened opportunities for distributing computation across the edge network, enabling optimal resource utilization and reducing request latency. Previous research has demonstrated promising results in collaborative inference, where processing units in the edge network, such as end devices and edge servers, collaboratively execute an inference request to minimize latency.This paper explores approaches for implementing collaborative inference on a single model in resource-constrained edge networks, including on-device, device-edge, and edge-edge collaboration. We present preliminary results from proof-of-concept experiments to support each case. We discuss dynamic factors that can impact the performance of these inference execution strategies, such as network variability, thermal constraints, and workload fluctuations. Finally, we outline potential directions for future research. more »

Award ID(s):: 2325956

PAR ID:: 10591367

Author(s) / Creator(s):: Ng, Nathan; Souza, Abel; Diggavi, Suhas; Suri, Niranjan; Abdelzaher, Tarek; Towsley, Don; Shenoy, Prashant

Publisher / Repository:: IEEE

Date Published:: 2024-10-28

ISBN:: 979-8-3503-7423-0

Page Range / eLocation ID:: 1 to 6

Format(s):: Medium: X

Location:: Washington, DC, USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/MILCOM61039.2024.10773876

More Like this