Convergo: Multi-SLO-Aware Scheduling for Heterogeneous AI Accelerators on Edge Devices

Jiang, Ting; Hao, Jianwei; Harsha, Sushruth; Rachmanto, Rakandhiya D; Setyanto, Arief; Ramaswamy, Lakshmish; Kim, In Kee

doi:10.1109/EDGE67623.2025.00022

Citation Details

This content will become publicly available on July 7, 2026

Convergo: Multi-SLO-Aware Scheduling for Heterogeneous AI Accelerators on Edge Devices

With the growing prevalence of edge AI, systems are increasingly required to meet stringent and diverse service level objectives (SLOs), such as maintaining specific accuracy levels, ensuring sufficient inference throughput, and meeting deadlines, often simultaneously. However, concurrently achieving these varied and complex SLOs is particularly challenging due to the resource constraints of edge devices and the heterogeneity of AI accelerators. To address this gap, we present a novel AI scheduling framework, Convergo, which uniquely integrates heterogeneous accelerator management, multi-tenancy, and multi-SLO prioritization into one scheduling solution. Convergo not only leverages heterogeneous AI accelerators and supports AI multi-tenancy, but also integrates scheduling heuristics to meet multiple SLOs concurrently. Convergo enables the simultaneous satisfaction of multiple/complex SLO requirements (e.g., accuracy, throughput, and deadline constraints). The scheduling algorithm prioritizes inference requests, imposes critical constraints, and selects the best model combinations for current inferencing. We evaluated Convergo on the Jetson Xavier platform with portable TPU accelerators across various AI workloads, demonstrating its effectiveness. The evaluation results show that Convergo outper- forms state-of-the-art baselines, achieving over 90% satisfaction of all three distinct SLO requirements simultaneously while maintaining approximately 95% satisfaction for individual SLOs. Furthermore, Convergo achieves these results with negligible overhead, making it a promising solution for edge AI systems. more »

Award ID(s):: 2416214

PAR ID:: 10634817

Author(s) / Creator(s):: Jiang, Ting; Hao, Jianwei; Harsha, Sushruth; Rachmanto, Rakandhiya D; Setyanto, Arief; Ramaswamy, Lakshmish; Kim, In Kee

Publisher / Repository:: IEEE

Date Published:: 2025-07-07

ISBN:: 979-8-3315-5559-7

Page Range / eLocation ID:: 115 to 125

Subject(s) / Keyword(s):: Scheduling Edge Computing Edge AI AI Acceleration

Format(s):: Medium: X

Location:: Helsinki, Finland

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on July 7, 2026
Conference Paper:
https://doi.org/10.1109/EDGE67623.2025.00022

More Like this