Decentralized Application-Level Adaptive Scheduling for Multi-Instance DNNs on Open Mobile Devices

Sung, Hsin-Hsuan; Chen, Jou-An; Niu, Wei; Guan, Jiexiong; Ren, Bin; Shen, Xipeng

Citation Details

As more apps embrace AI, it is becoming increasingly common that multiple Deep Neural Networks (DNN)-powered apps may run at the same time on a mobile device. This paper explores scheduling in such multi-instance DNN scenarios, on general open mobile systems (e.g., common smartphones and tablets). Unlike closed systems (e.g., autonomous driving systems) where the set of co-run apps is known beforehand, the user of an open mobile system may install or uninstall arbitrary apps at any time, and a centralized solution is subject to adoption barriers. This work proposes the first-known decentralized application-level scheduling mechanism to address the problem. By leveraging the adaptivity of Deep Reinforcement Learning, the solution is shown to make the scheduling of co-run apps converge to a Nash equilibrium point, yielding a good balance of gains among the apps. The solution moreover automatically adapts to the running environment and the underlying OS and hardware. Experiments show that the solution consistently produces significant speedups and energy savings across DNN workloads, hardware configurations, and running scenarios. more »

Award ID(s):: 2047516

PAR ID:: 10462303

Author(s) / Creator(s):: Sung, Hsin-Hsuan; Chen, Jou-An; Niu, Wei; Guan, Jiexiong; Ren, Bin; Shen, Xipeng

Date Published:: 2023-01-01

Journal Name:: Proceedings of the 2023 USENIX Annual Technical Conference

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this