DTRL: Decision Tree-based Multi-Objective Reinforcement Learning for Runtime Task Scheduling in Domain-Specific System-on-Chips

Basaklar, Toygun; Goksoy, A Alper; Krishnakumar, Anish; Gumussoy, Suat; Ogras, Umit Y

doi:10.1145/3609108

Citation Details

DTRL: Decision Tree-based Multi-Objective Reinforcement Learning for Runtime Task Scheduling in Domain-Specific System-on-Chips

Domain-specific systems-on-chip (DSSoCs) combine general-purpose processors and specialized hardware accelerators to improve performance and energy efficiency for a specific domain. The optimal allocation of tasks to processing elements (PEs) with minimal runtime overheads is crucial to achieving this potential. However, this problem remains challenging as prior approaches suffer from non-optimal scheduling decisions or significant runtime overheads. Moreover, existing techniques focus on a single optimization objective, such as maximizing performance. This work proposes DTRL, a decision-tree-based multi-objective reinforcement learning technique for runtime task scheduling in DSSoCs. DTRL trains a single global differentiable decision tree (DDT) policy that covers the entire objective space quantified by a preference vector. Our extensive experimental evaluations using our novel reinforcement learning environment demonstrate that DTRL captures the trade-off between execution time and power consumption, thereby generating a Pareto set of solutions using a single policy. Furthermore, comparison with state-of-the-art heuristic–, optimization–, and machine learning-based schedulers shows that DTRL achieves up to 9× higher performance and up to 3.08× reduction in energy consumption. The trained DDT policy achieves 120 ns inference latency on Xilinx Zynq ZCU102 FPGA at 1.2 GHz, resulting in negligible runtime overheads. Evaluation on the same hardware shows that DTRL achieves up to 16% higher performance than a state-of-the-art heuristic scheduler. more »

Award ID(s):: 2114499

PAR ID:: 10537760

Author(s) / Creator(s):: Basaklar, Toygun; Goksoy, A Alper; Krishnakumar, Anish; Gumussoy, Suat; Ogras, Umit Y

Publisher / Repository:: ACM Digital Library

Date Published:: 2023-10-31

Journal Name:: ACM Transactions on Embedded Computing Systems

Volume:: 22

Issue:: 5s

ISSN:: 1539-9087

Page Range / eLocation ID:: 1 to 22

Subject(s) / Keyword(s):: Computer systems organization → System on a chip • Hardware → On-chip re- source management Domain-specific system-on-chip, task scheduling, reinforcement learning, decision trees, resource management, multi-objective optimization

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1145/3609108

More Like this