Deep neural networks (DNNs) are increasingly used in time-critical, learning-enabled cyber-physical applications such as autonomous driving and robotics. Despite the growing use of various deep learning models, protecting DNN inference from adversarial threats while preserving model privacy and confidentiality remains a key concern for resource- and timing-constrained autonomous cyber-physical systems. One potential solution, primarily used in general-purpose systems, is to execute the DNN workloads within trusted enclaves available on current off-the-shelf processors. However, ensuring temporal guarantees when running DNN inference within these enclaves poses significant challenges in real-time applications due to (a) the large computational and memory demands of DNN models and (b) the overhead introduced by frequent context switches between “normal” and “trusted” execution modes. This paper introduces new time-aware schemes for dynamic (EDF) and fixed-priority (RM) schedulers to preserve the confidentiality of DNN tasks by running them inside trusted enclaves. We first propose a technique that slices each DNN layer and runs the slices sequentially in the enclave. However, due to the extra context-switch overhead of individual layer slices, we further introduce a novel layer fusion technique. Layer fusion improves real-time guarantees by grouping multiple layers of DNN workload from multiple tasks, thus allowing them to fit and run concurrently within the enclaves while maintaining timing constraints. We implemented and tested our ideas on the Raspberry Pi platform running a DNN-enabled trusted operating system (OP-TEE with DarkNet-TZ) and three DNN architectures (AlexNet-squeezed, Tiny Darknet, YOLOv3-tiny). Compared to the layer-wise partitioning approach, layer fusion can (a) schedule up to 3x more tasksets for EDF and 5x for RM and (b) reduce context switches by up to 11.12x for EDF and by up to 11.06x for RM.
DeepTrust^RT: Confidential Deep Neural Inference Meets Real-Time!
Deep Neural Networks (DNNs) are becoming common in "learning-enabled" time-critical applications such as autonomous driving and robotics. One approach to protect DNN inference from adversarial actions and preserve model privacy/confidentiality is to execute it within trusted enclaves available in modern processors. However, running DNN inference inside limited-capacity enclaves while ensuring timing guarantees is challenging due to (a) the large size of DNN workloads and (b) the extra switching between "normal" and "trusted" execution modes. This paper introduces new time-aware scheduling schemes, DeepTrust^RT, to securely execute deep neural inferences for learning-enabled real-time systems. We first propose a variant of EDF (called DeepTrust^RT-LW) that slices each DNN layer and runs the slices sequentially in the enclave. However, due to the extra context-switch overhead of individual layer slices, we further introduce a novel layer fusion technique (named DeepTrust^RT-FUSION). The proposed scheme provides hard real-time guarantees by fusing multiple layers of DNN workload from multiple tasks, thus allowing them to fit and run concurrently within the enclaves. We implemented and tested the DeepTrust^RT ideas on the Raspberry Pi platform running OP-TEE+DarkNet-TZ DNN APIs and three DNN workloads (AlexNet-squeezed, Tiny Darknet, YOLOv3-tiny). Compared to the layer-wise partitioning approach (DeepTrust^RT-LW), DeepTrust^RT-FUSION can schedule up to 3x more tasksets and reduce context switches by up to 11.12x. We further demonstrate the efficacy of DeepTrust^RT using a flight controller (ArduPilot) case study and find that DeepTrust^RT-FUSION retains real-time guarantees where DeepTrust^RT-LW becomes unschedulable.
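To make the slicing-versus-fusion idea concrete, the sketch below models the context-switch accounting that drives the reported savings. It is a minimal, illustrative Python model, not the paper's implementation: the enclave capacity, per-layer footprints, and the greedy fusion policy are assumptions introduced here for exposition.

```python
# Minimal sketch: compare enclave context switches for layer-wise execution
# vs. greedy layer fusion. All sizes and the capacity are illustrative.
from typing import Dict, List

ENCLAVE_CAPACITY_KB = 512  # assumed secure-memory budget, not from the paper

def layerwise_switches(tasks: Dict[str, List[int]]) -> int:
    """One enclave entry/exit pair per layer slice (layer-wise, LW-style)."""
    return sum(len(layers) for layers in tasks.values())

def fused_switches(tasks: Dict[str, List[int]], capacity_kb: int) -> int:
    """Greedily fuse consecutive layers (possibly across tasks) into groups
    that fit in the enclave; one entry/exit pair per fused group."""
    groups, current = 0, 0
    for layers in tasks.values():
        for size_kb in layers:
            if size_kb > capacity_kb:
                raise ValueError("layer larger than enclave; must be sliced further")
            if current + size_kb > capacity_kb:   # close the current group
                groups += 1
                current = 0
            current += size_kb
    return groups + (1 if current else 0)

if __name__ == "__main__":
    # Hypothetical per-layer secure-memory footprints (KB) for three DNN tasks.
    tasks = {
        "alexnet_squeezed": [120, 200, 90, 60],
        "tiny_darknet":     [80, 150, 70],
        "yolov3_tiny":      [180, 220, 100, 40],
    }
    print("layer-wise switches:", layerwise_switches(tasks))
    print("fused switches:     ", fused_switches(tasks, ENCLAVE_CAPACITY_KB))
```

In the actual system, fusion decisions would also have to respect per-task deadlines under EDF/RM; this sketch only captures why grouping layers reduces the number of normal/trusted world transitions.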
- Award ID(s):
- 2312006
- PAR ID:
- 10596398
- Editor(s):
- Pellizzoni, Rodolfo
- Publisher / Repository:
- Schloss Dagstuhl – Leibniz-Zentrum für Informatik
- Date Published:
- Volume:
- 298
- ISSN:
- 1868-8969
- ISBN:
- 978-3-95977-324-9
- Page Range / eLocation ID:
- 13:1-13:24
- Subject(s) / Keyword(s):
- DNN; TrustZone; Real-Time Systems; Computer systems organization → Real-time systems; Security and privacy → Systems security
- Format(s):
- Medium: X; Size: 24 pages, 1539574 bytes; Other: application/pdf
- Size(s):
- 24 pages; 1539574 bytes
- Right(s):
- Creative Commons Attribution 4.0 International license; info:eu-repo/semantics/openAccess
- Sponsoring Org:
- National Science Foundation
More Like this
-
We propose Zygarde, an energy- and accuracy-aware soft real-time task scheduling framework for batteryless systems that flexibly executes deep learning tasks suitable for running on microcontrollers. The sporadic nature of harvested energy, resource constraints of the embedded platform, and the computational demand of deep neural networks (DNNs) pose a unique and challenging real-time scheduling problem for which no solutions have been proposed in the literature. We empirically study the problem and model the energy harvesting pattern as well as the trade-off between the accuracy and execution of a DNN. We develop an imprecise-computing-based scheduling algorithm that improves the timeliness of DNN tasks on intermittently powered systems. We evaluate Zygarde using four standard datasets as well as by deploying it in six real-life applications involving audio and camera sensor systems. Results show that Zygarde decreases the execution time by up to 26% and schedules 9%–34% more tasks with up to 21% higher inference accuracy, compared to traditional schedulers such as earliest deadline first (EDF).
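As a rough illustration of the imprecise-computing idea described above (not Zygarde's actual algorithm), the following Python sketch runs a mandatory portion of a DNN and then adds optional stages only while the estimated energy and deadline budgets allow; the stage costs and accuracy gains are invented numbers.

```python
# Illustrative imprecise-computing admission of optional DNN stages.
# Costs, energy figures, and accuracy gains are invented for exposition.

def plan_inference(stages, energy_budget_mj, time_budget_ms, mandatory=1):
    """Pick how many stages to run: all mandatory ones, then optional stages
    greedily while energy and deadline headroom remain."""
    chosen, energy, time, accuracy = [], 0.0, 0.0, 0.0
    for i, (e_mj, t_ms, acc_gain) in enumerate(stages):
        is_mandatory = i < mandatory
        if not is_mandatory and (energy + e_mj > energy_budget_mj
                                 or time + t_ms > time_budget_ms):
            break  # stop adding optional stages; return an imprecise result
        chosen.append(i)
        energy += e_mj
        time += t_ms
        accuracy += acc_gain
    return chosen, accuracy

if __name__ == "__main__":
    # (energy mJ, latency ms, accuracy gain) per stage of a hypothetical DNN
    stages = [(5.0, 12.0, 0.60), (4.0, 10.0, 0.15),
              (4.0, 10.0, 0.10), (6.0, 15.0, 0.05)]
    print(plan_inference(stages, energy_budget_mj=10.0, time_budget_ms=30.0))
```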
-
It is challenging to deploy 3D Convolutional Neural Networks (3D CNNs) on mobile devices, specifically if both real-time execution and high inference accuracy are in demand, because the increasingly large model size and complex model structure of 3D CNNs usually require tremendous computation and memory resources. Weight pruning is proposed to mitigate this challenge. However, existing pruning is either not compatible with modern parallel architectures, resulting in long inference latency, or subject to significant accuracy degradation. This paper proposes an end-to-end 3D CNN acceleration framework based on pruning/compilation co-design, called Mobile-3DCNN, that consists of two parts: a novel, fine-grained structured pruning enhanced by a prune/Winograd adaptive selection (that is mobile-hardware-friendly and can achieve high pruning accuracy), and a set of compiler optimization and code generation techniques enabled by our pruning (to fully transform the pruning benefit into real performance gains). The evaluation demonstrates that Mobile-3DCNN outperforms state-of-the-art end-to-end DNN acceleration frameworks that support 3D CNN execution on mobile devices, Alibaba Mobile Neural Networks and Pytorch-Mobile, with speedups of up to 34× with minor accuracy degradation, proving it is possible to execute high-accuracy large 3D CNNs on mobile devices in real time (or even ultra-real time).
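To give a flavor of the fine-grained structured (block-based) pruning such frameworks build on, here is a minimal NumPy sketch that zeroes whole small blocks of a weight matrix by magnitude; the block shape and sparsity target are arbitrary assumptions, and this is not Mobile-3DCNN's actual scheme.

```python
# Minimal block-structured magnitude pruning sketch (illustrative only).
import numpy as np

def block_prune(weights: np.ndarray, block=(4, 4), sparsity=0.5) -> np.ndarray:
    """Zero out the lowest-magnitude (block[0] x block[1]) blocks so that
    roughly `sparsity` of all blocks are removed."""
    h, w = weights.shape
    bh, bw = block
    assert h % bh == 0 and w % bw == 0, "pad weights to a multiple of the block size"
    # Score each block by its L1 norm.
    blocks = weights.reshape(h // bh, bh, w // bw, bw)
    scores = np.abs(blocks).sum(axis=(1, 3))             # shape (h//bh, w//bw)
    k = int(scores.size * sparsity)                       # number of blocks to drop
    if k == 0:
        return weights.copy()
    threshold = np.partition(scores.ravel(), k - 1)[k - 1]
    keep = (scores > threshold)[:, None, :, None]         # mask broadcast over blocks
    return (blocks * keep).reshape(h, w)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.normal(size=(8, 8)).astype(np.float32)
    pruned = block_prune(w, block=(4, 4), sparsity=0.5)
    print("zeroed fraction:", float((pruned == 0).mean()))
```

Keeping the zeros in regular blocks (rather than scattered) is what lets a mobile compiler skip whole tiles of computation and actually realize the speedup.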
-
Weight pruning is an effective model compression technique to tackle the challenges of achieving real-time deep neural network (DNN) inference on mobile devices. However, prior pruning schemes have limited application scenarios due to accuracy degradation, difficulty in leveraging hardware acceleration, and/or restriction to certain types of DNN layers. In this article, we propose a general, fine-grained structured pruning scheme and corresponding compiler optimizations that are applicable to any type of DNN layer while achieving high accuracy and hardware inference performance. With the flexibility of applying different pruning schemes to different layers enabled by our compiler optimizations, we further probe into the new problem of determining the best-suited pruning scheme for each layer, considering the different acceleration and accuracy performance of various pruning schemes. Two pruning scheme mapping methods, one search-based and the other rule-based, are proposed to automatically derive the best-suited pruning regularity and block size for each layer of any given DNN. Experimental results demonstrate that our pruning scheme mapping methods, together with the general fine-grained structured pruning scheme, outperform the state-of-the-art DNN optimization framework with up to 2.48× and 1.73× DNN inference acceleration on CIFAR-10 and ImageNet datasets without accuracy loss.
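The sketch below hints at what a rule-based mapping of pruning schemes to layers might look like; the rules, layer descriptors, and scheme names are hypothetical and are not taken from the article.

```python
# Hypothetical rule-based selection of a pruning scheme per DNN layer.
# Layer descriptors, thresholds, and scheme names are invented for illustration.

def choose_scheme(layer: dict) -> str:
    """Map a layer to a pruning regularity using simple hand-written rules."""
    if layer["type"] == "conv" and layer["kernel"] >= 3:
        # Spatial convolutions: prune whole small blocks of the weight tensor.
        return "block-4x4"
    if layer["type"] == "conv":            # 1x1 / pointwise convolutions
        return "pattern-based"
    if layer["type"] == "fc" and layer["params"] > 1_000_000:
        return "block-8x8"                 # large dense layers tolerate coarser blocks
    return "unstructured"                  # small layers: fine-grained pruning

if __name__ == "__main__":
    layers = [
        {"name": "conv1", "type": "conv", "kernel": 3, "params": 9_408},
        {"name": "conv2", "type": "conv", "kernel": 1, "params": 16_384},
        {"name": "fc",    "type": "fc",   "kernel": 0, "params": 4_096_000},
    ]
    for layer in layers:
        print(layer["name"], "->", choose_scheme(layer))
```

A search-based mapper would instead profile candidate schemes per layer and pick the one with the best measured accuracy/latency trade-off.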
-
Hardware enclaves are designed to execute small pieces of sensitive code or to operate on sensitive data, in isolation from larger, less trusted systems. Partitioning a large, legacy application requires significant effort. Partitioning an application written in a managed language, such as Java, is more challenging because of mutable language characteristics, extensive code reachability in class libraries, and the inevitability of using a heavyweight runtime. Civet is a framework for partitioning Java applications into enclaves. Civet reduces the number of lines of code in the enclave and uses language-level defenses, including deep type checks and dynamic taint-tracking, to harden the enclave interface. Civet also contributes a partitioned Java runtime design, including a garbage collection design optimized for the peculiarities of enclaves. Civet is efficient for data-intensive workloads; partitioning a Hadoop mapper reduces the enclave overhead from 10× to 16–22% without taint-tracking or 70–80% with taint-tracking.
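As a loose illustration of the boundary-hardening idea (deep checks on data crossing into an enclave), here is a small Python sketch; the check rules and the "tainted" wrapper are invented for this example and are unrelated to Civet's actual Java implementation.

```python
# Toy illustration of hardening an enclave entry point: validate the full
# structure of incoming data and refuse values marked as tainted.
# The Tainted wrapper and the allowed shapes are assumptions for this sketch.

class Tainted:
    """Marks a value as influenced by untrusted input."""
    def __init__(self, value):
        self.value = value

def deep_check(obj, depth=0):
    """Recursively verify that only plain, untainted data reaches the enclave."""
    if depth > 8:
        raise ValueError("object graph too deep for the enclave interface")
    if isinstance(obj, Tainted):
        raise ValueError("tainted value rejected at the enclave boundary")
    if isinstance(obj, (int, float, str, bytes, bool, type(None))):
        return
    if isinstance(obj, (list, tuple)):
        for item in obj:
            deep_check(item, depth + 1)
        return
    if isinstance(obj, dict):
        for key, val in obj.items():
            deep_check(key, depth + 1)
            deep_check(val, depth + 1)
        return
    raise TypeError(f"type {type(obj).__name__} not allowed across the boundary")

def enclave_entry(request: dict) -> str:
    deep_check(request)            # gate every call into the "trusted" side
    return f"processed {len(request)} fields inside the enclave"

if __name__ == "__main__":
    print(enclave_entry({"user": "alice", "values": [1, 2, 3]}))
    try:
        enclave_entry({"user": Tainted("attacker-controlled")})
    except ValueError as err:
        print("rejected:", err)
```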