BlinkNet: Software-Defined Deep Learning Analytics with Bounded Resources

Koga, Brian; Vanderweide, Theresa; Zhao, Xinghui; Zhang, Xuechen

Citation Details

Deep neural networks (DNNs) have recently gained unprecedented success in various domains. In resource-constrained systems, QoS-aware DNNs are designed to meet latency requirements of mission-critical deep learning applications. However, none of the existing DNNs have been designed to satisfy both latency and memory bounds simultaneously as specified by end-users in the resource-constrained systems. In this paper, we propose BLINKNET, a runtime system that is able to guarantee both latency and memory/storage bounds via efficient QoS-aware per-layer approximation. We implement BLINKNET in Apache TVM and evaluate it using Cifar10-quick and VGG network models. Our experimental results show that BLINKNET can meet the latency and memory requirements with 2% accuracy loss on average. more »

Award ID(s):: 1906541

PAR ID:: 10290174

Author(s) / Creator(s):: Koga, Brian; Vanderweide, Theresa; Zhao, Xinghui; Zhang, Xuechen

Date Published:: 2021-01-18

Journal Name:: 3rd Workshop on Accelerated Machine Learning (AccML) Co-located with the HiPEAC 2021 Conference

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this