NSF PAR Search | NSF Public Access Repository

QUIDAM: A Framework for Quantization-Aware DNN Accelerator and Model Co-Exploration

https://doi.org/10.1145/3555807

Inci, Ahmet; Virupaksha, Siri Garudanagiri; Jain, Aman; Chin, Ting-Wu; Thallam, Venkata Vivek; Ding, Ruizhou; Marculescu, Diana (September 2022, ACM Transactions on Embedded Computing Systems)

As the machine learning and systems communities strive to achieve higher energy-efficiency through custom deep neural network (DNN) accelerators, varied precision or quantization levels, and model compression techniques, there is a need for design space exploration frameworks that incorporate quantization-aware processing elements into the accelerator design space while having accurate and fast power, performance, and area models. In this work, we present QUIDAM, a highly parameterized quantization-aware DNN accelerator and model co-exploration framework. Our framework can facilitate future research on design space exploration of DNN accelerators for various design choices such as bit precision, processing element type, scratchpad sizes of processing elements, global buffer size, number of total processing elements, and DNN configurations. Our results show that different bit precisions and processing element types lead to significant differences in terms of performance per area and energy. Specifically, our framework identifies a wide range of design points where performance per area and energy varies more than 5× and 35×, respectively. With the proposed framework, we show that lightweight processing elements achieve on par accuracy results and up to 5.7× more performance per area and energy improvement when compared to the best INT16 based implementation. Finally, due to the efficiency of the pre-characterized power, performance, and area models, QUIDAM can speed up the design exploration process by 3-4 orders of magnitude as it removes the need for expensive synthesis and characterization of each design.
Full Text Available
ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection

https://doi.org/10.1109/WACV45572.2020.9093418

Chen, Zhuo; Zhang, Jiyuan; Ding, Ruizhou; Marculescu, Diana (March 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV))

In recent years, Convolutional Neural Networks (CNNs) have shown superior capability in visual learning tasks. While accuracy-wise CNNs provide unprecedented performance, they are also known to be computationally intensive and energy demanding for modern computer systems. In this paper, we propose Virtual Pooling (ViP), a model-level approach to improve speed and energy consumption of CNN-based image classification and object detection tasks, with a provable error bound. We show the efficacy of ViP through experiments on four CNN models, three representative datasets, both desktop and mobile platforms, and two visual learning tasks, i.e., image classification and object detection. For example, ViP delivers 2.1x speedup with less than 1.5% accuracy degradation in ImageNet classification on VGG16, and 1.8x speedup with 0.025 mAP degradation in PASCAL VOC object detection with Faster-RCNN. ViP also reduces mobile GPU and CPU energy consumption by up to 55% and 70%, respectively. As a complementary method to existing acceleration approaches, ViP achieves 1.9x speedup on ThiNet leading to a combined speedup of 5.23x on VGG16. Furthermore, ViP provides a knob for machine learning practitioners to generate a set of CNN models with varying trade-offs between system speed/energy consumption and accuracy to better accommodate the requirements of their tasks. Code is available at https://github.com/cmu-enyac/VirtualPooling.
Full Text Available
Single-Path Mobile AutoML: Efficient ConvNet Design and NAS Hyperparameter Optimization

https://doi.org/10.1109/JSTSP.2020.2971421

Stamoulis, Dimitrios; Ding, Ruizhou; Wang, Di; Lymberopoulos, Dimitrios; Priyantha, Bodhi; Liu, Jie; Marculescu, Diana (May 2020, IEEE Journal of Selected Topics in Signal Processing)

Full Text Available
IPSA: Integer Programming via Sparse Approximation for Efficient Test-Chip Design

https://doi.org/10.1109/ICCD46524.2019.00011

Huang, Qicheng; Fang, Chenlei; Liu, Zeye; Ding, Ruizhou; Blanton, R. D. (November 2019, 2019 IEEE 37th International Conference on Computer Design (ICCD))

Full Text Available
AdaScale: Towards Real-time Video Object Detection Using Adaptive Scaling

Chin, Ting-Wu; Ding, Ruizhou; Marculescu, Diana (April 2019, Systems and Machine Learning Conference)

Full Text Available
Regularizing Activation Distribution for Training Binarized Deep Networks

Ding, Ruizhou; Chin, Ting-Wu; Liu, Zeye; Marculescu, Diana (June 2019, IEEE Conference on Computer Vision and Pattern Recognition)

Full Text Available
Single-Path NAS: Designing Hardware-Efficient ConvNets in less than 4 Hours

Stamoulis, Dimitrios; Ding, Ruizhou; Wang, Di; Lymberopoulos, Dimitrios; Priyantha, Bodhi; Liu, Jie; Marculescu, Diana (September 2019, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases)

Full Text Available
FLightNNs: Lightweight Quantized Deep Neural Networks for Fast and Accurate Inference

https://doi.org/10.1145/3316781.3317828

Ding, Ruizhou; Liu, Zeye; Chin, Ting-Wu; Marculescu, Diana; Blanton, R. D. (June 2019, ACM/IEEE Design Automation Conference)

Full Text Available
Lightening the Load with Highly Accurate Storage- and Energy-Efficient LightNNs

https://doi.org/10.1145/3270689

Ding, Ruizhou; Liu, Zeye; Blanton, R. D.; Marculescu, Diana (December 2018, ACM Transactions on Reconfigurable Technology and Systems)

Full Text Available
Single-Path NAS: Device-Aware Efficient ConvNet Design

Stamoulis, Dimitrios; Ding, Ruizhou; Wang, Di; Lymberopoulos, Dimitrios; Priyantha, Bodhi; Liu, Jie; Marculescu, Diana (June 2019, Joint Workshop on On-Device Machine Learning & Compact Deep Neural Network Representations with Industrial Applications (ODML-CDNNRIA) in Conjunction with International Conference on Machine Learning)

Full Text Available
