Pre-defined Sparsity for Low-Complexity Convolutional Neural Networks

Kundu, Souvik; Nazemi, Mahdi; Pedram, Massoud; Chugg, Keith M.; Beeral, Peter

doi:10.1109/TC.2020.2972520

Citation Details

Pre-defined Sparsity for Low-Complexity Convolutional Neural Networks

The high energy cost of processing deep convolutional neural networks impedes their ubiquitous deployment in energy-constrained platforms such as embedded systems and IoT devices. This article introduces convolutional layers with pre-defined sparse 2D kernels that have support sets that repeat periodically within and across filters. Due to the efficient storage of our periodic sparse kernels, the parameter savings can translate into considerable improvements in energy efficiency due to reduced DRAM accesses, thus promising significant improvements in the trade-off between energy consumption and accuracy for both training and inference. To evaluate this approach, we performed experiments with two widely accepted datasets, CIFAR-10 and Tiny ImageNet in sparse variants of the ResNet18 and VGG16 architectures. Compared to baseline models, our proposed sparse variants require up to ∼82% fewer model parameters with 5.6× fewer FLOPs with negligible loss in accuracy for ResNet18 on CIFAR-10. For VGG16 trained on Tiny ImageNet, our approach requires 5.8× fewer FLOPs and up to ∼83.3% fewer model parameters with a drop in top-5 (top-1) accuracy of only 1.2% ( ∼2.1% ). We also compared the performance of our proposed architectures with that of ShuffleNet and MobileNetV2. Using similar hyperparameters and FLOPs, our ResNet18 variants yield an average accuracy improvement of ∼2.8% . more »

Award ID(s):: 1763747

PAR ID:: 10163056

Author(s) / Creator(s):: Kundu, Souvik; Nazemi, Mahdi; Pedram, Massoud; Chugg, Keith M.; Beeral, Peter

Date Published:: 2020-02-10

Journal Name:: IEEE Transactions on Computers

Volume:: 69

Issue:: 7

ISSN:: 0018-9340

Page Range / eLocation ID:: 1045 to 1058

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1109/TC.2020.2972520

More Like this