A statistical approach for neural network pruning with application to internet of things

Mao, Chengchen; Liang, Qilian (ORCID:0000000236308010); Pan, Chenyun; Schizas, Ioannis

doi:10.1186/s13638-023-02254-3

Citation Details

A statistical approach for neural network pruning with application to internet of things

Abstract

Pruning is showing huge potential for compressing and accelerating deep neural networks by eliminating redundant parameters. Along with more terminal chips integrated with AI accelerators for internet of things (IoT) devices, structured pruning is gaining popularity with the edge computing research area. Different from filter pruning and group-wise pruning, stripe-wise pruning (SWP) conducts pruning at the level of stripes in each filter. By introducing filter skeleton (FS) to each stripe, the existing SWP method sets an absolute threshold for the values in FS and removes the stripes whose corresponding values in FS could not meet the threshold. Starting with investigation into the process of stripe wise convolution, we use the statistical properties of the weights located on each stripe to learn the importance between those stripes in a filter and remove stripes with low importance. Our pruned VGG-16 achieves the existing results by a fourfold reduction in parameter with only 0.4% decrease in accuracy. Results from comprehensive experiments on IoT devices are also presented.

NSF-PAR ID:: 10416329

Author(s) / Creator(s):: Mao, Chengchen; Liang, Qilian; Pan, Chenyun; Schizas, Ioannis

Publisher / Repository:: Springer Science + Business Media

Date Published:: 2023-05-27

Journal Name:: EURASIP Journal on Wireless Communications and Networking

Volume:: 2023

Issue:: 1

ISSN:: 1687-1499

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1186/s13638-023-02254-3

More Like this