Guaranteed Quantization Error Computation for Neural Network Model Compression

Cooke, Wesley; Mo, Zihao; Xiang, Weiming

doi:10.1109/ICIT58465.2023.10143141

Citation Details

Guaranteed Quantization Error Computation for Neural Network Model Compression

Neural network model compression techniques can address the computation issue of deep neural networks on embedded devices in industrial systems. The guaranteed output error computation problem for neural network compression with quantization is addressed in this paper. A merged neural network is built from a feedforward neural network and its quantized version to produce the exact output difference between two neural networks. Then, optimization-based methods and reachability analysis methods are applied to the merged neural network to compute the guaranteed quantization error. Finally, a numerical example is proposed to validate the applicability and effectiveness of the proposed approach. more »

Award ID(s):: 2143351 2223035

PAR ID:: 10437765

Author(s) / Creator(s):: Cooke, Wesley; Mo, Zihao; Xiang, Weiming

Date Published:: 2023-04-04

Journal Name:: 2023 IEEE International Conference on Industrial Technology (ICIT)

Page Range / eLocation ID:: 1 to 4

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ICIT58465.2023.10143141

More Like this