Principal Component Networks: Parameter Reduction Early in Training

Waleffe, Roger; Rekatsinas, Theodoros

Citation Details

In this paper, we show that hidden layer activations in overparameterized neural networks for image classification exist primarily in subspaces smaller than the actual model width. We further show that these subspaces can be identified early in training. Based on these observations, we show how to efficiently find small networks that exhibit similar accuracy to their overparameterized counterparts after only a few training epochs. We term these network architectures Principal Component Networks (PCNs). We evaluate PCNs on CIFAR-10 and ImageNet for VGG and ResNet style architectures and find that PCNs consistently reduce parameter counts with little accuracy loss, thus providing the potential to reduce the compu- tational costs of deep neural network training. more »

Award ID(s):: 1815538

PAR ID:: 10385069

Author(s) / Creator(s):: Waleffe, Roger; Rekatsinas, Theodoros

Date Published:: 2022-07-23

Journal Name:: International Conference on Machine Learning Workshop on Hardware Aware Efficient Training

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this