Limitations of Post-Hoc Feature Alignment for Robustness

Burns, Collin; Steinhardt, Jacob

Citation Details

Feature alignment is an approach to improving robustness to distribution shift that matches the distribution of feature activations between the training distribution and test distribution. A particularly simple but effective approach to feature alignment involves aligning the batch normalization statistics between the two distributions in a trained neural network. This technique has received renewed interest lately because of its impressive performance on robustness benchmarks. However, when and why this method works is not well understood. We investigate the approach in more detail and identify several limitations. We show that it only significantly helps with a narrow set of distribution shifts and we identify several settings in which it even degrades performance. We also explain why these limitations arise by pinpointing why this approach can be so effective in the first place. Our findings call into question the utility of this approach and Unsupervised Domain Adaptation more broadly for improving robustness in practice. more »

Award ID(s):: 2031899

PAR ID:: 10251087

Author(s) / Creator(s):: Burns, Collin; Steinhardt, Jacob

Date Published:: 2021-01-01

Journal Name:: IEEE Computer Society Conference on Computer Vision and Pattern Recognition

ISSN:: 2332-564X

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this