Convex Bounds on the Softmax Function with Applications to Robustness Verification

Wei, Dennis; Wu, Haoze; Wu, Min; Chen, Pin-Yu; Barrett, Clark; Farchi, Eitan

The softmax function is a ubiquitous component at the output of neural networks and, increasingly, in intermediate layers as well. This paper provides convex lower bounds and concave upper bounds on the softmax function, which are compatible with convex optimization formulations for characterizing neural networks and other ML models. We derive bounds using both a natural exponential-reciprocal decomposition of the softmax and an alternative decomposition in terms of the log-sum-exp function. The new bounds are provably and/or numerically tighter than linear bounds obtained in previous work on robustness verification of transformers. To illustrate the utility of the bounds, we apply them to the verification of transformers and to the robustness of predictive uncertainty estimates of deep ensembles.
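The two decompositions named in the abstract can be illustrated with a minimal sketch. The function names below are hypothetical and are not taken from the paper. The sketch assumes the standard identities softmax_j(x) = 1 / sum_i exp(x_i - x_j) (exponential-reciprocal form) and softmax_j(x) = exp(x_j - LSE(x)) (log-sum-exp form). The box bounds at the end are only the elementary constant bounds that follow from monotonicity, included as a baseline for comparison; they are not the convex and concave bounds derived in the paper.

```python
import math


def log_sum_exp(x):
    """Numerically stable log(sum_i exp(x_i))."""
    m = max(x)
    return m + math.log(sum(math.exp(v - m) for v in x))


def softmax_exp_reciprocal(x, j):
    """Exponential-reciprocal form: 1 / sum_i exp(x_i - x_j).

    Illustrative only: exp may overflow for very large differences.
    """
    return 1.0 / sum(math.exp(v - x[j]) for v in x)


def softmax_lse(x, j):
    """Log-sum-exp form: exp(x_j - LSE(x))."""
    return math.exp(x[j] - log_sum_exp(x))


def softmax_box_bounds(lower, upper, j):
    """Constant bounds on softmax_j over the box lower <= x <= upper.

    Assumption: a simple monotonicity baseline, not the paper's bounds.
    softmax_j increases in x_j and decreases in every other x_i, so the
    minimum is attained at x_j = lower[j], x_i = upper[i], and the
    maximum at x_j = upper[j], x_i = lower[i].
    """
    others = [i for i in range(len(lower)) if i != j]
    lo = 1.0 / (1.0 + sum(math.exp(upper[i] - lower[j]) for i in others))
    hi = 1.0 / (1.0 + sum(math.exp(lower[i] - upper[j]) for i in others))
    return lo, hi


if __name__ == "__main__":
    x = [1.0, 2.0, 0.5]
    print(softmax_exp_reciprocal(x, 1), softmax_lse(x, 1))
    print(softmax_box_bounds([0.5, 1.5, 0.0], [1.5, 2.5, 1.0], 1))
```

Both forms compute the same value. They differ in which intermediate functions (exponential and reciprocal, or log-sum-exp and exponential) would have to be bounded in a verification setting.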

Award ID(s): 2211505

PAR ID: 10475594

Author(s) / Creator(s): Wei, Dennis; Wu, Haoze; Wu, Min; Chen, Pin-Yu; Barrett, Clark; Farchi, Eitan

Editor(s): Ruiz, Francisco; Dy, Jennifer; van de Meent, Jan-Willem

Publisher / Repository: PMLR

Date Published: 2023-04-25

Journal Name: Proceedings of Machine Learning Research

Volume: 206

Page Range / eLocation ID: 6853 to 6878

Location: Valencia, Spain

Sponsoring Org: National Science Foundation
