Abstract
Machine learning (ML) has been applied to space weather problems with increasing frequency in recent years, driven by an influx of in-situ measurements and a desire to improve modeling and forecasting capabilities throughout the field. Space weather originates from solar perturbations and comprises the resulting complex variations they cause within the numerous systems between the Sun and Earth. These systems are often tightly coupled and not well understood, which creates a need for skillful models that also convey the confidence of their predictions. One example of a dynamical system highly impacted by space weather is the thermosphere, the neutral region of Earth's upper atmosphere. Our inability to forecast it has severe repercussions for satellite drag and for the computation of the probability of collision between two space objects in low Earth orbit (LEO), both of which inform decision making in space operations. Even with an (assumed) perfect forecast of model drivers, our incomplete knowledge of the system often results in inaccurate thermospheric neutral mass density predictions. Continuing efforts are being made to improve model accuracy, but density models rarely provide estimates of confidence in their predictions. In this work, we propose two techniques for developing nonlinear ML regression models that predict thermospheric density while providing robust and reliable uncertainty estimates: Monte Carlo (MC) dropout and direct prediction of the probability distribution, both using the negative logarithm of predictive density (NLPD) loss function. We show the performance of models trained on both local and global datasets. The NLPD loss provides similar results for both techniques, but the direct probability-distribution prediction method has a much lower computational cost. For the global model regressed on the Space Environment Technologies High Accuracy Satellite Drag Model (HASDM) density database, we achieve errors of approximately 11% on independent test data with well-calibrated uncertainty estimates. Using an in-situ CHAllenging Minisatellite Payload (CHAMP) density dataset, models developed with both techniques provide test errors on the order of 13%. On validation and test data, the CHAMP models are within 2% of perfect calibration for the twenty prediction intervals tested. We also show that this model can be used to obtain global density predictions with uncertainties at a given epoch.
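The two uncertainty techniques named in the abstract can be sketched compactly. The code below is a minimal illustration under assumed choices (a small fully connected network, a Gaussian predictive distribution, placeholder driver inputs), not the authors' implementation: the model directly predicts a mean and log-variance trained with the NLPD loss, and MC dropout keeps the dropout layers stochastic at inference to add an epistemic spread.

```python
# Minimal sketch (not the paper's code): Gaussian NLPD loss, direct
# mean/variance prediction, and MC-dropout uncertainty for a density model.
import math
import torch
import torch.nn as nn

LOG_2PI = math.log(2.0 * math.pi)

def nlpd_loss(mu, log_var, y):
    """Negative log predictive density of y under N(mu, exp(log_var))."""
    return 0.5 * (LOG_2PI + log_var + (y - mu) ** 2 / log_var.exp()).mean()

class DensityNet(nn.Module):
    """Maps space-weather drivers to a predicted log-density distribution."""
    def __init__(self, n_inputs, hidden=64, p_drop=0.1):
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(n_inputs, hidden), nn.ReLU(), nn.Dropout(p_drop),
            nn.Linear(hidden, hidden), nn.ReLU(), nn.Dropout(p_drop),
        )
        self.mu_head = nn.Linear(hidden, 1)       # predicted mean
        self.log_var_head = nn.Linear(hidden, 1)  # predicted log-variance

    def forward(self, x):
        h = self.body(x)
        return self.mu_head(h), self.log_var_head(h)

def mc_dropout_predict(model, x, n_samples=100):
    """Keep dropout active at inference; combine aleatoric and epistemic variance."""
    model.train()  # leaves the Dropout layers stochastic
    with torch.no_grad():
        outs = [model(x) for _ in range(n_samples)]
    mus = torch.stack([mu for mu, _ in outs])
    variances = torch.stack([lv.exp() for _, lv in outs])
    mean = mus.mean(dim=0)
    total_var = variances.mean(dim=0) + mus.var(dim=0)
    return mean, total_var
```

Training minimizes nlpd_loss on (driver, log-density) pairs; the direct-prediction variant simply disables dropout at inference and reports the predicted variance alone.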
Measuring Stochastic Data Complexity with Boltzmann Influence Functions
Estimating the uncertainty of a model’s prediction on a test point is a crucial part of ensuring reliability and calibration under distribution shifts. A minimum description length approach to this problem uses the predictive normalized maximum likelihood (pNML) distribution, which considers every possible label for a data point, and decreases confidence in a prediction if other labels are also consistent with the model and training data. In this work, we propose IF-COMP, a scalable and efficient approximation of the pNML distribution that linearizes the model with a temperature-scaled Boltzmann influence function. IF-COMP can be used to produce well-calibrated predictions on test points as well as measure complexity in both labelled and unlabelled settings. We experimentally validate IF-COMP on uncertainty calibration, mislabel detection, and OOD detection tasks, where it consistently matches or beats strong baseline methods.
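For context on what IF-COMP approximates, the exact (naive) pNML distribution can be computed for small models by refitting once per candidate label. The sketch below does this with a scikit-learn logistic regression; it is an illustrative baseline with placeholder names, not IF-COMP itself, which replaces the refits with a temperature-scaled Boltzmann influence-function linearization.

```python
# Naive pNML sketch: refit the model once per candidate label of the test
# point, then normalize. IF-COMP approximates this without any refitting.
import numpy as np
from sklearn.linear_model import LogisticRegression

def pnml_distribution(X_train, y_train, x_test, n_classes):
    unnormalized = np.zeros(n_classes)
    for y_cand in range(n_classes):
        # Maximum-likelihood fit on the training data plus (x_test, y_cand).
        X_aug = np.vstack([X_train, x_test[None, :]])
        y_aug = np.append(y_train, y_cand)
        model = LogisticRegression(max_iter=1000).fit(X_aug, y_aug)
        proba = model.predict_proba(x_test[None, :])[0]
        # Probability the refit model assigns to the label it was fit with.
        unnormalized[y_cand] = proba[list(model.classes_).index(y_cand)]
    # The log of the normalizer is the point's stochastic complexity: large
    # when several labels fit the data well, small when one label dominates.
    return unnormalized / unnormalized.sum()
```

Confidence drops exactly when alternative labels can also be fit well, which is the behaviour IF-COMP is designed to scale to deep networks.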
- Award ID(s): 2339381
- PAR ID: 10580636
- Publisher / Repository: Proceedings of the 41st International Conference on Machine Learning (ICML)
- Date Published:
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- Wang, L; Zhang, JM; Wang, R (Ed.) Liquefaction under cyclic loads can be predicted through advanced (liquefaction-capable) material constitutive models. However, such constitutive models have several input parameters whose values are often unknown or imprecisely known, requiring calibration via lab/in-situ test data. This study proposes a Bayesian updating framework that integrates probabilistic calibration of the soil model and probabilistic prediction of lateral spreading due to seismic liquefaction. In particular, the framework consists of three main parts: (1) Parametric study based on global sensitivity analysis, (2) Bayesian calibration of the primary input parameters of the constitutive model, and (3) Forward uncertainty propagation through a computational model simulating the response of a soil column under earthquake loading. For demonstration, the PM4Sand model is adopted, and cyclic strength data of Ottawa F-65 sand from cyclic direct simple shear tests are utilized to calibrate the model. The three main uncertainty analyses are performed using quoFEM, a SimCenter open-source software application for uncertainty quantification and optimization in the field of natural hazard engineering. The results demonstrate the potential of the framework linked with quoFEM to perform calibration and uncertainty propagation using sophisticated simulation models that can be part of a performance-based design workflow. (A generic calibration sketch appears after this list.)
- Learning multi-agent dynamics is a core AI problem with broad applications in robotics and autonomous driving. While most existing works focus on deterministic prediction, producing probabilistic forecasts to quantify uncertainty and assess risks is critical for downstream decision-making tasks such as motion planning and collision avoidance. Multi-agent dynamics often contains internal symmetry. By leveraging symmetry, specifically rotation equivariance, we can improve not only prediction accuracy but also uncertainty calibration. We introduce the Energy Score, a proper scoring rule, to evaluate probabilistic predictions. We propose a novel deep dynamics model, Probabilistic Equivariant Continuous COnvolution (PECCO), for probabilistic prediction of multi-agent trajectories. PECCO extends equivariant continuous convolution to model the joint velocity distribution of multiple agents and uses dynamics integration to propagate uncertainty from velocity to position. On both synthetic and real-world datasets, PECCO shows significant improvements in accuracy and calibration compared to non-equivariant baselines. (An Energy Score sketch appears after this list.)
- Conformal prediction builds marginally valid prediction intervals that cover the unknown outcome of a randomly drawn test point with a prescribed probability. However, in practice, data-driven methods are often used to identify specific test unit(s) of interest, requiring uncertainty quantification tailored to these focal units. In such cases, marginally valid conformal prediction intervals may fail to provide valid coverage for the focal unit(s) due to selection bias. This article presents a general framework for constructing a prediction set with finite-sample exact coverage, conditional on the unit being selected by a given procedure. The general form of our method accommodates arbitrary selection rules that are invariant to permutation of the calibration units and generalizes Mondrian Conformal Prediction to multiple test units and non-equivariant classifiers. We also work out computationally efficient implementations of our framework for a number of realistic selection rules, including top-K selection, optimization-based selection, selection based on conformal p-values, and selection based on properties of preliminary conformal prediction sets. The performance of our methods is demonstrated via applications in drug discovery and health risk prediction. (A split-conformal baseline sketch appears after this list.)
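The Bayesian-calibration step in the liquefaction study above can be illustrated generically. The sketch below is not quoFEM or PM4Sand; the simulator, data, prior bounds, and noise level are stand-ins. A random-walk Metropolis sampler updates the uncertain constitutive-model parameters against observed cyclic-strength data, and the posterior draws can then be pushed through the forward model for uncertainty propagation.

```python
# Generic Bayesian-calibration sketch (stand-in for the quoFEM workflow):
# random-walk Metropolis over constitutive-model parameters, then forward
# uncertainty propagation by re-running the simulator on posterior draws.
import numpy as np

def log_posterior(theta, simulate, data, sigma_obs, prior_bounds):
    lo, hi = prior_bounds
    if np.any(theta < lo) or np.any(theta > hi):
        return -np.inf                      # outside the uniform prior support
    resid = data - simulate(theta)          # simulator: e.g. cyclic-strength curve
    return -0.5 * np.sum((resid / sigma_obs) ** 2)

def metropolis(simulate, data, theta0, sigma_obs, prior_bounds,
               step=0.05, n_steps=5000, seed=0):
    rng = np.random.default_rng(seed)
    theta = theta0.copy()
    logp = log_posterior(theta, simulate, data, sigma_obs, prior_bounds)
    samples = []
    for _ in range(n_steps):
        prop = theta + step * rng.standard_normal(theta.size)
        logp_prop = log_posterior(prop, simulate, data, sigma_obs, prior_bounds)
        if np.log(rng.uniform()) < logp_prop - logp:
            theta, logp = prop, logp_prop   # accept the proposal
        samples.append(theta.copy())
    return np.array(samples)

# Forward propagation: evaluate the soil-column response model on thinned
# posterior draws and summarize lateral-spreading predictions with quantiles.
```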
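The Energy Score used to evaluate PECCO's probabilistic forecasts has a standard sample-based estimator, sketched below with placeholder shapes: an ensemble of predicted positions is scored against the observed one.

```python
# Sample-based Energy Score: a proper scoring rule for multivariate forecasts
# (lower is better).
import numpy as np

def energy_score(samples, observation):
    """samples: (m, d) draws from the forecast; observation: (d,) ground truth."""
    m = samples.shape[0]
    term1 = np.linalg.norm(samples - observation, axis=1).mean()
    pair_dists = np.linalg.norm(samples[:, None, :] - samples[None, :, :], axis=-1)
    term2 = 0.5 * pair_dists.sum() / (m * m)
    return term1 - term2
```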
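Finally, the marginal coverage guarantee that the conformal-prediction work starts from can be reproduced with ordinary split conformal prediction; the sketch below is that baseline under assumed names, not the paper's selection-conditional procedure.

```python
# Standard split conformal prediction: marginally valid intervals for a
# regression model (baseline, not the selection-conditional method above).
import numpy as np

def split_conformal_interval(predict, X_calib, y_calib, x_test, alpha=0.1):
    """Interval with >= 1 - alpha marginal coverage under exchangeability."""
    scores = np.abs(y_calib - predict(X_calib))        # absolute residuals
    n = len(scores)
    q_level = np.ceil((n + 1) * (1 - alpha)) / n       # finite-sample correction
    qhat = np.quantile(scores, min(q_level, 1.0), method="higher")
    center = predict(x_test[None, :])[0]
    return center - qhat, center + qhat
```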