skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: The Small-Noise Limit of the Most Likely Element is the Most Likely Element in the Small-Noise Limit
Award ID(s):
2143752
PAR ID:
10613262
Author(s) / Creator(s):
;
Publisher / Repository:
Institute for Mathematical Sciences
Date Published:
Journal Name:
Latin American Journal of Probability and Mathematical Statistics
Volume:
21
Issue:
1
ISSN:
1980-0436
Page Range / eLocation ID:
849
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Standard regularized training procedures correspond to maximizing a posterior distribution over parameters, known as maximum a posteriori (MAP) estimation. However, model parameters are of interest only insomuch as they combine with the functional form of a model to provide a function that can make good predictions. Moreover, the most likely parameters under the parameter posterior do not generally correspond to the most likely function induced by the parameter posterior. In fact, we can re-parametrize a model such that any setting of parameters can maximize the parameter posterior. As an alternative, we investigate the benefits and drawbacks of directly estimating the most likely function implied by the model and the data. We show that this procedure leads to pathological solutions when using neural networks and prove conditions under which the procedure is well-behaved, as well as a scalable approximation. Under these conditions, we find that function-space MAP estimation can lead to flatter minima, better generalization, and improved robustness to overfitting. 
    more » « less
  2. Standard regularized training procedures correspond to maximizing a posterior distribution over parameters, known as maximum a posteriori (MAP) estimation. However, model parameters are of interest only insomuch as they combine with the functional form of a model to provide a function that can make good predictions. Moreover, the most likely parameters under the parameter posterior do not generally correspond to the most likely function induced by the parameter posterior. In fact, we can re-parametrize a model such that any setting of parameters can maximize the parameter posterior. As an alternative, we investigate the benefits and drawbacks of directly estimating the most likely function implied by the model and the data. We show that this procedure leads to pathological solutions when using neural networks and prove conditions under which the procedure is well-behaved, as well as a scalable approximation. Under these conditions, we find that function-space MAP estimation can lead to flatter minima, better generalization, and improved robustness to overfitting 
    more » « less
  3. Climate studies based on global climate models (GCMs) project a steady increase in annual average temperature and severe heat extremes in central North America during the mid-century and beyond. However, the agreement of observed trends with climate model trends varies substantially across the region. The present study focuses on two different locations: Des Moines, IA and Austin, TX. In Des Moines, annual extreme temperatures have not increased over the past three decades unlike the trend of regionally-downscaled GCM data for the Midwest, likely due to a “warming hole” over the area linked to agricultural factors. This warming hole effect is not evident for Austin over the same time period, where extreme temperatures have been higher than projected by regionally-downscaled climate (RDC) forecasts. In consideration of the deviation of such RDC extreme temperature forecasts from observations, this study statistically analyzes RDC data in conjunction with observational data to define for these two cities a 95% prediction interval of heat extreme values by 2040. The statistical model is constructed using a linear combination of RDC ensemble-member annual extreme temperature forecasts with regression coefficients for individual forecasts estimated by optimizing model results against observations over a 52-year training period. 
    more » « less
  4. In this paper, we consider the KPZ equation under the weak noise scaling. That is, we introduce a small parameter \sqrt{\varepsilon} in front of the noise and let \varepsilon \to 0. We prove that the one-point large deviation rate function has a \frac{3}{2} power law in the deep upper tail. Furthermore, by forcing the value of the KPZ equation at a point to be very large, we prove a limit shape of the KPZ equation as \varepsilon \to 0. This confirms the physics prediction in Kolokolov and Korshunov (2007), Kolokolov and Korshunov (2009), Meerson, Katzav, and Vilenkin (2016), Kamenev, Meerson, and Sasorov (2016), and Le Doussal, Majumdar, Rosso, and Schehr (2016). 
    more » « less