skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Molina, Maria J"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract It is widely agreed that subseasonal-to-seasonal (S2S) predictability arises from the atmospheric initial state during early lead times and from the land and ocean during long lead times. We test this hypothesis for the large-scale mid-latitude atmosphere by training numerous XGBoost models to predict weather regimes (WRs) over North America at 1-to-8-week lead times. Each model uses a different predictor from one Earth system component (atmosphere, ocean, or land) sourced from reanalysis. According to the models, the atmosphere provides more predictability during the first two forecast weeks, and the three components performed similarly afterward. However, the skill and sources of predictability are highly dependent on the season and target WR. Our results show greater WR predictability in fall and winter, particularly for the Pacific Trough and Pacific Ridge regimes, driven primarily by the ocean (e.g., El Niño-Southern Oscillation and sea ice). For the Pacific Ridge in winter, the stratosphere also contributes significantly to predictability across most S2S lead times. Additionally, the initial large-scale tropospheric structure (encompassing the tropics and extra-tropics, e.g., Madden-Julian Oscillation) and soil conditions play a relevant role—most notably for the Greenland High regime in winter. This study highlights previously identified sources of predictability for the large-scale atmosphere and gives insight into new sources for future study. Given how closely linked WRs are to surface precipitation and temperature anomalies, storm tracks, and extreme events, the study results contribute to improving S2S prediction of surface weather. 
    more » « less
    Free, publicly-accessible full text available June 19, 2026
  2. Estimating and disentangling epistemic uncertainty, uncertainty that is reducible with more training data, and aleatoric uncertainty, uncertainty that is inherent to the task at hand, is critically important when applying machine learning to highstakes applications such as medical imaging and weather forecasting. Conditional diffusion models’ breakthrough ability to accurately and efficiently sample from the posterior distribution of a dataset now makes uncertainty estimation conceptually straightforward: One need only train and sample from a large ensemble of diffusion models. Unfortunately, training such an ensemble becomes computationally intractable as the complexity of the model architecture grows. In this work we introduce a new approach to ensembling, hyper-diffusion models (HyperDM), which allows one to accurately estimate both epistemic and aleatoric uncertainty with a single model. Unlike existing single-model uncertainty methods like Monte-Carlo dropout and Bayesian neural networks, HyperDM offers prediction accuracy on par with, and in some cases superior to, multi-model ensembles. Furthermore, our proposed approach scales to modern network architectures such as Attention U-Net and yields more accurate uncertainty estimates compared to existing methods. We validate our method on two distinct real-world tasks: x-ray computed tomography reconstruction and weather temperature forecasting. Source code is publicly available at https://github.com/matthewachan/hyperdm. 
    more » « less
    Free, publicly-accessible full text available December 13, 2025
  3. Abstract AI-based algorithms are emerging in many meteorological applications that produce imagery as output, including for global weather forecasting models. However, the imagery produced by AI algorithms, especially by convolutional neural networks (CNNs), is often described as too blurry to look realistic, partly because CNNs tend to represent uncertainty as blurriness. This blurriness can be undesirable since it might obscure important meteorological features. More complex AI models, such as Generative AI models, produce images that appear to be sharper. However, improved sharpness may come at the expense of a decline in other performance criteria, such as standard forecast verification metrics. To navigate any trade-off between sharpness and other performance metrics it is important to quantitatively assess those other metrics along with sharpness. While there is a rich set of forecast verification metrics available for meteorological images, none of them focus on sharpness. This paper seeks to fill this gap by 1) exploring a variety of sharpness metrics from other fields, 2) evaluating properties of these metrics, 3) proposing the new concept of Gaussian Blur Equivalence as a tool for their uniform interpretation, and 4) demonstrating their use for sample meteorological applications, including a CNN that emulates radar imagery from satellite imagery (GREMLIN) and an AI-based global weather forecasting model (GraphCast). 
    more » « less
    Free, publicly-accessible full text available June 9, 2026
  4. Abstract Robust quantification of predictive uncertainty is a critical addition needed for machine learning applied to weather and climate problems to improve the understanding of what is driving prediction sensitivity. Ensembles of machine learning models provide predictive uncertainty estimates in a conceptually simple way but require multiple models for training and prediction, increasing computational cost and latency. Parametric deep learning can estimate uncertainty with one model by predicting the parameters of a probability distribution but does not account for epistemic uncertainty. Evidential deep learning, a technique that extends parametric deep learning to higher-order distributions, can account for both aleatoric and epistemic uncertainties with one model. This study compares the uncertainty derived from evidential neural networks to that obtained from ensembles. Through applications of the classification of winter precipitation type and regression of surface-layer fluxes, we show evidential deep learning models attaining predictive accuracy rivaling standard methods while robustly quantifying both sources of uncertainty. We evaluate the uncertainty in terms of how well the predictions are calibrated and how well the uncertainty correlates with prediction error. Analyses of uncertainty in the context of the inputs reveal sensitivities to underlying meteorological processes, facilitating interpretation of the models. The conceptual simplicity, interpretability, and computational efficiency of evidential neural networks make them highly extensible, offering a promising approach for reliable and practical uncertainty quantification in Earth system science modeling. To encourage broader adoption of evidential deep learning, we have developed a new Python package, Machine Integration and Learning for Earth Systems (MILES) group Generalized Uncertainty for Earth System Science (GUESS) (MILES-GUESS) (https://github.com/ai2es/miles-guess), that enables users to train and evaluate both evidential and ensemble deep learning. Significance StatementThis study demonstrates a new technique, evidential deep learning, for robust and computationally efficient uncertainty quantification in modeling the Earth system. The method integrates probabilistic principles into deep neural networks, enabling the estimation of both aleatoric uncertainty from noisy data and epistemic uncertainty from model limitations using a single model. Our analyses reveal how decomposing these uncertainties provides valuable insights into reliability, accuracy, and model shortcomings. We show that the approach can rival standard methods in classification and regression tasks within atmospheric science while offering practical advantages such as computational efficiency. With further advances, evidential networks have the potential to enhance risk assessment and decision-making across meteorology by improving uncertainty quantification, a longstanding challenge. This work establishes a strong foundation and motivation for the broader adoption of evidential learning, where properly quantifying uncertainties is critical yet lacking. 
    more » « less
  5. In 2021, people of Hispanic and Latinx origin made up 6% of the atmospheric and Earth sciences workforce of the United States, yet they represent 20% of the population. Motivated by this disparity in Hispanic and Latinx representation in the atmospheric and Earth science workforce, this manuscript documents the lack of representation through existing limited demographic data. The analysis presents a clear gap in participation by Hispanic and Latinx people in academic settings, with a widening gap through each education and career stage. Several factors and challenges impacting the representation disparity include the lack of funding for and collaboration with Hispanic-serving institutions, limited opportunities due to immigration status, and limited support for international research collaborations. We highlight the need for actionable steps to address the lack of representation and provide targeted recommendations to federal funding agencies, educational institutions, faculty, and potential employers. While we wait for systemic cultural change from our scientific institutions, grassroots initiatives like those proudly led by the AMS Committee for Hispanic and Latinx Advancement will emerge to address the needs of the Hispanic and Latinx scientific and broader community. We briefly highlight some of those achievements. Lasting cultural change can only happen if our leaders areactiveallies in the creation of a more diverse, equitable, and inclusive future. Alongside our active allies we will continue to champion for change in our weather, water, and climate enterprise. 
    more » « less
  6. Abstract Severe convection occurring in high-shear, low-CAPE (HSLC) environments is a common cool-season threat in the southeastern United States. Previous studies of HSLC convection document the increased operational challenges that these environments present compared to their high-CAPE counterparts, corresponding to higher false-alarm ratios and lower probability of detection for severe watches and warnings. These environments can exhibit rapid destabilization in the hours prior to convection, sometimes associated with the release of potential instability. Here, we use self-organizing maps (SOMs) to objectively identify environmental patterns accompanying HSLC cool-season severe events and associate them with variations in severe weather frequency and distribution. Large-scale patterns exhibit modest variation within the HSLC subclass, featuring strong surface cyclones accompanied by vigorous upper-tropospheric troughs and northward-extending regions of instability, consistent with prior studies. In most patterns, severe weather occurs immediately ahead of a cold front. Other convective ingredients, such as lower-tropospheric vertical wind shear, near-surface equivalent potential temperature (θe) advection, and the release of potential instability, varied more significantly across patterns. No single variable used to train SOMs consistently demonstrated differences in the distribution of severe weather occurrence across patterns. Comparison of SOMs based on upper and lower quartiles of severe occurrence demonstrated that the release of potential instability was most consistently associated with higher-impact events in comparison to other convective ingredients. Overall, we find that previously developed HSLC composite parameters reasonably identify high-impact HSLC events. Significance StatementEven when atmospheric instability is not optimal for severe convective storms, in some situations they can still occur, presenting increased challenges to forecasters. These marginal environments may occur at night or during the cool season, when people are less attuned to severe weather threats. Here, we use a sorting algorithm to classify different weather patterns accompanying such storms, and we distinguish which specific patterns and weather system features are most strongly associated with severe storms. Our goals are to increase situational awareness for forecasters and to improve understanding of the processes leading to severe convection in marginal environments. 
    more » « less
  7. Abstract. The potential for multiyear prediction of impactful Earthsystem change remains relatively underexplored compared to shorter(subseasonal to seasonal) and longer (decadal) timescales. In this study, weintroduce a new initialized prediction system using the Community EarthSystem Model version 2 (CESM2) that is specifically designed to probepotential and actual prediction skill at lead times ranging from 1 month outto 2 years. The Seasonal-to-Multiyear Large Ensemble (SMYLE) consists of acollection of 2-year-long hindcast simulations, with four initializations peryear from 1970 to 2019 and an ensemble size of 20. A full suite of output isavailable for exploring near-term predictability of all Earth systemcomponents represented in CESM2. We show that SMYLE skill for ElNiño–Southern Oscillation is competitive with other prominent seasonalprediction systems, with correlations exceeding 0.5 beyond a lead time of 12months. A broad overview of prediction skill reveals varying degrees ofpotential for useful multiyear predictions of seasonal anomalies in theatmosphere, ocean, land, and sea ice. The SMYLE dataset, experimentaldesign, model, initial conditions, and associated analysis tools are allpublicly available, providing a foundation for research on multiyearprediction of environmental change by the wider community. 
    more » « less