Search for: All records

Creators/Authors contains: "Poterjoy, Jonathan"

« Prev Next »

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Innovation-Based Methods for Online Estimates of Observation Error Variances within Ensemble Data Assimilation Cycles

https://doi.org/10.1175/MWR-D-24-0242.1

Santer, Henry; Poterjoy, Jonathan; El_Gharamti, Mohamad (September 2025, Monthly Weather Review)

Abstract Many data assimilation methods require knowledge of the first two moments of the background and observation errors to function optimally. To ensure the effective performance of such methods, it is often advantageous to estimate the second moment of the observation errors directly. We examine three different strategies for doing so, focusing specifically on the case of a single scalar observation error variance parameterr. The first method is the well-known Desroziers et al. “diagnostic check” iteration (DBCP). The second method, described in Karspeck, adapts the “spread–error” diagnostic—used for assessing ensemble reliability—to observations and generates a point estimate ofrby taking the expectation of various observation-space statistics and using an ensemble to model background error statistics explicitly. The third method is an approximate Bayesian scheme that uses an inverse-gamma prior and a modified Gaussian likelihood. All three methods can recover the correct observation error variance when both the background and observation errors are Gaussian and the background error variance is well specified. We also demonstrate that it is often possible to estimatereven when the observation error is not Gaussian or when the forward operator mapping model states into observation space is nonlinear. The DBCP method is found to be most robust to these complications; however, the other two methods perform similarly well in most cases and have the added benefit that they can be used to estimaterbefore data assimilation. We conclude that further investigation is warranted into the latter two methods, specifically into how they perform when extended to the multivariate case. Significance StatementObservations of the Earth system (e.g., from satellites, radiosondes, aircraft, etc.,) each have some associated uncertainty. To use observations to improve model forecasts, it is important to understand the size of that uncertainty. This study compares three statistical methods for estimating observation errors, all of which can be continuously implemented whenever new observations are used to correct a model. Our results suggest that all three methods can improve forecast outcomes, but that, if observations are believed to have highly biased or skewed errors, care should be taken in choosing which to use and interpreting its results. Future studies should investigate robust methods for estimating more complicated types of errors.
more » « less
Free, publicly-accessible full text available September 1, 2026
Optimizing Numerical Weather Prediction Utility of the Maryland Mesonet with Observing System Simulation Experiments

https://doi.org/10.1175/WAF-D-24-0089.1

McCurry, Joshua; Poterjoy, Jonathan (December 2024, Weather and Forecasting)

Abstract The Maryland Mesonet project will construct a network of 75 surface observing stations with aims that include mitigating the statewide impact of severe convective storms and improving analyses of records. The spatial configuration of mesonet stations is expected to affect the utility newly provided observations will have via data assimilation, making it desirable to study the effects of mesonet configuration. Furthermore, the impact associated with any observing system configuration is constrained by errors inherent to the prediction systems used to generate forecasts, which may change with future advances in data assimilation methodology, physical parameterization schemes, and resource availability. To address such possibilities, we perform sets of observing system simulation experiments using a high-resolution regional modeling system to assess the expected impact of four candidate mesonet configurations. Experiments cover seven 18-h case study events featuring moist convective regimes associated with severe weather over the state of Maryland and are performed using two versions of our experimental modeling system: a “standard-uncertainty” configuration tuned to be representative of existing convective-allowing prediction systems and a “constrained-uncertainty” configuration with reduced boundary condition and model error that reflects a possible trajectory for future prediction systems. We find that the assimilation of mesonet data produces definitive improvements to analysis fields below 1000 m that are mediated by modeling system uncertainty. Conversely, mesonet impact on forecast verification is inconclusive and strongly variable across verification metrics. The impact of mesonet configuration appears limited by a saturation effect that caps local analysis improvements past a minimal density of observing stations. Significance StatementThe Maryland Mesonet project will construct 75 surface observing stations to improve the analysis of records for Maryland’s surface weather conditions as well as predictions for severe weather events. The spatial placement of sensors is expected to influence the utility of a mesonet, making it desirable to optimize mesonet layouts. The utility provided by a mesonet may also be impacted by errors in prediction systems used to generate analyses and forecasts, which are themselves subject to change given future advances in prediction methods and resources. This study uses observing system simulation experiments (OSSEs)—which comprehensively simulate numerical weather prediction for a known “truth state” —to characterize improvement we may expect from mesonet observations and evaluate four potential mesonet configurations.
more » « less
Full Text Available
An Evaluation of Non-Gaussian Data Assimilation Methods in Moist Convective Regimes

https://doi.org/10.1175/MWR-D-22-0260.1

McCurry, Joshua; Poterjoy, Jonathan; Knopfmeier, Kent; Wicker, Louis (July 2023, Monthly Weather Review)

Abstract Obtaining a faithful probabilistic depiction of moist convection is complicated by unknown errors in subgrid-scale physical parameterization schemes, invalid assumptions made by data assimilation (DA) techniques, and high system dimensionality. As an initial step toward untangling sources of uncertainty in convective weather regimes, we evaluate a novel Bayesian data assimilation methodology based on particle filtering within a WRF ensemble analysis and forecasting system. Unlike most geophysical DA methods, the particle filter (PF) represents prior and posterior error distributions nonparametrically rather than assuming a Gaussian distribution and can accept any type of likelihood function. This approach is known to reduce bias introduced by Gaussian approximations in low-dimensional and idealized contexts. The form of PF used in this research adopts a dimension-reduction strategy, making it affordable for typical weather applications. The present study examines posterior ensemble members and forecasts for select severe weather events between 2019 and 2020, comparing results from the PF with those from an ensemble Kalman filter (EnKF). We find that assimilating with a PF produces posterior quantities for microphysical variables that are more consistent with model climatology than comparable quantities from an EnKF, which we attribute to a reduction in DA bias. These differences are significant enough to impact the dynamic evolution of convective systems via cold pool strength and propagation, with impacts to forecast verification scores depending on the particular microphysics scheme. Our findings have broad implications for future approaches to the selection of physical parameterization schemes and parameter estimation within preexisting data assimilation frameworks. Significance StatementThe accurate prediction of severe storms using numerical weather models depends on effective parameterization schemes for small-scale processes and the assimilation of incomplete observational data in a manner that faithfully represents the probabilistic state of the atmosphere. Current generation methods for data assimilation typically assume a standard form for the error distributions of relevant quantities, which can introduce bias that not only hinders numerical prediction, but that can also confound the characterization of errors from the model itself. The current study performs data assimilation using a novel method that does not make such assumptions and explores characteristics of resulting model fields and forecasts that might make such a method useful for improving model parameterization schemes.
more » « less
Full Text Available
A Statistical Hypothesis Testing Strategy for Adaptively Blending Particle Filters and Ensemble Kalman Filters for Data Assimilation

https://doi.org/10.1175/MWR-D-22-0108.1

Kurosawa, Kenta; Poterjoy, Jonathan (January 2023, Monthly Weather Review)

Abstract Particle filters avoid parametric estimates for Bayesian posterior densities, which alleviates Gaussian assumptions in nonlinear regimes. These methods, however, are more sensitive to sampling errors than Gaussian-based techniques such as ensemble Kalman filters. A recent study by the authors introduced an iterative strategy for particle filters that match posterior moments—where iterations improve the filter’s ability to draw samples from non-Gaussian posterior densities. The iterations follow from a factorization of particle weights, providing a natural framework for combining particle filters with alternative filters to mitigate the impact of sampling errors. The current study introduces a novel approach to forming an adaptive hybrid data assimilation methodology, exploiting the theoretical strengths of nonparametric and parametric filters. At each data assimilation cycle, the iterative particle filter performs a sequence of updates while the prior sample distribution is non-Gaussian, then an ensemble Kalman filter provides the final adjustment when Gaussian distributions for marginal quantities are detected. The method employs the Shapiro–Wilk test to determine when to make the transition between filter algorithms, which has outstanding power for detecting departures from normality. Experiments using low-dimensional models demonstrate that the approach has a significant value, especially for nonhomogeneous observation networks and unknown model process errors. Moreover, hybrid factors are extended to consider marginals of more than one collocated variables using a test for multivariate normality. Findings from this study motivate the use of the proposed method for geophysical problems characterized by diverse observation networks and various dynamic instabilities, such as numerical weather prediction models. Significance Statement Data assimilation statistically processes observation errors and model forecast errors to provide optimal initial conditions for the forecast, playing a critical role in numerical weather forecasting. The ensemble Kalman filter, which has been widely adopted and developed in many operational centers, assumes Gaussianity of the prior distribution and solves a linear system of equations, leading to bias in strong nonlinear regimes. On the other hand, particle filters avoid many of those assumptions but are sensitive to sampling errors and are computationally expensive. We propose an adaptive hybrid strategy that combines their advantages and minimizes the disadvantages of the two methods. The hybrid particle filter–ensemble Kalman filter is achieved with the Shapiro–Wilk test to detect the Gaussianity of the ensemble members and determine the timing of the transition between these filter updates. Demonstrations in this study show that the proposed method is advantageous when observations are heterogeneous and when the model has an unknown bias. Furthermore, by extending the statistical hypothesis test to the test for multivariate normality, we consider marginals of more than one collocated variable. These results encourage further testing for real geophysical problems characterized by various dynamic instabilities, such as real numerical weather prediction models.
more » « less
Full Text Available
Implications of Multivariate Non-Gaussian Data Assimilation for Multi-scale Weather Prediction

https://doi.org/10.1175/MWR-D-21-0228.1

Poterjoy, Jonathan (March 2022, Monthly Weather Review)

Abstract Weather prediction models currently operate within a probabilistic framework for generating forecasts conditioned on recent measurements of Earth’s atmosphere. This framework can be conceptualized as one that approximates parts of a Bayesian posterior density estimated under assumptions of Gaussian errors. Gaussian error approximations are appropriate for synoptic-scale atmospheric flow, which experiences quasi-linear error evolution over time scales depicted by measurements, but are often hypothesized to be inappropriate for highly nonlinear, sparsely-observed mesoscale processes. The current study adopts an experimental regional modeling system to examine the impact of Gaussian prior error approximations, which are adopted by ensemble Kalman filters (EnKFs) to generate probabilistic predictions. The analysis is aided by results obtained using recently-introduced particle filter (PF) methodology that relies on an implicit non-parametric representation of prior probability densities—but with added computational expense. The investigation focuses on EnKF and PF comparisons over month-long experiments performed using an extensive domain, which features the development and passage of numerous extratropical and tropical cyclones. The experiments reveal spurious small-scale corrections in EnKF members, which come about from inappropriate Gaussian approximations for priors dominated by alignment uncertainty in mesoscale weather systems. Similar behavior is found in PF members, owing to the use of a localization operator, but to a much lesser extent. This result is reproduced and studied using a low-dimensional model, which permits the use of large sample estimates of the Bayesian posterior distribution. Findings from this study motivate the use of data assimilation techniques that provide a more appropriate specification of multivariate non-Gaussian prior densities or a multi-scale treatment of alignment errors during data assimilation.
more » « less
Full Text Available
Regularization and tempering for a moment‐matching localized particle filter

https://doi.org/10.1002/qj.4328

Poterjoy, Jonathan (June 2022, Quarterly Journal of the Royal Meteorological Society)

Abstract Iterative ensemble filters and smoothers are now commonly used for geophysical models. Some of these methods rely on a factorization of the observation likelihood function to sample from a posterior density through a set of “tempered” transitions to ensemble members. For Gaussian‐based data assimilation methods, tangent linear versions of nonlinear operators can be relinearized between iterations, thus leading to a solution that is less biased than a single‐step approach. This study adopts similar iterative strategies for a localized particle filter (PF) that relies on the estimation of moments to adjust unobserved variables based on importance weights. This approach builds off a “regularization” of the local PF, which forces weights to be more uniform through heuristic means. The regularization then leads to an adaptive tempering, which can also be combined with filter updates from parametric methods, such as ensemble Kalman filters. The role of iterations is analyzed by deriving the localized posterior probability density assumed by current local PF formulations and then examining how single‐step and tempered PFs sample from this density. From experiments performed with a low‐dimensional nonlinear system, the iterative and hybrid strategies show the largest benefits in observation‐sparse regimes, where only a few particles contain high likelihoods and prior errors are non‐Gaussian. This regime mimics specific applications in numerical weather prediction, where small ensemble sizes, unresolved model error, and highly nonlinear dynamics lead to prior uncertainty that is larger than measurement uncertainty.
more » « less
Challenges for Inline Observation Error Estimation in the Presence of Misspecified Background Uncertainty

https://doi.org/10.1175/MWR-D-22-0298.1

Walsworth, Andrew; Poterjoy, Jonathan; Satterfield, Elizabeth (September 2023, Monthly Weather Review)

Abstract For data assimilation to provide faithful state estimates for dynamical models, specifications of observation uncertainty need to be as accurate as possible. Innovation-based methods based on Desroziers diagnostics, are commonly used to estimate observation uncertainty, but such methods can depend greatly on the prescribed background uncertainty. For ensemble data assimilation, this uncertainty comes from statistics calculated from ensemble forecasts, which require inflation and localization to address under sampling. In this work, we use an ensemble Kalman filter (EnKF) with a low-dimensional Lorenz model to investigate the interplay between the Desroziers method and inflation. Two inflation techniques are used for this purpose: 1) a rigorously tuned fixed multiplicative scheme and 2) an adaptive state-space scheme. We document how inaccuracies in observation uncertainty affect errors in EnKF posteriors and study the combined impacts of misspecified initial observation uncertainty, sampling error, and model error on Desroziers estimates. We find that whether observation uncertainty is over- or underestimated greatly affects the stability of data assimilation and the accuracy of Desroziers estimates and that preference should be given to initial overestimates. Inline estimates of Desroziers tend to remove the dependence between ensemble spread–skill and the initially prescribed observation error. In addition, we find that the inclusion of model error introduces spurious correlations in observation uncertainty estimates. Further, we note that the adaptive inflation scheme is less robust than fixed inflation at mitigating multiple sources of error. Last, sampling error strongly exacerbates existing sources of error and greatly degrades EnKF estimates, which translates into biased Desroziers estimates of observation error covariance. Significance StatementTo generate accurate predictions of various components of the Earth system, numerical models require an accurate specification of state variables at our current time. This step adopts a probabilistic consideration of our current state estimate versus information provided from environmental measurements of the true state. Various strategies exist for estimating uncertainty in observations within this framework, but are sensitive to a host of assumptions, which are investigated in this study.
more » « less
Evaluating Contour Band Depth as a Method for Understanding Ensemble Uncertainty

https://doi.org/10.1175/MWR-D-22-0281.1

Santer, Henry; Poterjoy, Jonathan; McCurry, Joshua (August 2023, Monthly Weather Review)

Abstract Estimating and predicting the state of the atmosphere is a probabilistic problem for which an ensemble modeling approach often is taken to represent uncertainty in the system. Common methods for examining uncertainty and assessing performance for ensembles emphasize pointwise statistics or marginal distributions. However, these methods lose specific information about individual ensemble members. This paper explores contour band depth (cBD), a method of analyzing uncertainty in terms of contours of scalar fields. cBD is fully nonparametric and induces an ordering on ensemble members that leads to box-and-whisker-plot-type visualizations of uncertainty for two-dimensional data. By applying cBD to synthetic ensembles, we demonstrate that it provides enhanced information about the spatial structure of ensemble uncertainty. We also find that the usefulness of the cBD analysis depends on the presence of multiple modes and multiple scales in the ensemble of contours. Finally, we apply cBD to compare various convection-permitting forecasts from different ensemble prediction systems and find that the value it provides in real-world applications compared to standard analysis methods exhibits clear limitations. In some cases, contour boxplots can provide deeper insight into differences in spatial characteristics between the different ensemble forecasts. Nevertheless, identification of outliers using cBD is not always intuitive, and the method can be especially challenging to implement for flow that exhibits multiple spatial scales (e.g., discrete convective cells embedded within a mesoscale weather system). Significance StatementPredictions of Earth’s atmosphere inherently come with some degree of uncertainty owing to incomplete observations and the chaotic nature of the system. Understanding that uncertainty is critical when drawing scientific conclusions or making policy decisions from model predictions. In this study, we explore a method for describing model uncertainty when the quantities of interest are well represented by contours. The method yields a quantitative visualization of uncertainty in both the location and the shape of contours to an extent that is not possible with standard uncertainty quantification methods and may eventually prove useful for the development of more robust techniques for evaluating and validating numerical weather models.
more » « less