skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on March 1, 2026

Title: Evaluation of 0–6-Hour Forecasts from the Experimental Warn-on-Forecast System and the Hybrid Analysis and Forecast System for Real-Time Cases in 2021
Abstract This study compares real-time forecasts produced by the Warn-on-Forecast System (WoFS) and a hybrid ensemble and variational data assimilation and prediction system (WoF-Hybrid) for 31 events during 2021. Object-based verification is used to quantify and compare strengths and weaknesses of WoFS ensemble forecasts with 3-km horizontal grid spacing and WoF-Hybrid deterministic forecasts with 1.5-km horizontal grid spacing. The goal of such comparison is to provide evidence as to whether WoF-Hybrid has performance characteristics that complement or improve upon those of WoFS. Results indicate that both systems provide similar accuracy for timing and placement of thunderstorm objects identified using simulated reflectivity. WoF-Hybrid provides more accurate forecasts of updraft helicity tracks. Differences in forecast quality are case dependent; the largest difference in accuracy favoring WoF-Hybrid occurs in eight cases identified as “high-impact” by the quantity of National Weather Service Local Storm Reports, while WoFS performance is favored at short lead times for 10 “moderate-” and 13 “low-impact” events. WoF-Hybrid reflectivity objects are closer in size and location to observed objects. However, a higher thunderstorm overprediction bias is identified in WoF-Hybrid, particularly early in the forecast. Two severe weather events are selected for detailed investigation. In the case of 26 May, both systems had similar skill; however, for 10 December, WoF-Hybrid forecasts significantly outperformed WoFS forecasts. These results show improved performance for WoF-Hybrid over WoFS under certain regimes that warrants further investigation. To understand reasons for these differences will help incorporate higher-resolution modeling into Warn-on-Forecast systems. Significance StatementThe NOAA Warn-on-Forecast (WoF) project uses advanced data assimilation for rapidly updating numerical weather prediction systems to provide forecasts of individual thunderstorms. Forecasts show promise for enabling greater warning lead time for some storms. The flagship Warn-on-Forecast System (WoFS) is a 36-member analysis and 18-member forecast system at 3-km grid spacing. The project also produced a single member system that employs variational analysis and produces a deterministic forecast at 1.5-km grid spacing (WoF-Hybrid). This study seeks to evaluate and compare the performance of WoFS and WoF-Hybrid for 31 severe weather events that occurred during 2021. Results found that WoF-Hybrid predicts storm rotation particularly well compared to WoFS, and several other strengths and limitations of both systems are identified. This research may help us understand the complementary nature of two systems and improve our ability to provide more reliable 0–6-h forecasts in the future.  more » « less
Award ID(s):
2136161
PAR ID:
10656708
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  ;  ;  ;  ;  
Publisher / Repository:
American Meteorological Society
Date Published:
Journal Name:
Weather and Forecasting
Volume:
40
Issue:
3
ISSN:
0882-8156
Page Range / eLocation ID:
451 to 469
Subject(s) / Keyword(s):
Forecasting Short-range prediction Data assimilation Model evaluation/performance Numerical weather prediction/forecasting
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Forecasts of tropical cyclone (TC) tornadoes are less skillful than their non‐TC counterparts at all lead times. The development of a convection‐allowing regional ensemble, known as the Warn‐on‐Forecast System (WoFS), may help improve short‐fused TC tornado forecasts. As a first step, this study investigates the fidelity of convective‐scale kinematic and thermodynamic environments to a preliminary set of soundings from WoFS forecasts for comparison with radiosondes for selected 2020 landfalling TCs. Our study shows reasonable agreement between TC convective‐scale kinematic environments in WoFS versus observed soundings at all forecast lead times. Nonetheless, WoFS is biased toward weaker than observed TC‐relative radial winds, and stronger than observed near‐surface tangential winds with weaker winds aloft, during the forecast. Analysis of storm‐relative helicity (SRH) shows that WoFS underestimates extreme observed values. Convective‐scale thermodynamic environments are well simulated for both temperature and dewpoint at all lead times. However, WoFS is biased moister with steeper lapse rates compared to observations during the forecast. Both CAPE and, to a lesser extent, 0–3‐km CAPE distributions are narrower in WoFS than in radiosondes, with an underestimation of higher CAPE values. Together, these results suggest that WoFS may have utility for forecasting convective‐scale environments in landfalling TCs with lead times of several hours. 
    more » « less
  2. Abstract Observational data collection is extremely hazardous in supercell storm environments, which makes for a scarcity of data used for evaluating the storm-scale guidance from convection allowing models (CAMs) like the National Oceanic and Atmospheric Administration (NOAA) Warn-on-Forecast System (WoFS). The Targeted Observations with UAS and Radar of Supercells (TORUS) 2019 field mission provided a rare opportunity to not only collect these observations, but to do so with advanced technology: vertically pointing Doppler lidar. One standing question for WoFS is how the system forecasts the feedback between supercells and their near-storm environment. The lidar can observe vertical profiles of wind over time, creating unique datasets to compare to WoFS kinematic predictions in rapidly evolving severe weather environments. Mobile radiosonde data are also presented to provide a thermodynamic comparison. The five lidar deployments (three of which observed tornadic supercells) analyzed show WoFS accurately predicted general kinematic trends in the inflow environment; however, the predicted feedback between the supercell and its environment, which resulted in enhanced inflow and larger storm-relative helicity (SRH), were muted relative to observations. The radiosonde observations reveal an overprediction of CAPE in WoFS forecasts, both in the near and far field, with an inverse relationship between the CAPE errors and distance from the storm. Significance Statement It is difficult to evaluate the accuracy of weather prediction model forecasts of severe thunderstorms because observations are rarely available near the storms. However, the TORUS 2019 field experiment collected multiple specialized observations in the near-storm environment of supercells, which are compared to the same near-storm environments predicted by the National Oceanic and Atmospheric Administration (NOAA) Warn-on-Forecast System (WoFS) to gauge its performance. Unique to this study is the use of mobile Doppler lidar observations in the evaluation; lidar can retrieve the horizontal winds in the few kilometers above ground on time scales of a few minutes. Using lidar and radiosonde observations in the near-storm environment of three tornadic supercells, we find that WoFS generally predicts the expected trends in the evolution of the near-storm wind profile, but the response is muted compared to observations. We also find an inverse relationship of errors in instability to distance from the storm. These results can aid model developers in refining model physics to better predict severe storms. 
    more » « less
  3. Abstract A primary goal of the National Oceanic and Atmospheric Administration Warn-on-Forecast (WoF) project is to provide rapidly updating probabilistic guidance to human forecasters for short-term (e.g., 0–3 h) severe weather forecasts. Postprocessing is required to maximize the usefulness of probabilistic guidance from an ensemble of convection-allowing model forecasts. Machine learning (ML) models have become popular methods for postprocessing severe weather guidance since they can leverage numerous variables to discover useful patterns in complex datasets. In this study, we develop and evaluate a series of ML models to produce calibrated, probabilistic severe weather guidance from WoF System (WoFS) output. Our dataset includes WoFS ensemble forecasts available every 5 min out to 150 min of lead time from the 2017–19 NOAA Hazardous Weather Testbed Spring Forecasting Experiments (81 dates). Using a novel ensemble storm-track identification method, we extracted three sets of predictors from the WoFS forecasts: intrastorm state variables, near-storm environment variables, and morphological attributes of the ensemble storm tracks. We then trained random forests, gradient-boosted trees, and logistic regression algorithms to predict which WoFS 30-min ensemble storm tracks will overlap a tornado, severe hail, and/or severe wind report. To provide rigorous baselines against which to evaluate the skill of the ML models, we extracted the ensemble probabilities of hazard-relevant WoFS variables exceeding tuned thresholds from each ensemble storm track. The three ML algorithms discriminated well for all three hazards and produced more reliable probabilities than the baseline predictions. Overall, the results suggest that ML-based postprocessing of dynamical ensemble output can improve short-term, storm-scale severe weather probabilistic guidance. 
    more » « less
  4. Abstract Hailstorms cause billions of dollars in damage across the United States each year. Part of this cost could be reduced by increasing warning lead times. To contribute to this effort, we developed a nowcasting machine learning model that uses a 3D U-Net to produce gridded severe hail nowcasts for up to 40 min in advance. The three U-Net dimensions uniquely incorporate one temporal and two spatial dimensions. Our predictors consist of a combination of output from the National Severe Storms Laboratory Warn-on-Forecast System (WoFS) numerical weather prediction ensemble and remote sensing observations from Vaisala’s National Lightning Detection Network (NLDN). Ground truth for prediction was derived from the maximum expected size of hail calculated from the gridded NEXRAD WSR-88D radar (GridRad) dataset. Our U-Net was evaluated by comparing its test set performance against rigorous hail nowcasting baselines. These baselines included WoFS ensemble Hail and Cloud Growth Model (HAILCAST) and a logistic regression model trained on WoFS 2–5-km updraft helicity. The 3D U-Net outperformed both these baselines for all forecast period time steps. Its predictions yielded a neighborhood maximum critical success index (max CSI) of ∼0.48 and ∼0.30 at forecast minutes 20 and 40, respectively. These max CSIs exceeded the ensemble HAILCAST max CSIs by as much as ∼0.35. The NLDN observations were found to increase the U-Net performance by more than a factor of 4 at some time steps. This system has shown success when nowcasting hail during complex severe weather events, and if used in an operational environment, may prove valuable. 
    more » « less
  5. Abstract Assimilating radar reflectivity into convective-scale NWP models remains a challenging topic in radar data assimilation. A primary reason is that the reflectivity forward observation operator is highly nonlinear. To address this challenge, a power transformation function is applied to the WRF Model’s hydrometeor and water vapor mixing ratio variables in this study. Three 3D variational data assimilation experiments are performed and compared for five high-impact weather events that occurred in 2019: (i) a control experiment that assimilates reflectivity using the original hydrometeor mixing ratios as control variables, (ii) an experiment that assimilates reflectivity using power-transformed hydrometeor mixing ratios as control variables, and (iii) an experiment that assimilates reflectivity and retrieved pseudo–water vapor observations using power-transformed hydrometeor and water vapor mixing ratios (qυ) as control variables. Both qualitative and quantitative evaluations are performed for 0–3-h forecasts from the five cases. The analysis and forecast performance in the two experiments with power-transformed mixing ratios is better than the control experiment. Notably, the assimilation of pseudo–water vapor with power-transformedqυas an additional control variable is found to improve the performance of the analysis and short-term forecasts for all cases. In addition, the convergence rate of the cost function minimization for the two experiments that use the power transformation is faster than that of the control experiments. Significance StatementThe effective use of radar reflectivity observations in any data assimilation scheme remains an important research topic because reflectivity observations explicitly include information about hydrometeors and also implicitly include information about the distribution of moisture within storms. However, it is difficult to assimilate reflectivity because the reflectivity forward observation operator is highly nonlinear. This study seeks to identify a more effective way to assimilate reflectivity into a convective-scale NWP model to improve the accuracy of predictions of high-impact weather events. 
    more » « less