skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: The graft‐versus‐host problem for data‐driven gravity‐wave parameterizations in a one‐dimensional quasibiennial oscillation model
Abstract Two key challenges in the development of data‐driven gravity‐wave parameterizations are generalization, how to ensure that a data‐driven scheme trained on the present‐day climate will continue to work in a new climate regime, and calibration, how to account for biases in the “host” climate model. Both problems depend fundamentally on the response to out‐of‐sample inputs compared with the training dataset, and are often conflicting. The ability to generalize to new climate regimes often goes hand in hand with sensitivity to model biases. To probe these challenges, we employ a one‐dimensional (1D) quasibiennial oscillation (QBO) model with a stochastic source term that represents convectively generated gravity waves in the Tropics with randomly varying strengths and spectra. We employ an array of machine‐learning models consisting of a fully connected feed‐forward neural network, a dilated convolutional neural network, an encoder–decoder, a boosted forest, and a support‐vector regression model. Our results demonstrate that data‐driven schemes trained on “observations” can be critically sensitive to model biases in the wave sources. While able to emulate accurately the stochastic source term on which they were trained, all of our schemes fail to simulate fully the expected QBO period or amplitude, even with the slightest perturbation to the wave sources. The main takeaway is that some measures will always be required to ensure the proper response to climate change and to account for model biases. We examine one approach based on the ideas of optimal transport, where the wave sources in the model are first remapped to the observed one before applying the data‐driven scheme. This approach is agnostic to the data‐driven method and guarantees that the model adheres to the observational constraints, making sure the model yields the right results for the right reasons.  more » « less
Award ID(s):
2004572
PAR ID:
10553495
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
Royal Meteorological Society
Date Published:
Journal Name:
Quarterly Journal of the Royal Meteorological Society
Volume:
150
Issue:
761
ISSN:
0035-9009
Page Range / eLocation ID:
2255 to 2272
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract We present single‐column gravity wave parameterizations (GWPs) that use machine learning to emulate non‐orographic gravity wave (GW) drag and demonstrate their ability to generalize out‐of‐sample. A set of artificial neural networks (ANNs) are trained to emulate the momentum forcing from a conventional GWP in an idealized climate model, given only one view of the annual cycle and one phase of the Quasi‐Biennial Oscillation (QBO). We investigate the sensitivity of offline and online performance to the choice of input variables and complexity of the ANN. When coupled with the model, moderately complex ANNs accurately generate full cycles of the QBO. When the model is forced with enhanced CO2, its climate response with the ANN matches that generated with the physics‐based GWP. That ANNs can accurately emulate an existing scheme and generalize to new regimes given limited data suggests the potential for developing GWPs from observational estimates of GW momentum transport. 
    more » « less
  2. Abstract Tropical gravity waves that are generated by convection are generally too small in scale and too high in frequency to be resolved in global climate models, yet their drag forces drive the important global‐scale quasi‐biennial oscillation (QBO) in the lower stratosphere, and models rely on parameterizations of gravity wave drag to simulate the QBO. We compare detailed properties of tropical parameterized gravity waves in the Whole Atmosphere Community Climate Model version 6 (WACCM6) with gravity waves observed by long‐duration superpressure balloons and also compare properties of parameterized convective latent heating with satellite data. Similarities and differences suggest that the WACCM6 parameterizations are excellent tools for representing tropical gravity waves, but the results also suggest detailed changes to the gravity wave parameterization tuning parameter assumptions that would bring the parameterized waves into much better agreement with observations. While WACCM6 currently includes only nonstationary gravity waves from convection, adding gravity waves generated by the steady component of the heating that are stationary relative to moving convective rain cells is likely to improve the simulation of the QBO in the model. The suggested changes have the potential to alleviate common biases in simulated QBO circulations in models. 
    more » « less
  3. Abstract Recent years have seen a surge in interest in building deep learning-based fully data-driven models for weather prediction. Such deep learning models, if trained on observations can mitigate certain biases in current state-of-the-art weather models, some of which stem from inaccurate representation of subgrid-scale processes. However, these data-driven models, being over-parameterized, require a lot of training data which may not be available from reanalysis (observational data) products. Moreover, an accurate, noise-free, initial condition to start forecasting with a data-driven weather model is not available in realistic scenarios. Finally, deterministic data-driven forecasting models suffer from issues with long-term stability and unphysical climate drift, which makes these data-driven models unsuitable for computing climate statistics. Given these challenges, previous studies have tried to pre-train deep learning-based weather forecasting models on a large amount of imperfect long-term climate model simulations and then re-train them on available observational data. In this article, we propose a convolutional variational autoencoder (VAE)-based stochastic data-driven model that is pre-trained on an imperfect climate model simulation from a two-layer quasi-geostrophic flow and re-trained, using transfer learning, on a small number of noisy observations from a perfect simulation. This re-trained model then performs stochastic forecasting with a noisy initial condition sampled from the perfect simulation. We show that our ensemble-based stochastic data-driven model outperforms a baseline deterministic encoder–decoder-based convolutional model in terms of short-term skills, while remaining stable for long-term climate simulations yielding accurate climatology. 
    more » « less
  4. Abstract We train random and boosted forests, two machine learning architectures based on regression trees, to emulate a physics‐based parameterization of atmospheric gravity wave momentum transport. We compare the forests to a neural network benchmark, evaluating both offline errors and online performance when coupled to an atmospheric model under the present day climate and in 800 and 1,200 ppm CO2global warming scenarios. Offline, the boosted forest exhibits similar skill to the neural network, while the random forest scores significantly lower. Both forest models couple stably to the atmospheric model, and control climate integrations with the boosted forest exhibit lower biases than those with the neural network. Integrations with all three data‐driven emulators successfully capture the Quasi‐Biennial Oscillation (QBO) and sudden stratospheric warmings, key modes of stratospheric variability, with the boosted forest more accurate than the random forest in replicating their statistics across our range of carbon dioxide perturbations. The boosted forest and neural network capture the sign of the QBO period response to increased CO2, though both struggle with the magnitude of this response under the more extreme 1,200 ppm scenario. To investigate the connection between performance in the control climate and the ability to generalize, we use techniques from interpretable machine learning to understand how the data‐driven methods use physical information. We leverage this understanding to develop a retraining procedure that improves the coupled performance of the boosted forest in the control climate and under the 800 ppm CO2scenario. 
    more » « less
  5. Abstract An intermediate complexity moist general circulation model is used to investigate the sensitivity of the quasi‐biennial oscillation (QBO) to resolution, diffusion, tropical tropospheric waves, and parameterized gravity waves. Finer horizontal resolution is shown to lead to a shorter period, while finer vertical resolution is shown to lead to a longer period and to a larger amplitude in the lowermost stratosphere. More scale‐selective diffusion leads to a faster and stronger QBO, while enhancing the sources of tropospheric stationary wave activity leads to a weaker QBO. In terms of parameterized gravity waves, broadening the spectral width of the source function leads to a longer period and a stronger amplitude although the amplitude effect saturates in the mid‐stratosphere when the half‐width exceedsm/s. A stronger gravity wave source stress leads to a faster and stronger QBO, and a higher gravity wave launch level leads to a stronger QBO. All of these sensitivities are shown to result from their impact on the resultant wave‐driven momentum torque in the tropical stratosphere. Atmospheric models have struggled to accurately represent the QBO, particularly at moderate resolutions ideal for long climate integrations. In particular, capturing the amplitude and penetration of QBO anomalies into the lower stratosphere (which has been shown to be critical for the tropospheric impacts) has proven a challenge. The results provide a recipe to generate and/or improve the simulation of the QBO in an atmospheric model. 
    more » « less