Abstract Subgrid‐scale processes, such as atmospheric gravity waves (GWs), play a pivotal role in shaping the Earth's climate but cannot be explicitly resolved in climate models due to limitations on resolution. Instead, subgrid‐scale parameterizations are used to capture their effects. Recently, machine learning (ML) has emerged as a promising approach to learn parameterizations. In this study, we explore uncertainties associated with an ML parameterization for atmospheric GWs. Focusing on the uncertainties in the training process (parametric uncertainty), we use an ensemble of neural networks (NNs) to emulate an existing GW parameterization. We estimate both offline uncertainties in raw NN output and online uncertainties in climate model output after the neural networks are coupled. We find that online parametric uncertainty is a significant source of uncertainty in climate model output that must be considered when introducing NN parameterizations. This uncertainty quantification provides valuable insights into the reliability and robustness of ML‐based GW parameterizations, thus advancing our understanding of their potential applications in climate modeling.
Non‐Local Parameterization of Atmospheric Subgrid Processes With Neural Networks
Abstract Subgrid processes in global climate models are represented by parameterizations, which are a major source of uncertainty in simulations of climate. In recent years, it has been suggested that machine‐learning (ML) parameterizations based on high‐resolution model output data could be superior to traditional parameterizations. Currently, both traditional and ML parameterizations of subgrid processes in the atmosphere are based on a single‐column approach, which uses information only from single atmospheric columns. However, single‐column parameterizations might not be ideal since certain atmospheric phenomena, such as organized convective systems, can cross multiple grid boxes and involve slantwise circulations that are not purely vertical. Here we train neural networks (NNs) using non‐local inputs spanning 3 × 3 columns. We find that including the non‐local inputs improves the offline prediction of a range of subgrid processes. The improvement is especially notable for subgrid momentum transport and for atmospheric conditions associated with mid‐latitude fronts and convective instability. Using an interpretability method, we find that the NN improvements partly rely on using the horizontal wind divergence, and we further show that including the divergence or vertical velocity as a separate input substantially improves offline performance. However, non‐local winds continue to be useful inputs for parameterizing subgrid momentum transport even when the vertical velocity is included as an input. Overall, our results imply that the use of non‐local variables and the vertical velocity as inputs could improve the performance of ML parameterizations, and the use of these inputs should be tested in online simulations in future work.
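As a rough illustration of what a non‐local input spanning 3 × 3 columns could look like, the sketch below concatenates the profiles of a target column and its eight neighbors into one input vector. It is not the authors' code: the function name `nonlocal_inputs`, the nested-list grid layout, the periodic wrap in longitude, and the clamping at the latitude edges are all assumptions made for the example.

```python
from typing import List

Column = List[float]        # one vertical profile (values on model levels)
Grid = List[List[Column]]   # grid[j][i]: column at latitude index j, longitude index i


def nonlocal_inputs(grid: Grid, j: int, i: int) -> List[float]:
    """Concatenate the 3 x 3 block of columns centered on (j, i).

    Assumptions (hypothetical, not from the abstract): longitude is
    periodic, and latitude indices are clamped at the domain edges.
    """
    n_lat = len(grid)
    n_lon = len(grid[0])
    features: List[float] = []
    for dj in (-1, 0, 1):
        jj = min(max(j + dj, 0), n_lat - 1)  # clamp at the latitude edges
        for di in (-1, 0, 1):
            ii = (i + di) % n_lon            # wrap around in longitude
            features.extend(grid[jj][ii])
    return features


# Toy 3 x 4 grid with 2 levels per column; each value encodes its position.
toy = [[[100.0 * j + 10.0 * i + k for k in range(2)]
        for i in range(4)] for j in range(3)]

single = toy[1][1]                     # single-column input: 2 values
stacked = nonlocal_inputs(toy, 1, 1)   # non-local input: 9 columns x 2 levels
```

In this layout the single‐column input is the middle slice of the stacked vector, so a network given the stacked vector sees everything the single‐column network sees plus the neighboring columns.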
- Award ID(s): 1906719
- PAR ID: 10376185
- Publisher / Repository: DOI PREFIX: 10.1029
- Date Published:
- Journal Name: Journal of Advances in Modeling Earth Systems
- Volume: 14
- Issue: 10
- ISSN: 1942-2466
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
-
Abstract This work integrates machine learning into an atmospheric parameterization to target uncertain mixing processes while maintaining interpretable, predictive, and well‐established physical equations. We adopt an eddy‐diffusivity mass‐flux (EDMF) parameterization for the unified modeling of various convective and turbulent regimes. To avoid drift and instability that plague offline‐trained machine learning parameterizations that are subsequently coupled with climate models, we frame learning as an inverse problem: Data‐driven models are embedded within the EDMF parameterization and trained online in a one‐dimensional vertical global climate model (GCM) column. Training is performed against output from large‐eddy simulations (LES) forced with GCM‐simulated large‐scale conditions in the Pacific. Rather than optimizing subgrid‐scale tendencies, our framework directly targets climate variables of interest, such as the vertical profiles of entropy and liquid water path. Specifically, we use ensemble Kalman inversion to simultaneously calibrate both the EDMF parameters and the parameters governing data‐driven lateral mixing rates. The calibrated parameterization outperforms existing EDMF schemes, particularly in tropical and subtropical locations of the present climate, and maintains high fidelity in simulating shallow cumulus and stratocumulus regimes under increased sea surface temperatures from AMIP4K experiments. The results showcase the advantage of physically constraining data‐driven models and directly targeting relevant variables through online learning to build robust and stable machine learning parameterizations.
-
Abstract We present single‐column gravity wave parameterizations (GWPs) that use machine learning to emulate non‐orographic gravity wave (GW) drag and demonstrate their ability to generalize out‐of‐sample. A set of artificial neural networks (ANNs) is trained to emulate the momentum forcing from a conventional GWP in an idealized climate model, given only one view of the annual cycle and one phase of the Quasi‐Biennial Oscillation (QBO). We investigate the sensitivity of offline and online performance to the choice of input variables and complexity of the ANN. When coupled with the model, moderately complex ANNs accurately generate full cycles of the QBO. When the model is forced with enhanced CO2, its climate response with the ANN matches that generated with the physics‐based GWP. That ANNs can accurately emulate an existing scheme and generalize to new regimes given limited data suggests the potential for developing GWPs from observational estimates of GW momentum transport.
-
Abstract. There has been a growing concern that most climate models predict precipitation that is too frequent, likely due to lack of reliable subgrid variability and vertical variations in microphysical processes in low-level warm clouds. In this study, the warm-cloud physics parameterizations in the single-column configurations of NCAR Community Atmospheric Model version 6 and 5 (SCAM6 and SCAM5, respectively) are evaluated using ground-based and airborne observations from the Department of Energy (DOE) Atmospheric Radiation Measurement (ARM) Aerosol and Cloud Experiments in the Eastern North Atlantic (ACE-ENA) field campaign near the Azores islands during 2017–2018. The 8-month single-column model (SCM) simulations show that both SCAM6 and SCAM5 can generally reproduce marine boundary layer cloud structure, major macrophysical properties, and their transition. The improvement in warm-cloud properties from the Community Atmospheric Model 5 and 6 (CAM5 to CAM6) physics can be found through comparison with the observations. Meanwhile, both physical schemes underestimate cloud liquid water content, cloud droplet size, and rain liquid water content but overestimate surface rainfall. Modeled cloud condensation nuclei (CCN) concentrations are comparable with aircraft-observed ones in the summer but are overestimated by a factor of 2 in winter, largely due to the biases in the long-range transport of anthropogenic aerosols like sulfate. We also test the newly recalibrated autoconversion and accretion parameterizations that account for vertical variations in droplet size. Compared to the observations, more significant improvement is found in SCAM5 than in SCAM6. This result is likely explained by the introduction of subgrid variations in cloud properties in CAM6 cloud microphysics, which further suppresses the scheme's sensitivity to individual warm-rain microphysical parameters. The predicted cloud susceptibilities to CCN perturbations in CAM6 are within a reasonable range, indicating significant progress since CAM5, which produces an aerosol indirect effect that is too strong. The present study emphasizes the importance of understanding biases in cloud physics parameterizations by combining SCM with in situ observations.
-
Subgrid parameterizations of mesoscale eddies continue to be in demand for climate simulations. These subgrid parameterizations can be designed using physics and/or data‐driven methods, with uncertainty quantification. For example, Guillaumin and Zanna (2021) proposed a machine learning (ML) model that predicts subgrid forcing and its local uncertainty. The major assumption and potential drawback of this model is the statistical independence of stochastic residuals between grid points. Here, we aim to improve the simulation of stochastic forcing with generative ML models, such as generative adversarial networks (GANs) and variational autoencoders (VAEs). Generative models learn the distribution of subgrid forcing conditioned on the resolved flow directly from data, and they can produce new samples from this distribution. Generative models can potentially capture not only the spatial correlation but any statistically significant property of subgrid forcing. We test the proposed stochastic parameterizations offline and online in an idealized ocean model. We show that generative models are able to predict subgrid forcing and its uncertainty with spatially correlated stochastic forcing. Online simulations for a range of resolutions demonstrate that generative models are superior to the baseline ML model at the coarsest resolution.
