skip to main content


Title: Systematic Bias Correction in Ocean Mesoscale Forecasting Using Machine Learning
Abstract

The ocean circulation is modulated by meandering currents and eddies. Forecasting their evolution is a key target of operational models, but their forecast skill remains limited. We propose a machine learning approach that improves the output of an ocean circulation model by learning and predicting its systematic biases. This method can be applied a priori to any region, and is tested in the Gulf of Mexico, where the Loop Current (LC) and the large anticyclonic eddies that detach from it are major forecasting targets. The LC dynamics are recurrent and lie on a low‐dimensional dynamical attractor. Building upon the information gained analyzing this low dimensional attractor, we improve the representation of sea surface anomalies in model outputs through information from satellite altimeter data using a Sequence‐to‐Sequence model, which is a special class of Recurrent Neural Network. Building upon the HYCOM‐NCODA analysis system, we deliver a correction to the forecast at the observation resolution. For at least 15 days the proposed method learns to forecast the systematic bias in the HYCOM‐NCODA, outperforming persistence, and improving the forecast. This data‐driven approach is fast and can be implemented as an added step to any dynamical hindcasting or forecasting model. It offers an interesting avenue for further developing hybrid modeling tools. In these tools, fundamental physical conservations are preserved through the integration of partial differential equations which obey them. In addition, the method highlights specific deficiencies of the hindcast system that deserve further investigation in the future.

 
more » « less
NSF-PAR ID:
10473137
Author(s) / Creator(s):
 ;  ;  
Publisher / Repository:
DOI PREFIX: 10.1029
Date Published:
Journal Name:
Journal of Advances in Modeling Earth Systems
Volume:
15
Issue:
11
ISSN:
1942-2466
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. According to the National Academies, a week long forecast of velocity, vertical structure, and duration of the Loop Current (LC) and its eddies at a given location is a critical step toward understanding their effects on the gulf ecosystems as well as toward anticipating and mitigating the outcomes of anthropogenic and natural disasters in the Gulf of Mexico (GoM). However, creating such a forecast has remained a challenging problem since LC behavior is dominated by dynamic processes across multiple time and spatial scales not resolved at once by conventional numerical models. In this paper, building on the foundation of spatiotemporal predictive learning in video prediction, we develop a physics informed deep learning based prediction model called—Physics-informed Tensor-train ConvLSTM (PITT-ConvLSTM)—for forecasting 3D geo-spatiotemporal sequences. Specifically, we propose (1) a novel 4D higher-order recurrent neural network with empirical orthogonal function analysis to capture the hidden uncorrelated patterns of each hierarchy, (2) a convolutional tensor-train decomposition to capture higher-order space-time correlations, and (3) a mechanism that incorporates prior physics from domain experts by informing the learning in latent space. The advantage of our proposed approach is clear: constrained by the law of physics, the prediction model simultaneously learns good representations for frame dependencies (both short-term and long-term high-level dependency) and inter-hierarchical relations within each time frame. Experiments on geo-spatiotemporal data collected from the GoM demonstrate that the PITT-ConvLSTM model can successfully forecast the volumetric velocity of the LC and its eddies for a period greater than 1 week. 
    more » « less
  2. Despite the large efforts made by the ocean modeling community, such as the GODAE (Global Ocean Data Assimilation Experiment), which started in 1997 and was renamed as OceanPredict in 2019, the prediction of ocean currents has remained a challenge until the present day—particularly in ocean regions that are characterized by rapid changes in their circulation due to changes in atmospheric forcing or due to the release of available potential energy through the development of instabilities. Ocean numerical models’ useful forecast window is no longer than two days over a given area with the best initialization possible. Predictions quickly diverge from the observational field throughout the water and become unreliable, despite the fact that they can simulate the observed dynamics through other variables such as temperature, salinity and sea surface height. Numerical methods such as harmonic analysis are used to predict both short- and long-term tidal currents with significant accuracy. However, they are limited to the areas where the tide was measured. In this study, a new approach to ocean current prediction based on deep learning is proposed. This method is evaluated on the measured energetic currents of the Gulf of Mexico circulation dominated by the Loop Current (LC) at multiple spatial and temporal scales. The approach taken herein consists of dividing the velocity tensor into planes perpendicular to each of the three Cartesian coordinate system directions. A Long Short-Term Memory Recurrent Neural Network, which is best suited to handling long-term dependencies in the data, was thus used to predict the evolution of the velocity field in each plane, along each of the three directions. The predicted tensors, made of the planes perpendicular to each Cartesian direction, revealed that the model’s prediction skills were best for the flow field in the planes perpendicular to the direction of prediction. Furthermore, the fusion of all three predicted tensors significantly increased the overall skills of the flow prediction over the individual model’s predictions. The useful forecast period of this new model was greater than 4 days with a root mean square error less than 0.05 cm·s−1 and a correlation coefficient of 0.6. 
    more » « less
  3. Abstract

    Forecasting the El Niño-Southern Oscillation (ENSO) has been a subject of vigorous research due to the important role of the phenomenon in climate dynamics and its worldwide socioeconomic impacts. Over the past decades, numerous models for ENSO prediction have been developed, among which statistical models approximating ENSO evolution by linear dynamics have received significant attention owing to their simplicity and comparable forecast skill to first-principles models at short lead times. Yet, due to highly nonlinear and chaotic dynamics (particularly during ENSO initiation), such models have limited skill for longer-term forecasts beyond half a year. To resolve this limitation, here we employ a new nonparametric statistical approach based on analog forecasting, called kernel analog forecasting (KAF), which avoids assumptions on the underlying dynamics through the use of nonlinear kernel methods for machine learning and dimension reduction of high-dimensional datasets. Through a rigorous connection with Koopman operator theory for dynamical systems, KAF yields statistically optimal predictions of future ENSO states as conditional expectations, given noisy and potentially incomplete data at forecast initialization. Here, using industrial-era Indo-Pacific sea surface temperature (SST) as training data, the method is shown to successfully predict the Niño 3.4 index in a 1998–2017 verification period out to a 10-month lead, which corresponds to an increase of 3–8 months (depending on the decade) over a benchmark linear inverse model (LIM), while significantly improving upon the ENSO predictability “spring barrier”. In particular, KAF successfully predicts the historic 2015/16 El Niño at initialization times as early as June 2015, which is comparable to the skill of current dynamical models. An analysis of a 1300-yr control integration of a comprehensive climate model (CCSM4) further demonstrates that the enhanced predictability afforded by KAF holds over potentially much longer leads, extending to 24 months versus 18 months in the benchmark LIM. Probabilistic forecasts for the occurrence of El Niño/La Niña events are also performed and assessed via information-theoretic metrics, showing an improvement of skill over LIM approaches, thus opening an avenue for environmental risk assessment relevant in a variety of contexts.

     
    more » « less
  4. Abstract

    Long‐lead forecasting for spatio‐temporal systems can entail complex nonlinear dynamics that are difficult to specify a priori. Current statistical methodologies for modeling these processes are often highly parameterized and, thus, challenging to implement from a computational perspective. One potential parsimonious solution to this problem is a method from the dynamical systems and engineering literature referred to as an echo state network (ESN). ESN models usereservoir computingto efficiently compute recurrent neural network forecasts. Moreover, multilevel (deep) hierarchical models have recently been shown to be successful at predicting high‐dimensional complex nonlinear processes, particularly those with multiple spatial and temporal scales of variability (such as those we often find in spatio‐temporal environmental data). Here, we introduce a deep ensemble ESN (D‐EESN) model. Despite the incorporation of a deep structure, the presented model is computationally efficient. We present two versions of this model for spatio‐temporal processes that produce forecasts and associated measures of uncertainty. The first approach utilizes a bootstrap ensemble framework, and the second is developed within a hierarchical Bayesian framework (BD‐EESN). This more general hierarchical Bayesian framework naturally accommodates non‐Gaussian data types and multiple levels of uncertainties. The methodology is first applied to a data set simulated from a novel non‐Gaussian multiscale Lorenz‐96 dynamical system simulation model and, then, to a long‐lead United States (U.S.) soil moisture forecasting application. Across both applications, the proposed methodology improves upon existing methods in terms of both forecast accuracy and quantifying uncertainty.

     
    more » « less
  5. Antarctic sea ice prediction has garnered increasing attention in recent years, particularly in the context of the recent record lows of February 2022 and 2023. As Antarctica becomes a climate change hotspot, as polar tourism booms, and as scientific expeditions continue to explore this remote continent, the capacity to anticipate sea ice conditions weeks to months in advance is in increasing demand. Spurred by recent studies that uncovered physical mechanisms of Antarctic sea ice predictability and by the intriguing large variations of the observed sea ice extent in recent years, the Sea Ice Prediction Network South (SIPN South) project was initiated in 2017, building upon the Arctic Sea Ice Prediction Network. The SIPN South project annually coordinates spring-to-summer predictions of Antarctic sea ice conditions, to allow robust evaluation and intercomparison, and to guide future development in polar prediction systems. In this paper, we present and discuss the initial SIPN South results collected over six summer seasons (December-February 2017-2018 to 2022-2023). We use data from 22 unique contributors spanning five continents that have together delivered more than 3000 individual forecasts of sea ice area and concentration. The SIPN South median forecast of the circumpolar sea ice area captures the sign of the recent negative anomalies, and the verifying observations are systematically included in the 10-90% range of the forecast distribution. These statements also hold at the regional level except in the Ross Sea where the systematic biases and the ensemble spread are the largest. A notable finding is that the group forecast, constructed by aggregating the data provided by each contributor, outperforms most of the individual forecasts, both at the circumpolar and regional levels. This indicates the value of combining predictions to average out model-specific errors. Finally, we find that dynamical model predictions (i.e., based on process-based general circulation models) generally perform worse than statistical model predictions (i.e., data-driven empirical models including machine learning) in representing the regional variability of sea ice concentration in summer. SIPN South is a collaborative community project that is hosted on a shared public repository. The forecast and verification data used in SIPN South are publicly available in near-real time for further use by the polar research community, and eventually, policymakers. 
    more » « less