skip to main content

Title: A Long-term Consistent Artificial Intelligence and Remote Sensing-based Soil Moisture Dataset

The Consistent Artificial Intelligence (AI)-based Soil Moisture (CASM) dataset is a global, consistent, and long-term, remote sensing soil moisture (SM) dataset created using machine learning. It is based on the NASA Soil Moisture Active Passive (SMAP) satellite mission SM data and is aimed at extrapolating SMAP-like quality SM back in time using previous satellite microwave platforms. CASM represents SM in the top soil layer, and it is defined on a global 25 km EASE-2 grid and for 2002–2020 with a 3-day temporal resolution. The seasonal cycle is removed for the neural network training to ensure its skill is targeted at predicting SM extremes. CASM comparison to 367 globalin-situSM monitoring sites shows a SMAP-like median correlation of 0.66. Additionally, the SM product uncertainty was assessed, and both aleatoric and epistemic uncertainties were estimated and included in the dataset. CASM dataset can be used to study a wide range of hydrological, carbon cycle, and energy processes since only a consistent long-term dataset allows assessing changes in water availability and water stress.

more » « less
Author(s) / Creator(s):
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Data
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    In-depth knowledge about the global patterns and dynamics of land surface net water flux (NWF) is essential for quantification of depletion and recharge of groundwater resources. Net water flux cannot be directly measured, and its estimates as a residual of individual surface flux components often suffer from mass conservation errors due to accumulated systematic biases of individual fluxes. Here, for the first time, we provide direct estimates of global NWF based on near-surface satellite soil moisture retrievals from the Soil Moisture Ocean Salinity (SMOS) and Soil Moisture Active Passive (SMAP) satellites. We apply a recently developed analytical model derived via inversion of the linearized Richards’ equation. The model is parsimonious, yet yields unbiased estimates of long-term cumulative NWF that is generally well correlated with the terrestrial water storage anomaly from the Gravity Recovery and Climate Experiment (GRACE) satellite. In addition, in conjunction with precipitation and evapotranspiration retrievals, the resultant NWF estimates provide a new means for retrieving global infiltration and runoff from satellite observations. However, the efficacy of the proposed approach over densely vegetated regions is questionable, due to the uncertainty of the satellite soil moisture retrievals and the lack of explicit parameterization of transpiration by deeply rooted plants in the proposed model. Future research is needed to advance this modeling paradigm to explicitly account for plant transpiration.

    more » « less
  2. Abstract

    Irrigation representation in land surface models has been advanced over the past decade, but the soil moisture (SM) data from SMAP satellite have not yet been utilized in large‐scale irrigation modeling. Here we investigate the potential of improving irrigation representation in the Community Land Model version‐4.5 (CLM4.5) by assimilating SMAP data. Simulations are conducted over the heavily irrigated central U.S. region. We find that constraining the target SM in CLM4.5 using SMAP data assimilation with 1‐D Kalman filter reduces the root‐mean‐square error of simulated irrigation water requirement by 50% on average (for Nebraska, Kansas, and Texas) and significantly improves irrigation simulations by reducing the bias in irrigation water requirement by up to 60%. An a priori bias correction of SMAP data further improves these results in some regions but incrementally. Data assimilation also enhances SM simulations in CLM4.5. These results could provide a basis for improved modeling of irrigation and land‐atmosphere interactions.

    more » « less
  3. null (Ed.)
    Abstract Soil moisture (SM) and evapotranspiration (ET) are key variables of the terrestrial water cycle with a strong relationship. This study examines remotely sensed soil moisture and evapotranspiration data assimilation (DA) with the aim of improving drought monitoring. Although numerous efforts have gone into assimilating satellite soil moisture observations into land surface models to improve their predictive skills, little attention has been given to the combined use of soil moisture and evapotranspiration to better characterize hydrologic fluxes. In this study, we assimilate two remotely sensed datasets, namely, Soil Moisture Operational Product System (SMOPS) and MODIS evapotranspiration (MODIS16 ET), at 1-km spatial resolution, into the VIC land surface model by means of an evolutionary particle filter method. To achieve this, a fully parallelized framework based on model and domain decomposition using a parallel divide-and-conquer algorithm was implemented. The findings show improvement in soil moisture predictions by multivariate assimilation of both ET and SM as compared to univariate scenarios. In addition, monthly and weekly drought maps are produced using the updated root-zone soil moisture percentiles over the Apalachicola–Chattahoochee–Flint basin in the southeastern United States. The model-based estimates are then compared against the corresponding U.S. Drought Monitor (USDM) archive maps. The results are consistent with the USDM maps during the winter and spring season considering the drought extents; however, the drought severity was found to be slightly higher according to DA method. Comparing different assimilation scenarios showed that ET assimilation results in wetter conditions comparing to open-loop and univariate SM DA. The multivariate DA then combines the effects of the two variables and provides an in-between condition. 
    more » « less
  4. Abstract

    Fires that emit massive amounts of CO2and particulate matter now burn with regularity in Southeast Asian tropical peatlands. Natural peatlands in Southeast Asia are waterlogged for most of the year and experience little or no fire, but networks of canals constructed for agriculture have drained vast areas of these peatlands, making the soil vulnerable to fire during periods of low rainfall. While soil moisture is the most direct measure of peat flammability, it has not been incorporated into fire studies due to an absence of regional observations. Here, we create the first remotely sensed soil moisture dataset for tropical peatlands in Sumatra, Borneo and Peninsular Malaysia by applying a new retrieval algorithm to satellite data from the Soil Moisture Active Passive (SMAP) mission with data spanning the 2015 El Niño burning event. Drier soil up to 30 days prior to fire correlates with larger burned area. The predictive information provided by soil moisture complements that of precipitation. Our remote sensing-derived results mirror those from a laboratory-based peat ignition study, suggesting that the dependence of fire on soil moisture exhibits scale independence within peatlands. Soil moisture measured from SMAP, a dataset spanning 2015-present, is a valuable resource for peat fire studies and warning systems.

    more » « less
  5. Abstract

    Recently, recurrent deep networks have shown promise to harness newly available satellite‐sensed data for long‐term soil moisture projections. However, to be useful in forecasting, deep networks must also provide uncertainty estimates. Here we evaluated Monte Carlo dropout with an input‐dependent data noise term (MCD+N), an efficient uncertainty estimation framework originally developed in computer vision, for hydrologic time series predictions. MCD+N simultaneously estimates a heteroscedastic input‐dependent data noise term (a trained error model attributable to observational noise) and a network weight uncertainty term (attributable to insufficiently constrained model parameters). Although MCD+N has appealing features, many heuristic approximations were employed during its derivation, and rigorous evaluations and evidence of its asserted capability to detect dissimilarity were lacking. To address this, we provided an in‐depth evaluation of the scheme's potential and limitations. We showed that for reproducing soil moisture dynamics recorded by the Soil Moisture Active Passive (SMAP) mission, MCD+N indeed gave a good estimate of predictive error, provided that we tuned a hyperparameter and used a representative training data set. The input‐dependent term responded strongly to observational noise, while the model term clearly acted as a detector for physiographic dissimilarity from the training data, behaving as intended. However, when the training and test data were characteristically different, the input‐dependent term could be misled, undermining its reliability. Additionally, due to the data‐driven nature of the model, data noise also influences network weight uncertainty, and therefore the two uncertainty terms are correlated. Overall, this approach has promise, but care is needed to interpret the results.

    more » « less