skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: MELPF version 1: Modeling Error Learning based Post-Processor Framework for Hydrologic Models Accuracy Improvement
Abstract. This paper studies how to improve the accuracy of hydrologic models using machine-learning models as post-processors and presents possibilities to reduce the workload to create an accurate hydrologic model by removing the calibration step. It is often challenging to develop an accurate hydrologic model due to the time-consuming model calibration procedure and the nonstationarity of hydrologic data. Our findings show that the errors of hydrologic models are correlated with model inputs. Thus motivated, we propose a modeling-error-learning-based post-processor framework by leveraging this correlation to improve the accuracy of a hydrologic model. The key idea is to predict the differences (errors) between the observed values and the hydrologic model predictions by using machine-learning techniques. To tackle the nonstationarity issue of hydrologic data, a moving-window-based machine-learning approach is proposed to enhance the machine-learning error predictions by identifying the local stationarity of the data using a stationarity measure developed based on the Hilbert–Huang transform. Two hydrologic models, the Precipitation–Runoff Modeling System (PRMS) and the Hydrologic Modeling System (HEC-HMS), are used to evaluate the proposed framework. Two case studies are provided to exhibit the improved performance over the original model using multiple statistical metrics.  more » « less
Award ID(s):
1838024
PAR ID:
10129638
Author(s) / Creator(s):
; ; ; ; ;
Date Published:
Journal Name:
Geoscientific Model Development
Volume:
12
Issue:
9
ISSN:
1991-9603
Page Range / eLocation ID:
4115 to 4131
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Stochastic Watershed Models (SWMs) are emerging tools in hydrologic modeling used to propagate uncertainty into model predictions by adding samples of model error to deterministic simulations. One of the most promising uses of SWMs is uncertainty propagation for hydrologic simulations under climate change. However, a core challenge is that the historical predictive uncertainty may not correctly characterize the error distribution under future climate. For example, the frequency of physical processes (e.g., snow accumulation and melt) may change under climate change, and so too may the frequency of errors associated with those processes. In this work, we explore for the first time non‐stationarity in hydrologic model errors under climate change in an idealized experimental design. We fit one hydrologic model to historical observations, and then fit a second model to the simulations of the first, treating the first model as the true hydrologic system. We then force both models with climate change impacted meteorology and investigate changes to the error distribution between the models. We develop a hybrid machine learning method that maps model state variables to predictive errors, allowing for non‐stationary error distributions based on changes in the frequency of model states. We find that this procedure provides an internally consistent methodology to overcome stationarity assumptions in error modeling and offers an important advance for implementing SWMs under climate change. We test this method on three hydrologically distinct watersheds in California (Feather River, Sacramento River, Calaveras River), finding that the hybrid model performs best in larger and less flashy basins. 
    more » « less
  2. Abstract The World Magnetic Model (WMM) is a geomagnetic main field model that is widely used for navigation by governments, industry and the general public. In recent years, the model has been derived using high accuracy magnetometer data from the Swarm mission. This study explores the possibility of developing future WMMs in the post-Swarm era using data from the Iridium satellite constellation. Iridium magnetometers are primarily used for attitude control, so they are not designed to produce the same level of accuracy as magnetic data from scientific missions. Iridium magnetometer errors range from 30 nT quantization to hundreds of nT errors due to spacecraft contamination and calibration uncertainty, whereas Swarm measurements are accurate to about 1 nT. The calibration uncertainty in the Iridium measurements is identified as a major error source, and a method is developed to calibrate the spacecraft measurements using data from a subset of the INTERMAGNET observatory network producing quasi-definitive data on a regular basis. After calibration, the Iridium data produced main field models with approximately 20 nT average error and 40 nT maximum error as compared to the CHAOS-7.2 model. For many scientific and precision navigation applications, highly accurate Swarm-like measurements are still necessary, however, the Iridium-based models were shown to meet the WMM error tolerances, indicating that Iridium is a viable data source for future WMMs. Graphical Abstract 
    more » « less
  3. Abstract Empirical thresholds for landslide warning systems have benefitted from the incorporation of soil‐hydrologic monitoring data, but the mechanistic basis for their predictive capabilities is limited. Although physically based hydrologic models can accurately simulate changes in soil moisture and pore pressure that promote landslides, their utility is restricted by high computational costs and nonunique parameterization issues. We construct a deep learning model using soil moisture, pore pressure, and rainfall monitoring data acquired from landslide‐prone hillslopes in Oregon, USA, to predict the timing and magnitude of hydrologic response at multiple soil depths for 36‐hr intervals. We find that observation records as short as 6 months are sufficient for accurate predictions, and our model captures hydrologic response for high‐intensity rainfall events even when those storm types are excluded from model training. We conclude that machine learning can provide an accurate and computationally efficient alternative to empirical methods or physical modeling for landslide hazard warning. 
    more » « less
  4. Abstract Obtaining accurate estimates of machine learning model uncertainties on newly predicted data is essential for understanding the accuracy of the model and whether its predictions can be trusted. A common approach to such uncertainty quantification is to estimate the variance from an ensemble of models, which are often generated by the generally applicable bootstrap method. In this work, we demonstrate that the direct bootstrap ensemble standard deviation is not an accurate estimate of uncertainty but that it can be simply calibrated to dramatically improve its accuracy. We demonstrate the effectiveness of this calibration method for both synthetic data and numerous physical datasets from the field of Materials Science and Engineering. The approach is motivated by applications in physical and biological science but is quite general and should be applicable for uncertainty quantification in a wide range of machine learning regression models. 
    more » « less
  5. Ab initio microkinetic modeling, parameterized using density functional theory (DFT) energies, is a common tool to quantify reaction rates and analyze reaction mechanisms a priori in heterogeneous catalysis. Such models, however, often have large prediction errors even if they include plausible reaction steps and correctly model the active sites; this is partially due to the intrinsic inaccuracies of the chosen DFT functional. Borrowing concepts from Bayesian calibration theory, we show that transferable data-driven corrections to DFT energies in the form of Gaussian process models trained on single-crystal adsorption calorimetry data can improve the accuracy of microkinetic models substantially. Specifically, we demonstrate that such corrections improve the predictive accuracy of the microkinetic model of the water-gas shift reaction on single-crystal Cu(111) surface by 3 orders of magnitude. We finally show that Gaussian process corrections serve as informed priors in a Bayesian experimental design framework to learn an accurate a posteriori microkinetic model from few kinetic experiments. We posit that these results suggest that even infusing small, related, high-fidelity thermochemistry data, when available, can systematically and substantially improve the predictive accuracy of microkinetic models. 
    more » « less