Title: Continental Scale Hydrostratigraphy: Comparing Geologically Informed Data Products to Analytical Solutions
Abstract This study synthesizes two different methods for estimating hydraulic conductivity (K) at large scales. We derive analytical approaches that estimate K and apply them to the contiguous United States. We then compare these analytical approaches to three‐dimensional, national gridded K data products and three transmissivity (T) data products developed from publicly available sources. We evaluate these data products in multiple ways: by comparing their statistics qualitatively and quantitatively, and by using them in hydrologic model simulations. Some of these datasets were used as inputs to an integrated hydrologic model of the Upper Colorado River Basin, and comparison of the results with observations was used to further evaluate the K data products. Simulated average daily streamflow was compared to daily flow data from 10 USGS stream gages in the domain, and annually averaged simulated groundwater depths were compared to observations from nearly 2000 monitoring wells. We find streamflow predictions from analytically informed simulations to be similar in relative bias and Spearman's rho to those from the geologically informed simulations. R‐squared values for groundwater depth predictions are close between the best performing analytically and geologically informed simulations, at 0.68 and 0.70, respectively, with RMSE values under 10 m. We also show that the analytical approach derived by this study produces estimates of K that are similar in spatial distribution, standard deviation, mean value, and modeling performance to geologically‐informed estimates. The results of this work are used to inform a follow‐on study that tests additional data‐driven approaches in multiple basins within the contiguous United States.
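The abstract above reports four evaluation metrics (relative bias, Spearman's rho, R‐squared, and RMSE) without giving their formulas. The following is a minimal sketch of how such metrics are commonly computed, not the authors' code. It assumes relative bias is defined as (sum of simulated minus sum of observed) divided by sum of observed, and that R‐squared is the squared Pearson correlation; the study may use other definitions. All function names are hypothetical.

```python
import math


def rmse(obs, sim):
    """Root-mean-square error between observed and simulated values."""
    return math.sqrt(sum((s - o) ** 2 for o, s in zip(obs, sim)) / len(obs))


def relative_bias(obs, sim):
    """Assumed definition: (sum(sim) - sum(obs)) / sum(obs)."""
    return (sum(sim) - sum(obs)) / sum(obs)


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(obs, sim):
    """Spearman's rho: Pearson correlation of the rank-transformed data."""
    return pearson_r(average_ranks(obs), average_ranks(sim))


def r_squared(obs, sim):
    """Assumed definition: squared Pearson correlation."""
    return pearson_r(obs, sim) ** 2
```

Spearman's rho depends only on the ordering of values, so a simulation that is uniformly biased but ranks flows correctly still scores 1; this is why it is usefully paired with a bias metric.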
Award ID(s):
2054506 1835794
PAR ID:
10473644
Author(s) / Creator(s):
; ; ; ; ; ;
Publisher / Repository:
Wiley
Date Published:
Journal Name:
Groundwater
ISSN:
0017-467X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. In dry summer months, stream baseflow sourced from groundwater is essential to support aquatic ecosystems and anthropogenic water use. Hydrologic signatures, or metrics describing unique features of streamflow timeseries, are useful for quantifying and predicting these valuable baseflow and groundwater storage resources across continental scales. Hydrologic signatures can be predicted based on catchment attributes summarising climate and landscape and can be used to characterise baseflow and groundwater processes that cannot be directly measured. While past watershed‐scale studies suggest that landscape attributes are important controls on baseflow and storage processes, recent regional‐to‐global scale modelling studies have instead found that landscape attributes have weaker relationships with hydrologic signatures of these processes than expected compared to climate attributes. In this study, we quantify two landscape attributes, average geologic age and the proportion of catchment area covered by wetlands. We investigate if incorporating these additional predictors into existing large‐sample attribute datasets strengthens continental‐scale, empirical relationships between landscape attributes and hydrologic signatures. We quantify 14 hydrologic signatures related to baseflow and groundwater processes in catchments across the contiguous United States, evaluate the relationships between the new catchment attributes and hydrologic signatures with correlation analysis and use the new attributes to predict hydrologic signatures with random forest models. We found that the average geologic age of catchments was a highly influential predictor of hydrologic signatures, especially for signatures describing baseflow magnitude in catchments, and had greater importance than existing attributes of the subsurface. In contrast, we found that the proportion of wetlands in catchments had limited influence on our hydrologic signature predictions. 
    We recommend incorporating catchment geologic age into large‐sample catchment datasets to improve predictions of baseflow and storage hydrologic signatures and processes across continental scales.
  2. The quality of input data and the process of watershed delineation can affect the accuracy of runoff predictions in watershed modeling. The Upper Mississippi River Basin was selected to evaluate the effects of subbasin and/or hydrologic response unit (HRU) delineations and the density of the climate dataset on the simulated streamflow and water balance components using the Hydrologic and Water Quality System (HAWQS) platform. Five scenarios were examined with the same parameter set, including 8- and 12-digit hydrologic unit codes, two levels of HRU thresholds and two climate data densities. Results showed that statistical evaluations of monthly streamflow from 1983 to 2005 were satisfactory at some gauge sites but were relatively worse at others when shifting from 8-digit to 12-digit subbasins, revealing that the hydrologic response to delineation schemes can vary across a large basin. Average channel slope and drainage density increased significantly from 8-digit to 12-digit subbasins. This resulted in higher lateral flow and groundwater flow estimates, especially for lateral flow. Moreover, a finer HRU delineation tends to generate more runoff because it captures a refined level of watershed spatial variability. The analysis of climate datasets revealed that denser climate data produced higher predicted runoff, especially for summer months.
  3. Abstract Integrated hydrologic models can simulate coupled surface and subsurface processes but are computationally expensive to run at high resolutions over large domains. Here we develop a novel deep learning model to emulate subsurface flows simulated by the integrated ParFlow‐CLM model across the contiguous US. We compare convolutional neural networks like ResNet and UNet run autoregressively against our novel architecture called the Forced SpatioTemporal RNN (FSTR). The FSTR model incorporates separate encoding of initial conditions, static parameters, and meteorological forcings, which are fused in a recurrent loop to produce spatiotemporal predictions of groundwater. We evaluate the model architectures on their ability to reproduce 4D pressure heads, water table depths, and surface soil moisture over the contiguous US at 1 km resolution and daily time steps over the course of a full water year. The FSTR model shows superior performance to the baseline models, producing stable simulations that capture both seasonal and event‐scale dynamics across a wide array of hydroclimatic regimes. The emulators provide over 1,000× speedup compared to the original physical model, which will enable new capabilities like uncertainty quantification and data assimilation for integrated hydrologic modeling that were not previously possible. Our results demonstrate the promise of using specialized deep learning architectures like FSTR for emulating complex process‐based models without sacrificing fidelity. 
  4. Abstract Accurately estimating stream discharge is crucial for many ecological, biogeochemical, and hydrologic analyses. As of September 2022, the National Ecological Observatory Network (NEON) provided up to 5 years of continuous discharge estimates at 28 streams across the United States. NEON created rating curves at each site in a Bayesian framework, parameterized using hydraulic controls and manual measurements of discharge. Here we evaluate the reliability of these discharge estimates with three approaches. We (1) compared predicted to observed discharge, (2) compared predicted to observed stage, and (3) calculated the proportion of discharge estimates extrapolated beyond field measurements. We considered 1,523 site-months of continuous streamflow predictions published by NEON. Of these, 39% met our highest quality criteria, 11% fell into an intermediate classification, and 50% were classified as unreliable. We provided diagnostic metrics and categorical evaluations of continuous discharge and stage estimates by month for each site, enabling users to rapidly query for suitable NEON data.
  5. Stream water temperature (Ts) is a variable of critical importance for aquatic ecosystem health. Ts is strongly affected by groundwater-surface water interactions, which can be learned from streamflow records, but previously such information was challenging to effectively absorb with process-based models due to parameter equifinality. Based on the long short-term memory (LSTM) deep learning architecture, we developed a basin-centric lumped daily mean Ts model, which was trained over 118 data-rich basins with no major dams in the conterminous United States, and showed strong results. At a national scale, we obtained a median root-mean-square error (RMSE) of 0.69 °C, Nash-Sutcliffe model efficiency coefficient (NSE) of 0.985, and correlation of 0.994, which are marked improvements over previous values reported in the literature. The addition of streamflow observations as a model input strongly elevated the performance of this model. In the absence of measured streamflow, we showed that a two-stage model can be used where simulated streamflow from a pre-trained LSTM model (Qsim) still benefits the Ts model, even though no new information was brought directly in the inputs of the Ts model; the model indirectly used information learned from streamflow observations provided during the training of Qsim, potentially to improve internal representation of physically meaningful variables. Our results indicate that strong relationships exist between basin-averaged forcing variables, catchment attributes, and Ts that can be simulated by a single model trained by data on the continental scale.
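Item 5 summarizes model skill as a median across basins of per-basin scores, including the Nash-Sutcliffe efficiency (NSE). As a rough illustration only, the sketch below computes NSE with its standard formula, one minus the ratio of squared simulation error to the variance of the observations about their mean, and then takes the median over basins. The function names and the dictionary layout are hypothetical and are not taken from that study.

```python
from statistics import median


def nash_sutcliffe(obs, sim):
    """NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)."""
    mean_obs = sum(obs) / len(obs)
    error_sq = sum((o - s) ** 2 for o, s in zip(obs, sim))
    spread_sq = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - error_sq / spread_sq


def median_nse(basins):
    """Median NSE over basins.

    `basins` maps a basin identifier to an (observed, simulated) pair of
    equal-length sequences; this layout is an assumption for illustration.
    """
    return median(nash_sutcliffe(obs, sim) for obs, sim in basins.values())
```

An NSE of 1 indicates a perfect match, and an NSE of 0 means the simulation is no better than predicting the observed mean at every time step.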