Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
                                            Some full text articles may not yet be available without a charge during the embargo (administrative interval).
                                        
                                        
                                        
                                            
                                                
                                             What is a DOI Number?
                                        
                                    
                                
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
- 
            Large-scale hydrologic models are increasingly being developed for operational use in the forecasting and planning of water resources. However, the predictive strength of such models depends on how well they resolve various functions of catchment hydrology, which are influenced by gradients in climate, topography, soils, and land use. Most assessments of hydrologic model uncertainty have been limited to traditional statistical methods. Here, we present a proof-of-concept approach that uses interpretable machine learning techniques to provide post hoc assessment of model sensitivity and process deficiency in hydrologic models. We train a random forest model to predict the Kling–Gupta efficiency (KGE) of National Water Model (NWM) and National Hydrologic Model (NHM) streamflow predictions for 4383 stream gauges in the conterminous United States. Thereafter, we explain the local and global controls that 48 catchment attributes exert on KGE prediction using interpretable Shapley values. Overall, we find that soil water content is the most impactful feature controlling successful model performance, suggesting that soil water storage is difficult for hydrologic models to resolve, particularly for arid locations. We identify nonlinear thresholds beyond which predictive performance decreases for NWM and NHM. For example, soil water content less than 210 mm, precipitation less than 900 mm yr−1, road density greater than 5 km km−2, and lake area percent greater than 10 % contributed to lower KGE values. These results suggest that improvements in how these influential processes are represented could result in the largest increases in NWM and NHM predictive performance. This study demonstrates the utility of interrogating process-based models using data-driven techniques, which has broad applicability and potential for improving the next generation of large-scale hydrologic models.more » « lessFree, publicly-accessible full text available September 17, 2026
- 
            Excess riverine nitrate causes downstream eutrophication, notably in the Gulf of Mexico where hypoxia is linked to nutrient-rich discharge from the Mississippi River Basin (MRB). We developed a long short-term memory (LSTM) model using high-frequency sensor data from across the conterminous US to predict daily nitrate concentrations, achieving strong temporal validation performance (median KGE = 0.60). Spatial validation—or prediction in unmonitored basins—yielded lower performance for nitrate concentration (median KGE = 0.18). Nonetheless, spatial validation was crucial in quantifying the impact of current data gaps and guiding the model's targeted application to the MRB where spatial validation performance was stronger (median KGE = 0.34). Modeling results for the MRB from 1980 to 2022 showed relatively low riverine nitrate export (19 ± 4% of surplus), indicating large-scale retention of surplus nitrate within the MRB. Interannual nitrate yields varied significantly, especially in Midwestern states like Iowa, where wet-year export fractions (42 ± 24%) far exceeded dry year export (6 ± 6%), suggesting increased hydrologic connectivity and remobilization of legacy nitrogen. Further evidence of legacy nitrate remobilization was noted in a subset of Midwestern basins where, on occasion, annual surplus export fractions exceeded 100%. Interpretable Shapley values identified key spatial drivers influencing mean nitrate concentrations—tile drainage, roadway density, wetland cover—and quantitative, non-linear thresholds in their influence, offering management targets. This study leverages machine learning and aquatic sensing to provide improved spatiotemporal predictions and insights into nitrate drivers, thresholds, and legacy impacts, offering valuable information for targeted nutrient management strategies in the MRB.more » « lessFree, publicly-accessible full text available August 1, 2026
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
