Exploring Random Forest Machine Learning and Remote Sensing Data for Streamflow Prediction: An Alternative Approach to a Process-Based Hydrologic Modeling in a Snowmelt-Driven Watershed

Islam, Khandaker Iftekharul; Elias, Emile; Carroll, Kenneth C.; Brown, Christopher

doi:10.3390/rs15163999

Citation Details

This content will become publicly available on August 1, 2024

Exploring Random Forest Machine Learning and Remote Sensing Data for Streamflow Prediction: An Alternative Approach to a Process-Based Hydrologic Modeling in a Snowmelt-Driven Watershed

Physically based hydrologic models require significant effort and extensive information for development, calibration, and validation. The study explored the use of the random forest regression (RFR), a supervised machine learning (ML) model, as an alternative to the physically based Soil and Water Assessment Tool (SWAT) for predicting streamflow in the Rio Grande Headwaters near Del Norte, a snowmelt-dominated mountainous watershed of the Upper Rio Grande Basin. Remotely sensed data were used for the random forest machine learning analysis (RFML) and RStudio for data processing and synthesizing. The RFML model outperformed the SWAT model in accuracy and demonstrated its capability in predicting streamflow in this region. We implemented a customized approach to the RFR model to assess the model’s performance for three training periods, across 1991–2010, 1996–2010, and 2001–2010; the results indicated that the model’s accuracy improved with longer training periods, implying that the model trained on a more extended period is better able to capture the parameters’ variability and reproduce streamflow data more accurately. The variable importance (i.e., IncNodePurity) measure of the RFML model revealed that the snow depth and the minimum temperature were consistently the top two predictors across all training periods. The paper also evaluated how well the SWAT model performs in reproducing streamflow data of the watershed with a conventional approach. The SWAT model needed more time and data to set up and calibrate, delivering acceptable performance in annual mean streamflow simulation, with satisfactory index of agreement (d), coefficient of determination (R2), and percent bias (PBIAS) values, but monthly simulation warrants further exploration and model adjustments. The study recommends exploring snowmelt runoff hydrologic processes, dust-driven sublimation effects, and more detailed topographic input parameters to update the SWAT snowmelt routine for better monthly flow estimation. The results provide a critical analysis for enhancing streamflow prediction, which is valuable for further research and water resource management, including snowmelt-driven semi-arid regions.

Award ID(s):: 2142686

NSF-PAR ID:: 10479936

Author(s) / Creator(s):: Islam, Khandaker Iftekharul; Elias, Emile; Carroll, Kenneth C.; Brown, Christopher

Corporate Creator(s):: NA

Editor(s):: Thenkabail, Prasad S.

Publisher / Repository:: MDPI

Date Published:: 2023-08-01

Journal Name:: Remote Sensing

Volume:: 15

Issue:: 16

ISSN:: 2072-4292

Page Range / eLocation ID:: 3999

Subject(s) / Keyword(s):: ["streamflow prediction","random forest machine learning","hydrologic modeling","water resource management","remote sensing data","climate change"]

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on August 1, 2024
Journal Article:
https://doi.org/10.3390/rs15163999

More Like this