Abstract Forecast informed reservoir operations (FIRO) is an important advance in water management, but the design and testing of FIRO policies is limited by relatively short (10–35 year) hydro‐meteorological hindcasts. We present a novel, multisite model for synthetic forecast ensembles to overcome this limitation. This model utilizes parametric and non‐parametric procedures to capture complex forecast errors and maintain correlation between variables, lead times, locations, and ensemble members. After being fit to data from the hindcast period, this model can generate synthetic forecast ensembles in any period with observations. We demonstrate the approach in a case study of the FIRO‐based Ensemble Forecast Operations (EFO) control policy for the Lake Mendocino—Russian River basin, which conditions release decisions on ensemble forecasts from the Hydrologic Ensemble Forecast System (HEFS). We explore two generation strategies: (a) simulation of synthetic forecasts of meteorology to force HEFS; and (b) simulation of synthetic HEFS streamflow forecasts directly. We evaluate the synthetic forecasts using ensemble verification techniques and event‐based validation, finding good agreement with the actual ensemble forecasts. We then evaluate EFO policy performance using synthetic and actual forecasts over the hindcast period (1985–2010) and synthetic forecasts only over the pre‐hindcast period (1948–1984). Results show that the synthetic forecasts highlight important failure modes of the EFO policy under plausible forecast ensembles, but improvements are still needed to fully capture FIRO policy behavior under the actual forecast ensembles. Overall, the methodology advances a novel way to test FIRO policy robustness, which is key to building institutional support for FIRO.
more »
« less
Using Machine Learning Models for Short-Term Prediction of Dissolved Oxygen in a Microtidal Estuary
This paper presents a comprehensive approach to predicting short-term (for the upcoming 2 weeks) changes in estuarine dissolved oxygen concentrations via machine learning models that integrate historical water sampling, historical and upcoming 2-week meteorological data, and river discharge and discharge metrics. Dissolved oxygen is a critical indicator of ecosystem health, and this approach is implemented for the Neuse River Estuary, North Carolina, U.S.A., which has a long history of hypoxia-related habitat degradation. Through meticulous data preprocessing and feature selection, this research evaluates the predictions of dissolved oxygen concentrations by comparing a recurrent neural network with four other models, including a Multilayer Perceptron, Long Short-Term Memory, Gradient Boosting, and AutoKeras, through sensitivity experiments. The input predictors to our prediction models include water temperature, turbidity, chlorophyll-a, aggregated river discharge, and aggregated wind based on eight directions. By emphasizing the most impactful predictors, we streamlined the model-building processes and built a hindcast system from 2015 to 2019. We found that the recurrent neural network model was most effective in predicting the dissolved oxygen concentrations, with an R2 value of 0.99 at multiple stations. Different from our machine learning hindcast models that used observed upcoming meteorological and discharge data, an actual forecast system would use forecasted meteorological and discharge data. Therefore, an actual operational forecast may have lower accuracy than the hindcast, as determined by the accuracy of the predicted meteorological and discharge data. Nevertheless, our studies enhance our understanding of the factors influencing dissolved oxygen variability and set the basis for the implementation of a predictive tool for environmental monitoring and management. We also emphasized the importance of building station-specific models to improve the prediction results.
more »
« less
- Award ID(s):
- 2230046
- PAR ID:
- 10558390
- Publisher / Repository:
- MDPI
- Date Published:
- Journal Name:
- Water
- Volume:
- 16
- Issue:
- 14
- ISSN:
- 2073-4441
- Page Range / eLocation ID:
- 1998
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Streamflow prediction is crucial for planning future developments and safety measures along river basins, especially in the face of changing climate patterns. In this study, we utilized monthly streamflow data from the United States Bureau of Reclamation and meteorological data (snow water equivalent, temperature, and precipitation) from the various weather monitoring stations of the Snow Telemetry Network within the Upper Colorado River Basin to forecast monthly streamflow at Lees Ferry, a specific location along the Colorado River in the basin. Four machine learning models—Random Forest Regression, Long short-term memory, Gated Recurrent Unit, and Seasonal AutoRegresive Integrated Moving Average—were trained using 30 years of monthly data (1991–2020), split into 80% for training (1991–2014) and 20% for testing (2015–2020). Initially, only historical streamflow data were used for predictions, followed by including meteorological factors to assess their impact on streamflow. Subsequently, sequence analysis was conducted to explore various input-output sequence window combinations. We then evaluated the influence of each factor on streamflow by testing all possible combinations to identify the optimal feature combination for prediction. Our results indicate that the Random Forest Regression model consistently outperformed others, especially after integrating all meteorological factors with historical streamflow data. The best performance was achieved with a 24-month look-back period to predict 12 months of streamflow, yielding a Root Mean Square Error of 2.25 and R-squared (R2) of 0.80. Finally, to assess model generalizability, we tested the best model at other locations—Greenwood Springs (Colorado River), Maybell (Yampa River), and Archuleta (San Juan) in the basin.more » « less
-
null (Ed.)Abstract Some of the most intense convective storms on Earth initiate near the Sierras de Córdoba mountain range in Argentina. The goal of the RELAMPAGO field campaign was to observe these intense convective storms and their associated impacts. The intense observation period (IOP) occurred during November–December 2018. The two goals of the hydrometeorological component of RELAMPAGO IOP were 1) to perform hydrological streamflow and meteorological observations in previously ungauged basins and 2) to build a hydrometeorological modeling system for hindcast and forecast applications. During the IOP, our team was able to construct the stage–discharge curves in three basins, as hydrological instrumentation and personnel were successfully deployed based on RELAMPAGO weather forecasts. We found that the flood response time in these river locations is typically between 5 and 6 h from the peak of the rain event. The satellite-observed rainfall product IMERG-Final showed a better representation of rain gauge–estimated precipitation, while IMERG-Early and IMERG-Late had significant positive bias. The modeling component focuses on the 48-h simulation of an extreme hydrometeorological event that occurred on 27 November 2018. Using the Weather Research and Forecasting (WRF) atmospheric model and its hydrologic component WRF-Hydro as an uncoupled hydrologic model, we developed a system for hindcast, deterministic forecast, and a 60-member ensemble forecast initialized with regional-scale atmospheric data assimilation. Critically, our results highlight that streamflow simulations using the ensemble forecasting with data assimilation provide realistic flash flood forecast in terms of timing and magnitude of the peak. Our findings from this work are being used by the water managers in the region.more » « less
-
We collected this data to better understand the timing of peak benthic cyanobacterial mat occurrence (specifically taxa associated with anatoxin production, Microcoleus and Anabaena) and mat anatoxin concentrations in rivers. We sampled in northern California on the South Fork Eel, Salmon, and Russian Rivers biweekly in 2022, and the Salmon River biweekly and South Fork Eel weekly in 2023. During each sampling event, we conducted benthic cover surveys, measured in-situ water quality parameters (temperature, pH, dissolved oxygen, conductivity), and collected surface water samples and targeted cyanobacteria samples. In 2022 on all rivers and in 2023 at the Salmon River, we also collected distributed non-targeted periphyton samples to characterize full-reach community compositions. All sampling was completed in 150-m reaches upstream of sensors recording continuous dissolved oxygen, conductivity, and temperature data. We analyzed surface water samples for nitrate, ammonium, soluble reactive phosphate, total dissolved carbon, and dissolved organic carbon. We also analyzed surface water samples from 2022 for major anions (Cl, SO4, Br) and cations (Na, K, Mg, Ca). Targeted-cyanobacteria and non-target periphyton samples were analyzed for anatoxins, relative abundance of algal taxa (via microscopy), ash-free dry mass, and chlorophyll-a. To estimate mean river depth within the dissolved oxygen footprint upstream of sensors, we kayaked portions of the river and collected river depth measurements. We also measured discharge at each river excluding the Salmon River (due to high discharge) and completed pebble counts at the South Fork Eel River to obtain sediment grain size distributions.more » « less
-
Power grid operators rely on solar irradiance forecasts to manage uncertainty and variability associated with solar power. Meteorological factors such as cloud cover, wind direction, and wind speed affect irradiance and are associated with a high degree of variability and uncertainty. Statistical models fail to accurately capture the dependence between these factors and irradiance. In this paper, we introduce the idea of applying multivariate Gated Recurrent Units (GRU) to forecast Direct Normal Irradiance (DNI) hourly. The proposed GRU-based forecasting method is evaluated against traditional Long Short-Term Memory (LSTM) using historical irradiance data (i.e., weather variables that include cloud cover, wind direction, and wind speed) to forecast irradiance forecasting over intra-hour and inter-hour intervals. Our evaluation on one of the sites from Measurement and Instrumentation Data Center indicate that both GRU and LSTM improved DNI forecasting performance when evaluated under different conditions. Moreover, including wind direction and wind speed can have substantial improvement in the accuracy of DNI forecasts. Besides, the forecasting model can accurately forecast irradiance values over multiple forecasting horizons.more » « less
An official website of the United States government

