In situ sensors that collect high-frequency data are used increasingly to monitor aquatic environments. These sensors are prone to technical errors, resulting in unrecorded observations and/or anomalous values that are subsequently removed and create gaps in time series data. We present a framework based on generalized additive and auto-regressive models to recover these missing data. To mimic sporadically missing (i) single observations and (ii) periods of contiguous observations, we randomly removed (i) point data and (ii) day- and week-long sequences of data from a two-year time series of nitrate concentration data collected from Arikaree River, USA, where synoptically collected water temperature, turbidity, conductance, elevation, and dissolved oxygen data were available. In 72% of cases with missing point data, predicted values were within the sensor precision interval of the original value, although predictive ability declined when sequences of missing data occurred. Precision also depended on the availability of other water quality covariates. When covariates were available, even a sudden, event-based peak in nitrate concentration was reconstructed well. By providing a promising method for accurate prediction of missing data, the utility and confidence in summary statistics and statistical trends will increase, thereby assisting the effective monitoring and management of fresh waters and other at-risk ecosystems. 
                        more » 
                        « less   
                    
                            
                            Understanding links between water-quality variables and nitrate concentration in freshwater streams using high frequency sensor data
                        
                    
    
            Real-time monitoring usingin-situsensors is becoming a common approach for measuring water-quality within watersheds. High-frequency measurements produce big datasets that present opportunities to conduct new analyses for improved understanding of water-quality dynamics and more effective management of rivers and streams. Of primary importance is enhancing knowledge of the relationships between nitrate, one of the most reactive forms of inorganic nitrogen in the aquatic environment, and other water-quality variables. We analysed high-frequency water-quality data fromin-situsensors deployed in three sites from different watersheds and climate zones within the National Ecological Observatory Network, USA. We used generalised additive mixed models to explain the nonlinear relationships at each site between nitrate concentration and conductivity, turbidity, dissolved oxygen, water temperature, and elevation. Temporal auto-correlation was modelled with an auto-regressive–moving-average (ARIMA) model and we examined the relative importance of the explanatory variables. Total deviance explained by the models was high for all sites (99%). Although variable importance and the smooth regression parameters differed among sites, the models explaining the most variation in nitrate contained the same explanatory variables. This study demonstrates that building a model for nitrate using the same set of explanatory water-quality variables is achievable, even for sites with vastly different environmental and climatic characteristics. Applying such models will assist managers to select cost-effective water-quality variables to monitor when the goals are to gain a spatial and temporal in-depth understanding of nitrate dynamics and adapt management plans accordingly. 
        more » 
        « less   
        
    
    
                            - PAR ID:
- 10494127
- Editor(s):
- Larsen, Stefano
- Publisher / Repository:
- PLOS One
- Date Published:
- Journal Name:
- PLOS ONE
- Volume:
- 18
- Issue:
- 6
- ISSN:
- 1932-6203
- Page Range / eLocation ID:
- e0287640
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
- 
            
- 
            Water quality is affected by multiple spatial and temporal factors, including the surrounding land characteristics, human activities, and antecedent precipitation amounts. However, identifying the relationships between water quality and spatially and temporally varying environmental variables with a machine learning technique in a heterogeneous urban landscape has been understudied. We explore how seasonal and variable precipitation amounts and other small-scale landscape variables affect E. coli, total suspended solids (TSS), nitrogen-nitrate, orthophosphate, lead, and zinc concentrations in Portland, Oregon, USA. Mann–Whitney tests were used to detect differences in water quality between seasons and COVID-19 periods. Spearman’s rank correlation analysis was used to identify the relationship between water quality and explanatory variables. A Random Forest (RF) model was used to predict water quality using antecedent precipitation amounts and landscape variables as inputs. The performance of RF was compared with that of ordinary least squares (OLS). Mann–Whitney tests identified statistically significant differences in all pollutant concentrations (except TSS) between the wet and dry seasons. Nitrate was the only pollutant to display statistically significant reductions in median concentrations (from 1.5 mg/L to 1.04 mg/L) during the COVID-19 lockdown period, likely associated with reduced traffic volumes. Spearman’s correlation analysis identified the highest correlation coefficients between one-day precipitation amounts and E. coli, lead, zinc, and TSS concentrations. Road length is positively associated with E. coli and zinc. The Random Forest (RF) model best predicts orthophosphate concentrations (R2 = 0.58), followed by TSS (R2 = 0.54) and nitrate (R2 = 0.46). E. coli was the most difficult to model and had the highest RMSE, MAE, and MAPE values. Overall, the Random Forest model outperformed OLS, as evaluated by RMSE, MAE, MAPE, and R2. The Random Forest was an effective approach to modeling pollutant concentrations using both categorical seasonal and COVID data along with continuous rain and landscape variables to predict water quality in urban streams. Implementing optimization techniques can further improve the model’s performance and allow researchers to use a machine learning approach for water quality modeling.more » « less
- 
            ABSTRACT Agricultural nitrate pollution is a major threat to water quality in Iowa. Iowa uses a majority of its land for row crop agriculture and maintains a large livestock population, which together cause high nitrate loads in streams. High‐frequency stream nitrate data can aid policy decisions for reducing nitrate emissions by identifying streams with high nitrate loads, historical trends of improvement or deterioration in nitrate loads, and land use or practice changes that affect water quality. We developed a time series regression model framework to supplement existing sensor data and predict daily nitrate loads in Iowa streams lacking nitrate monitoring. Using nitrate data from statewide and national resources, this framework was trained and validated using 11 study sites of diverse geography and land use in Iowa. Partial least squares regression (PLSR) was used with geographical predictors, including land use, hydrogeology, and meteorology, to predict streamflow‐nitrate load relationships across the study sites. The developed PLSR model, combined with daily streamflow data, was then used to predict daily nitrate loads with high accuracy over a three‐year study period with a mean Kling–Gupta Efficiency of 0.74. Our framework was then used to estimate mean nitrate concentrations at 34 sites that lack nitrate sensors, demonstrating a low‐cost, facile method for the accurate prediction of daily nitrate loads in Iowa streams.more » « less
- 
            <italic>Abstract</italic> The steadily rising global urban population has placed substantial strain on urban water quality, and this strain is projected to increase for the foreseeable future. Considerable attention has been given to the hydrological and physico‐chemical effects of urbanization on stream ecosystems. However, due to the relative infancy of the field of urban ecology, long‐term water quality analyses in urban streams are sparse. Using a 15‐yr stream chemistry monitoring record from Baltimore, Maryland, we quantified long‐term trends in nitrate, phosphate, total nitrogen, total phosphorus, chloride, and sulfate export at several sites along a rural–urban gradient. We found no significant change in solute export at most sites, although we did find specific patterns of interest for certain solutes. For example, nitrogen export declined at the most headwater urban site, while phosphorus export declined at the most downstream urban site. Coupling long‐term monitoring with data on gray and green infrastructure management throughout the landscape, we established relationships between solute export at the most downstream urban monitoring site and sanitary sewer overflows (SSOs), best management practice (BMP) implementation, and road salt application rates. Phosphorus export was correlated with BMP implementation in the watershed, whereas nitrogen export was related to SSOs. Despite highly urbanized watersheds, water quality does not appear to be declining at most of these sites, suggesting that current management may have limited further impairment. Results of our study suggest that both gray and green infrastructure are key for maintaining and improving water quality in this highly urbanized watershed.more » « less
- 
            Landowners and natural resource agencies are seeking to better understand the benefits of best management practices (BMPs) for addressing water quality issues. Using edge-of-field and edge-of-farm runoff analysis, we compared runoff volumes and water quality between small watersheds where BMPs (e.g., prescribed grazing, silvicultural practices) were implemented and control watersheds managed using conventional practices (i.e., continuous grazing, natural forest revegetation). Flow-weighted samples, collected over a 2-year period using automated samplers, were analyzed for nitrate/nitrite nitrogen (NNN), total Kjeldahl nitrogen (TKN), total phosphorus (P), ortho-phosphate phosphorous (OP), total suspended solids (TSS), and Escherichia coli (E. coli). Comparison of silvicultural planting to conventional reforestation practices showed a significant decrease in NNN loads (p < 0.05) but no significant differences in TKN, P, OP, TSS, or E. coli. Continuously grazed sites yielded >24% more runoff than sites that were under prescribed grazing regimes, despite receiving less total rainfall. Likewise, NNN, TSS, and TKN loadings were significantly lower under prescribed grazing management than on conventionally grazed sites (p < 0.05). Data suggests that grazing BMPs can be an effective tool for rapidly improving water quality. However, silvicultural BMPs require more time (i.e., >2 years) to establish and achieve detectable improvements.more » « less
 An official website of the United States government
An official website of the United States government 
				
			 
					 
					
 
                                    