skip to main content

Title: A data-driven method for estimating the composition of end-members from stream water chemistry time series
Abstract. End-member mixing analysis (EMMA) is a method of interpreting stream water chemistry variations and is widely used for chemical hydrograph separation. It is based on the assumption that stream water is a conservative mixture of varying contributions from well-characterized source solutions (end-members). These end-members are typically identified by collecting samples of potential end-member source waters from within the watershed and comparing these to the observations. Here we introduce a complementary data-driven method (convex hull end-member mixing analysis – CHEMMA) to infer the end-member compositions and their associated uncertainties from the stream water observations alone. The method involves two steps. The first uses convex hull nonnegative matrix factorization (CH-NMF) to infer possible end-member compositions by searching for a simplex that optimally encloses the stream water observations. The second step uses constrained K-means clustering (COP-KMEANS) to classify the results from repeated applications of CH-NMF and analyzes the uncertainty associated with the algorithm. In an example application utilizing the 1986 to 1988 Panola Mountain Research Watershed dataset, CHEMMA is able to robustly reproduce the three field-measured end-members found in previous research using only the stream water chemical observations. CHEMMA also suggests that a fourth and a fifth end-member can be (less robustly) identified. We examine uncertainties in more » end-member identification arising from non-uniqueness, which is related to the data structure, of the CH-NMF solutions, and from the number of samples using both real and synthetic data. The results suggest that the mixing space can be identified robustly when the dataset includes samples that contain extremely small contributions of one end-member, i.e., samples containing extremely large contributions from one end-member are not necessary but do reduce uncertainty about the end-member composition. « less
Authors:
;
Award ID(s):
1654194
Publication Date:
NSF-PAR ID:
10336617
Journal Name:
Hydrology and Earth System Sciences
Volume:
26
Issue:
8
Page Range or eLocation-ID:
1977 to 1991
ISSN:
1607-7938
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract. Climate models predict amplified warming at high elevations in low latitudes,making tropical glacierized regions some of the most vulnerable hydrologicalsystems in the world. Observations reveal decreasing streamflow due toretreating glaciers in the Andes, which hold 99% of all tropicalglaciers. However, the timescales over which meltwater contributes tostreamflow and the pathways it takes – surface and subsurface – remainuncertain, hindering our ability to predict how shrinking glaciers willimpact water resources. Two major contributors to this uncertainty are thesparsity of hydrologic measurements in tropical glacierized watersheds andthe complication of hydrograph separation where there is year-round glaciermelt. We address these challenges using a multi-method approach that employsrepeat hydrochemical mixing model analysis, hydroclimatic time seriesanalysis, and integrated watershed modeling. Each of these approachesinterrogates distinct timescale relationships among meltwater, groundwater,and stream discharge. Our results challenge the commonly held conceptualmodel that glaciers buffer discharge variability. Instead, in a subhumidwatershed on Volcán Chimborazo, Ecuador, glacier melt drives nearly allthe variability in discharge (Pearson correlation coefficient of 0.89 insimulations), with glaciers contributing a broad range of 20%–60%or wider of discharge, mostly (86%) through surface runoff on hourlytimescales, but also through infiltration that increases annual groundwatercontributions by nearly 20%. We further found thatmore »rainfall may enhanceglacier melt contributions to discharge at timescales that complement glaciermelt production, possibly explaining why minimum discharge occurred at thestudy site during warm but dry El Niño conditions, which typicallyheighten melt in the Andes. Our findings caution against extrapolations fromisolated measurements: stream discharge and glacier melt contributions intropical glacierized systems can change substantially at hourly tointerannual timescales, due to climatic variability and surface to subsurfaceflow processes.

    « less
  2. Streams and rivers are significant sources of nitrous oxide (N2O), carbon dioxide (CO2), and methane (CH4) globally, and watershed management can alter greenhouse gas (GHG) emissions from streams. We hypothesized that urban infrastructure significantly alters downstream water quality and contributes to variability in GHG saturation and emissions. We measured gas saturation and estimated emission rates in headwaters of two urban stream networks (Red Run and Dead Run) of the Baltimore Ecosystem Study Long-Term Ecological Research project. We identified four combinations of stormwater and sanitary infrastructure present in these watersheds, including: (1) stream burial, (2) inline stormwater wetlands, (3) riparian/floodplain preservation, and (4) septic systems. We selected two first-order catchments in each of these categories and measured GHG concentrations, emissions, and dissolved inorganic and organic carbon (DIC and DOC) and nutrient concentrations biweekly for 1 year. From a water quality perspective, the DOC : NO3 ratio of streamwater was significantly different across infrastructure categories. Multiple linear regressions including DOC : NO3 and other variables (dissolved oxygen, DO; total dissolved nitrogen, TDN; and temperature) explained much of the statistical variation in nitrous oxide (N2O, r2 =  0.78), carbon dioxide (CO2, r2 =  0.78), and methane (CH4, r2 =  0.50) saturation in stream water. We measured N2O saturation ratios, which were among the highest reported in the literaturemore »for streams, ranging from 1.1 to 47 across all sites and dates. N2O saturation ratios were highest in streams draining watersheds with septic systems and strongly correlated with TDN. The CO2 saturation ratio was highly correlated with the N2O saturation ratio across all sites and dates, and the CO2 saturation ratio ranged from 1.1 to 73. CH4 was always supersaturated, with saturation ratios ranging from 3.0 to 2157. Longitudinal surveys extending form headwaters to third-order outlets of Red Run and Dead Run took place in spring and fall. Linear regressions of these data yielded significant negative relationships between each gas with increasing watershed size as well as consistent relationships between solutes (TDN or DOC, and DOC : TDN ratio) and gas saturation. Despite a decline in gas saturation between the headwaters and stream outlet, streams remained saturated with GHGs throughout the drainage network, suggesting that urban streams are continuous sources of CO2, CH4, and N2O. Our results suggest that infrastructure decisions can have significant effects on downstream water quality and greenhouse gases, and watershed management strategies may need to consider coupled impacts on urban water and air quality.« less
  3. Abstract
    This dataset includes rainfall, cloud, river and stream hydro-chemistry of the Plynlimon research catchments. The data is from weekly monitoring of stream hydrochemistry of the River Hafren (Severn) at both the Lower and Upper Hafren site from 1998, stream hydrochemistry of the River Hore at the Lower Hore site from 1983 and Upper Hore site from 1984 as well as rainfall hydrochemistry near the Carreg Wen meteorological site from 1983 and cloud hydrochemistry near the Carreg Wen meteorological site from 1990. Data for over 50 chemical determinands are presented alongside data for some in-situ measurements such as water temperature. Full descriptions of the analytical methods used for each determinand is included. The Plynlimon research catchments lie within the headwaters of the River Severn and the River Wye in the uplands of mid-Wales. Intensive and long-term monitoring within the catchments underpins a wealth of hydrological and hydro-chemical research; other linked datasets include river flow, meteorology and a variety of detailed spatial datasets representing the topography, soils and rivers of the catchments. Monitoring is funded by the Centre for Ecology & Hydrology, and is ongoing since 1968.
    Methods
    Originally designed to improve understanding of water use by coniferous forests, monitoring withinMore>>
  4. Abstract Many scientists use coronal hole (CH) detections to infer open magnetic flux. Detection techniques differ in the areas that they assign as open, and may obtain different values for the open magnetic flux. We characterize the uncertainties of these methods, by applying six different detection methods to deduce the area and open flux of a near-disk center CH observed on 2010 September 19, and applying a single method to five different EUV filtergrams for this CH. Open flux was calculated using five different magnetic maps. The standard deviation (interpreted as the uncertainty) in the open flux estimate for this CH ≈ 26%. However, including the variability of different magnetic data sources, this uncertainty almost doubles to 45%. We use two of the methods to characterize the area and open flux for all CHs in this time period. We find that the open flux is greatly underestimated compared to values inferred from in situ measurements (by 2.2–4 times). We also test our detection techniques on simulated emission images from a thermodynamic MHD model of the solar corona. We find that the methods overestimate the area and open flux in the simulated CH, but the average error in the flux ismore »only about 7%. The full-Sun detections on the simulated corona underestimate the model open flux, but by factors well below what is needed to account for the missing flux in the observations. Under-detection of open flux in coronal holes likely contributes to the recognized deficit in solar open flux, but is unlikely to resolve it.« less
  5. Abstract

    The Arctic is warming at twice the rate of the global mean. This warming could further stimulate methane (CH4) emissions from northern wetlands and enhance the greenhouse impact of this region. Arctic wetlands are extremely heterogeneous in terms of geochemistry, vegetation, microtopography, and hydrology, and therefore CH4fluxes can differ dramatically within the metre scale. Eddy covariance (EC) is one of the most useful methods for estimating CH4fluxes in remote areas over long periods of time. However, when the areas sampled by these EC towers (i.e. tower footprints) are by definition very heterogeneous, due to encompassing a variety of environmental conditions and vegetation types, modelling environmental controls of CH4emissions becomes even more challenging, confounding efforts to reduce uncertainty in baseline CH4emissions from these landscapes. In this study, we evaluated the effect of footprint variability on CH4fluxes from two EC towers located in wetlands on the North Slope of Alaska. The local domain of each of these sites contains well developed polygonal tundra as well as a drained thermokarst lake basin. We found that the spatiotemporal variability of the footprint, has a significant influence on the observed CH4fluxes, contributing between 3% and 33% of the variance, depending on site, time period,more »and modelling method. Multiple indices were used to define spatial heterogeneity, and their explanatory power varied depending on site and season. Overall, the normalised difference water index had the most consistent explanatory power on CH4fluxes, though generally only when used in concert with at least one other spatial index. The spatial bias (defined here as the difference between the mean for the 0.36 km2domain around the tower and the footprint-weighted mean) was between ∣51∣% and ∣18∣% depending on the index. This study highlights the need for footprint modelling to infer the representativeness of the carbon fluxes measured by EC towers in these highly heterogeneous tundra ecosystems, and the need to evaluate spatial variability when upscaling EC site-level data to a larger domain.

    « less