skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Using neural network ensembles to separate ocean biogeochemical and physical drivers of phytoplankton biogeography in Earth system models
Abstract. Earth system models (ESMs) are useful tools forpredicting and understanding past and future aspects of the climate system.However, the biological and physical parameters used in ESMs can have widevariations in their estimates. Even small changes in these parameters canyield unexpected results without a clear explanation of how a particularoutcome was reached. The standard method for estimating ESM sensitivity isto compare spatiotemporal distributions of variables from different runs ofa single ESM. However, a potential pitfall of this method is that ESM outputcould match observational patterns because of compensating errors. Forexample, if a model predicts overly weak upwelling and low nutrientconcentrations, it might compensate for this by allowing phytoplankton tohave a high sensitivity to nutrients. Recently, we demonstrated that neuralnetwork ensembles (NNEs) are capable of extracting relationships betweenpredictor and target variables within ocean biogeochemical models. Beingable to view the relationships between variables, along with spatiotemporaldistributions, allows for a more mechanistically based examination of ESMoutputs. Here, we investigated whether we could apply NNEs to help usdetermine why different ESMs produce different spatiotemporal distributionsof phytoplankton biomass. We tested this using three cases. The first andsecond case used different runs of the same ESM, except that the physicalcirculations differed between them in the first case, while the biologicalequations differed between them in the second. Our results indicated thatthe NNEs were capable of extracting the relationships between variables fordifferent runs of a single ESM, allowing us to distinguish betweendifferences due to changes in circulation (which do not changerelationships) from changes in biogeochemical formulation (which do changerelationships). In the third case, we applied NNEs to two different ESMs.The results of the third case highlighted the capability of NNEs to contrastthe apparent relationships of different ESMs and some of the challenges itpresents. Although applied specifically to the ocean components of an ESM,our study demonstrates that Earth system modelers can use NNEs to separatethe contributions of different components of ESMs. Specifically, this allowsmodelers to compare the apparent relationships across different ESMs andobservational datasets.  more » « less
Award ID(s):
1756568
PAR ID:
10338102
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Geoscientific Model Development
Volume:
15
Issue:
4
ISSN:
1991-9603
Page Range / eLocation ID:
1595 to 1617
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract. A key challenge for biological oceanography is relating the physiologicalmechanisms controlling phytoplankton growth to the spatial distribution ofthose phytoplankton. Physiological mechanisms are often isolated by varyingone driver of growth, such as nutrient or light, in a controlled laboratorysetting producing what we call “intrinsic relationships”. We contrastthese with the “apparent relationships” which emerge in the environment inclimatological data. Although previous studies have found machine learning(ML) can find apparent relationships, there has yet to be a systematic studyexamining when and why these apparent relationships diverge from theunderlying intrinsic relationships found in the lab and how and why this may depend on the method applied. Here we conduct a proof-of-concept studywith three scenarios in which biomass is by construction a function oftime-averaged phytoplankton growth rate. In the first scenario, the inputsand outputs of the intrinsic and apparent relationships vary over thesame monthly timescales. In the second, the intrinsic relationships relateaverages of drivers that vary on hourly timescales to biomass, but theapparent relationships are sought between monthly averages of these inputsand monthly-averaged output. In the third scenario we apply ML to the outputof an actual Earth system model (ESM). Our results demonstrated that whenintrinsic and apparent relationships operate on the same spatial andtemporal timescale, neural network ensembles (NNEs) were able to extract theintrinsic relationships when only provided information about the apparentrelationships, while colimitation and its inability to extrapolate resulted in random forests (RFs) diverging from the true response. Whenintrinsic and apparent relationships operated on different timescales (aslittle separation as hourly versus daily), NNEs fed with apparentrelationships in time-averaged data produced responses with the right shapebut underestimated the biomass. This was because when the intrinsicrelationship was nonlinear, the response to a time-averaged input differedsystematically from the time-averaged response. Although the limitationsfound by NNEs were overestimated, they were able to produce more realisticshapes of the actual relationships compared to multiple linear regression.Additionally, NNEs were able to model the interactions between predictorsand their effects on biomass, allowing for a qualitative assessment of thecolimitation patterns and the nutrient causing the most limitation. Futureresearch may be able to use this type of analysis for observational datasetsand other ESMs to identify apparent relationships between biogeochemicalvariables (rather than spatiotemporal distributions only) and identifyinteractions and colimitations without having to perform (or at leastperforming fewer) growth experiments in a lab. From our study, it appearsthat ML can extract useful information from ESM output and could likely doso for observational datasets as well. 
    more » « less
  2. Abstract Phytoplankton stoichiometry modulates the interaction between carbon, nitrogen and phosphorus cycles. Environmentally driven variations in phytoplankton C:N:P can alter biogeochemical cycling compared to expectations under fixed ratios. In fact, the assumption of fixed C:N:P has been linked to Earth System Model (ESM) biases and potential misrepresentation of responses to future change. Here we integrate key elements of the Adaptive Trait Optimization Model (ATOM) for phytoplankton stoichiometry with the Carbon, Ocean Biogeochemistry and Lower Trophics (COBALT) ocean biogeochemical model. Within a series of global ocean‐ice‐ecosystem retrospective simulations, ATOM‐COBALT reproduced observations of phytoplankton N:P, and compared to static ratios, exhibited reduced phytoplankton P‐limitation, enhanced N‐fixation, and increased low‐latitude export, improving consistency with observations and highlighting the biogeochemical implications of dynamic N:P. We applied ATOM‐COBALT to explore the impacts of different physiological mechanisms hypothesized to underlie N:P variation, finding that two mechanisms together drove the observed patterns: proportionality of P‐rich ribosomes in phytoplankton cells to growth rates and reductions in P‐storage during scarcity. A third mechanism which linked temperature with phytoplankton biomass allocations to non‐ribosomal proteins, led only to relatively modest impacts because this mechanism decreased the temperature dependence of phytoplankton growth rates, compensating for changes in N:P. We find that there are quantitative response differences that associate distinctive biogeochemical footprints with each mechanism, which are most apparent in highly productive low‐latitude regions. These results suggest that variable phytoplankton N:P makes phytoplankton productivity and export resilient to environmental changes, and support further research on the physiological and environmental drivers of phytoplankton stoichiometry and biogeochemical role. 
    more » « less
  3. Surface ocean phosphate is commonly below the standard analytical detection limits, leading to an incomplete picture of the global variation and biogeochemical role of phosphate. A global compilation of phosphate measured using high-sensitivity methods revealed several previously unrecognized low-phosphate areas and clear regional differences. Both observational climatologies and Earth system models (ESMs) systematically overestimated surface phosphate. Furthermore, ESMs misrepresented the relationships between phosphate, phytoplankton biomass, and primary productivity. Atmospheric iron input and nitrogen fixation are known important controls on surface phosphate, but model simulations showed that differences in the iron-to-macronutrient ratio in the vertical nutrient supply and surface lateral transport are additional drivers of phosphate concentrations. Our study demonstrates the importance of accurately quantifying nutrients for understanding the regulation of ocean ecosystems and biogeochemistry now and under future climate conditions. 
    more » « less
  4. Abstract Internal climate variability plays an important role in the abundance and distribution of phytoplankton in the global ocean. Previous studies using large ensembles of Earth system models (ESMs) have demonstrated their utility in the study of marine phytoplankton variability. These ESM large ensembles simulate the evolution of multiple alternate realities, each with a different phasing of internal climate variability. However, ESMs may not accurately represent real world variability as recorded via satellite and in situ observations of ocean chlorophyll over the past few decades. Observational records of surface ocean chlorophyll equate to a single ensemble member in the large ensemble framework, and this can cloud the interpretation of long‐term trends: are they externally forced, caused by the phasing of internal variability, or both? Here, we use a novel statistical emulation technique to place the observational record of surface ocean chlorophyll into the large ensemble framework. Much like a large initial condition ensemble generated with an ESM, the resulting synthetic ensemble represents multiple possible evolutions of ocean chlorophyll concentration, each with a different sampling of internal climate variability. We further demonstrate the validity of our statistical approach by recreating an ESM ensemble of chlorophyll using only a single ESM ensemble member. We use the synthetic ensemble to explore the interpretation of long‐term trends in the presence of internal variability and find a wider range of possible trends in chlorophyll due to the sampling of internal variability in subpolar regions than in subtropical regions. 
    more » « less
  5. Abstract Background subsurface vertical mixing rates in the Southern Ocean (SO) are known to vary by an order of magnitude temporally and spatially, due to variability in their generating mechanisms, which include winds and shear instabilities at the surface, and the interaction of tides and lee waves with rough bottom topography. There is great uncertainty in the parameterization of this mixing in coarse resolution Earth System Models (ESM), and in the impact that this has on SO biological productivity on sub decadal timescales. Using a data assimilating biogeochemical ocean model we show that SO phytoplankton productivity is highly sensitive to differences in background diapycnal mixing over short timescales. Changes in the background vertical mixing rates alter key biogeochemical and physical conditions. The greatest changes to the distribution of physical and biogeochemical tracers occur in regions with very strong tracer vertical gradients. A combination of reduced nutrient limitation and reduced light limitation causes a strong increase in SO phytoplankton productivity with higher background mixing. This leads to increased summer carbon export but reduced wintertime export over the mixed layer depth, which could alter the strength of the SO biological carbon pump and atmospheric concentrations on centennial to millennial timescales. This study demonstrates the importance of accurately representing diapycnal mixing in ESM to predict SO biogeochemical dynamics and their broader climatic implications. 
    more » « less