Title: Optimal model complexity for terrestrial carbon cycle prediction
Abstract. The terrestrial carbon cycle plays a critical role in modulating the interactions of climate with the Earth system, but different models often make vastly different predictions of its behavior. Efforts to reduce model uncertainty have commonly focused on model structure, namely by introducing additional processes and increasing structural complexity. However, the extent to which increased structural complexity can directly improve predictive skill is unclear. While adding processes may improve realism, the resulting models are often encumbered by a greater number of poorly determined or over-generalized parameters. To guide efficient model development, here we map the theoretical relationship between model complexity and predictive skill. To do so, we developed 16 structurally distinct carbon cycle models spanning an axis of complexity and incorporated them into a model–data fusion system. We calibrated each model at six globally distributed eddy covariance sites with long observation time series and under 42 data scenarios that resulted in different degrees of parameter uncertainty. For each combination of site, data scenario, and model, we then predicted net ecosystem exchange (NEE) and leaf area index (LAI) for validation against independent local site data. Though the maximum model complexity we evaluated is lower than most traditional terrestrial biosphere models, the complexity range we explored provides universal insight into the inter-relationship between structural uncertainty, parametric uncertainty, and model forecast skill. Specifically, increased complexity only improves forecast skill if parameters are adequately informed (e.g., when NEE observations are used for calibration). Otherwise, increased complexity can degrade skill and an intermediate-complexity model is optimal. This finding remains consistent regardless of whether NEE or LAI is predicted. 
Our COMPLexity EXperiment (COMPLEX) highlights the importance of robust observation-based parameterization for land surface modeling and suggests that data characterizing net carbon fluxes will be key to improving decadal predictions of high-dimensional terrestrial biosphere models.
Award ID(s):
1942133
NSF-PAR ID:
10314190
Journal Name:
Biogeosciences
Volume:
18
Issue:
8
ISSN:
1726-4189
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Climate change is having significant impacts on Earth’s ecosystems and carbon budgets, and in the Arctic may drive a shift from a historic carbon sink to a source. Large uncertainties in terrestrial biosphere models (TBMs) used to forecast Arctic changes demonstrate the challenges of determining the timing and extent of this possible switch. This spread in model predictions can limit the ability of TBMs to guide management and policy decisions. One of the most influential sources of model uncertainty is model parameterization. Parameter uncertainty results in part from a mismatch between available data in databases and model needs. We identify that mismatch for three TBMs, DVM-DOS-TEM, SIPNET and ED2, and four databases with information on Arctic and boreal above- and belowground traits that may be applied to model parameterization. However, focusing solely on such data gaps can introduce biases towards simple models and ignores structural model uncertainty, another main source of model uncertainty. Therefore, we develop a causal loop diagram (CLD) of the Arctic and boreal ecosystem that includes unquantified, and thus unmodeled, processes. We map model parameters to processes in the CLD and assess parameter vulnerability via the internal network structure. One important substructure, the feed forward loop (FFL), describes processes that are linked both directly and indirectly. When the model parameters are data-informed, these indirect processes might be implicitly included in the model, but if not, they have the potential to introduce significant model uncertainty. We find that the parameters describing the impact of local temperature on microbial activity are associated with a particularly high number of FFLs but are not constrained well by existing data. By employing ecological models of varying complexity, databases, and network methods, we identify the key parameters responsible for limited model accuracy. They should be prioritized for future data sampling to reduce model uncertainty.

     
  2. As the Arctic region moves into uncharted territory under a warming climate, it is important to refine the terrestrial biosphere models (TBMs) that help us understand and predict change. One fundamental uncertainty in TBMs relates to model parameters, configuration variables internal to the model whose value can be estimated from data. We incorporate a version of the Terrestrial Ecosystem Model (TEM) developed for arctic ecosystems into the Predictive Ecosystem Analyzer (PEcAn) framework. PEcAn treats model parameters as probability distributions, estimates parameters based on a synthesis of available field data, and then quantifies both model sensitivity and uncertainty to a given parameter or suite of parameters. We examined how variation in 21 parameters in the equation for gross primary production influenced model sensitivity and uncertainty in terms of two carbon fluxes (net primary productivity and heterotrophic respiration) and two carbon (C) pools (vegetation C and soil C). We set up different parameterizations of TEM across a range of tundra types (tussock tundra, heath tundra, wet sedge tundra, and shrub tundra) in northern Alaska, along a latitudinal transect extending from the coastal plain near Utqiaġvik to the southern foothills of the Brooks Range, to the Seward Peninsula. TEM was most sensitive to parameters related to the temperature regulation of photosynthesis. Model uncertainty was mostly due to parameters related to leaf area, temperature regulation of photosynthesis, and the stomatal responses to ambient light conditions. Our analysis also showed that sensitivity and uncertainty to a given parameter varied spatially. At some sites, model sensitivity and uncertainty tended to be connected to a wider range of parameters, underlining the importance of assessing tundra community processes across environmental gradients or geographic locations. 
Generally, across sites, the flux of net primary productivity (NPP) and pool of vegetation C had about equal uncertainty, while heterotrophic respiration had higher uncertainty than the pool of soil C. Our study illustrates the complexity inherent in evaluating parameter uncertainty across highly heterogeneous arctic tundra plant communities. It also provides a framework for iteratively testing how newly collected field data related to key parameters may result in more effective forecasting of Arctic change. 
  3. Abstract

    Robust ecological forecasting of tree growth under future climate conditions is critical to anticipate future forest carbon storage and flux. Here, we apply three ingredients of ecological forecasting that are key to improving forecast skill: data fusion, confronting model predictions with new data, and partitioning forecast uncertainty. Specifically, we present the first fusion of tree‐ring and forest inventory data within a Bayesian state‐space model at a multi‐site, regional scale, focusing on Pinus ponderosa var. brachyptera in the southwestern US. Leveraging the complementarity of these two data sources, we parsed the ecological complexity of tree growth into the effects of climate, tree size, stand density, site quality, and their interactions, and quantified uncertainties associated with these effects. New measurements of trees, an ongoing process in forest inventories, were used to confront forecasts of tree diameter with observations, and evaluate alternative tree growth models. We forecasted tree diameter and increment in response to an ensemble of climate change projections, and separated forecast uncertainty into four different causes: initial conditions, parameters, climate drivers, and process error. We found a strong negative effect of fall–spring maximum temperature, and a positive effect of water‐year precipitation on tree growth. Furthermore, tree vulnerability to climate stress increases with greater competition, with tree size, and at poor sites. Under future climate scenarios, we forecast increment declines of 22%–117%, while the combined effect of climate and size‐related trends results in a 56%–91% decline. Partitioning of forecast uncertainty showed that diameter forecast uncertainty is primarily caused by parameter and initial conditions uncertainty, but increment forecast uncertainty is mostly caused by process error and climate driver uncertainty. This fusion of tree‐ring and forest inventory data lays the foundation for robust ecological forecasting of aboveground biomass and carbon accounting at tree, plot, and regional scales, including iterative improvement of model skill.

     
  4. Abstract

    Terrestrial biosphere models can help identify physical processes that control carbon dynamics, including land‐atmosphere CO2 fluxes, and have the potential to project the terrestrial ecosystem response to changing climate. It is important to identify ecosystem processes most responsible for model predictive uncertainty and design improved model representation and observational system studies to reduce that uncertainty. Here we identified model parameters that contribute the most uncertainty to long‐term (~100 years) projections of net ecosystem exchange, net primary production, and aboveground biomass within a mechanistic terrestrial biosphere model, Ecosystem Demography version 2.1 (ED2). An uncertainty analysis identified parameters that represent the quantum efficiency of light to photosynthetic conversion, leaf respiration and soil‐plant water transfer as the highest contributors to model uncertainty regardless of time frame (annual, decadal, and centennial) and output (e.g., net ecosystem exchange, net primary production, aboveground biomass). Contrary to expectations, the contribution of successional processes related to reproduction, competition, and mortality did not increase as the time scale increased. These findings suggest that uncertainty in the parameters governing short‐term ecosystem processes remains the most significant bottleneck to reducing predictive uncertainty. Key actions to reduce parameter uncertainty include more leaf‐level trait measurements across multiple sites for quantum efficiency and leaf respiration rate. Further, the empirical representation of soil‐plant water transfer should be replaced with a mechanistic, hydraulic representation of water flow, which can be constrained with direct measurements. This analysis focused on aboveground ecosystem processes. The impact of belowground carbon cycling, initial conditions, and meteorological forcing should be addressed in future studies.

     
  5. Abstract

    Secondary forest regrowth shapes community succession and biogeochemistry for decades, including in the Upper Great Lakes region. Vegetation models encapsulate our understanding of forest function, and whether models can reproduce multi‐decadal succession patterns is an indication of our ability to predict forest responses to future change. We test the ability of a vegetation model to simulate C cycling and community composition during 100 years of forest regrowth following stand‐replacing disturbance, asking (a) Which processes and parameters are most important to accurately model Upper Midwest forest succession? (b) What is the relative importance of model structure versus parameter values to these predictions? We ran ensembles of the Ecosystem Demography model v2.2 with different representations of processes important to competition for light. We compared the magnitude of structural and parameter uncertainty and assessed which sub‐model–parameter combinations best reproduced observed C fluxes and community composition. On average, our simulations underestimated observed net primary productivity (NPP) and leaf area index (LAI) after 100 years and predicted complete dominance by a single plant functional type (PFT). Out of 4,000 simulations, only nine fell within the observed range of both NPP and LAI, but these predicted unrealistically complete dominance by either early hardwood or pine PFTs. A different set of seven simulations were ecologically plausible but under‐predicted observed NPP and LAI. Parameter uncertainty was large; NPP and LAI ranged from ~0% to >200% of their mean value, and any PFT could become dominant. The two parameters that contributed most to uncertainty in predicted NPP were plant–soil water conductance and growth respiration, both unobservable empirical coefficients. 
We conclude that (a) parameter uncertainty is more important than structural uncertainty, at least for ED‐2.2 in Upper Midwest forests and (b) simulating both productivity and plant community composition accurately without physically unrealistic parameters remains challenging for demographic vegetation models.

     