skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Sensitivity of Codispersion to Noise and Error in Ecological and Environmental Data
Understanding relationships among tree species, or between tree diversity, distribution, and underlying environmental gradients, is a central concern for forest ecologists, managers, and management agencies. The spatial processes underlying observed spatial patterns of trees or edaphic variables often are complex and violate two fundamental assumptions—isotropy and stationarity—of spatial statistics. Codispersion analysis is a new statistical method developed to assess spatial covariation between two spatial processes that may not be isotropic or stationary. Its application to data from large forest plots has provided new insights into mechanisms underlying observed patterns of species distributions and the relationship between individual species and underlying edaphic and topographic gradients. However, these data are not collected without error, and the performance of the codispersion coefficient when there is noise or measurement error (“contamination”) in the data heretofore has been addressed only theoretically. Here, we use Monte Carlo simulations and real datasets to investigate the sensitivity of codispersion to four types of contamination commonly seen in many forest datasets. Three of these involved comparing codispersion of a spatial dataset with a contaminated version of itself. The fourth examined differences in codispersion between tree species and soil variables, where the estimates of soil characteristics were based on complete or thinned datasets. In all cases, we found that estimates of codispersion were robust when contamination was relatively low (<15%), but were sensitive to larger percentages of contamination. We also present a useful method for imputing missing spatial data and discuss several aspects of the codispersion coefficient when applied to noisy data to gain more insight about the performance of codispersion in practice.  more » « less
Award ID(s):
1832210
PAR ID:
10154591
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
Forests
Volume:
9
Issue:
11
ISSN:
1999-4907
Page Range / eLocation ID:
679
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract AimAmazonia hosts more tree species from numerous evolutionary lineages, both young and ancient, than any other biogeographic region. Previous studies have shown that tree lineages colonized multiple edaphic environments and dispersed widely across Amazonia, leading to a hypothesis, which we test, that lineages should not be strongly associated with either geographic regions or edaphic forest types. LocationAmazonia. TaxonAngiosperms (Magnoliids; Monocots; Eudicots). MethodsData for the abundance of 5082 tree species in 1989 plots were combined with a mega‐phylogeny. We applied evolutionary ordination to assess how phylogenetic composition varies across Amazonia. We used variation partitioning and Moran's eigenvector maps (MEM) to test and quantify the separate and joint contributions of spatial and environmental variables to explain the phylogenetic composition of plots. We tested the indicator value of lineages for geographic regions and edaphic forest types and mapped associations onto the phylogeny. ResultsIn the terra firme and várzea forest types, the phylogenetic composition varies by geographic region, but the igapó and white‐sand forest types retain a unique evolutionary signature regardless of region. Overall, we find that soil chemistry, climate and topography explain 24% of the variation in phylogenetic composition, with 79% of that variation being spatially structured (R2 = 19% overall for combined spatial/environmental effects). The phylogenetic composition also shows substantial spatial patterns not related to the environmental variables we quantified (R2 = 28%). A greater number of lineages were significant indicators of geographic regions than forest types. Main ConclusionNumerous tree lineages, including some ancient ones (>66 Ma), show strong associations with geographic regions and edaphic forest types of Amazonia. This shows that specialization in specific edaphic environments has played a long‐standing role in the evolutionary assembly of Amazonian forests. Furthermore, many lineages, even those that have dispersed across Amazonia, dominate within a specific region, likely because of phylogenetically conserved niches for environmental conditions that are prevalent within regions. 
    more » « less
  2. Aims: Climate change is expected to shift climatic envelopes of temperate tree species into boreal forests where unsuitable soils may limit range expansion. We studied several edaphic thresholds (mycorrhizae, soil chemistry) that can limit seedling establishment of two major temperate tree species, sugar maple (arbuscular mycorrhizal, AM) and American beech (ectomycorrhizal, EM). Methods: We integrate two field surveys of tree seedling density, mycorrhizal colonization, and soil chemistry in montane forests of the Adirondack and Green Mountains (Mtns) in the northeastern United States. We conducted correlation and linear breakpoint analyses to detect soil abiotic and biotic thresholds in seedling distributions across edaphic gradients. Results: In the Green Mtns, sugar maple seedling importance (an index of species relative density and frequency, IV) declined sharply with low pH (< 3.74 in mineral soil) and low mycorrhizal colonization (< 27.5% root length colonized). Sugar maple importance was highly correlated with multiple aspects of soil chemistry, while beech was somewhat sensitive to pH only; beech mycorrhizal colonization did not differ across elevation. Mycorrhizal colonization of sugar maple was positively correlated with soil pH and conspecific overstory basal area. In the Adirondacks, sugar maple importance, but not beech, plateaued above thresholds in soil calcium (~ 2 meq/100 g) and magnesium (~ 0.3 meq/100 g). Conclusions: The establishment of sugar maple, but not beech, was impeded by both biotic and abiotic soil components in montane conifer forests and by soil acidity in temperate deciduous forests. These differences in species sensitivity to edaphic thresholds will likely affect species success and future shifts in forest composition. 
    more » « less
  3. null (Ed.)
    When forest conditions are mapped from empirical models, uncertainty in remotely sensed predictor variables can cause the systematic overestimation of low values, underestimation of high values, and suppression of variability. This regression dilution or attenuation bias is a well-recognized problem in remote sensing applications, with few practical solutions. Attenuation is of particular concern for applications that are responsive to prediction patterns at the high end of observed data ranges, where systematic error is typically greatest. We addressed attenuation bias in models of tree species relative abundance (percent of total aboveground live biomass) based on multitemporal Landsat and topoclimatic predictor data. We developed a multi-objective support vector regression (MOSVR) algorithm that simultaneously minimizes total prediction error and systematic error caused by attenuation bias. Applied to 13 tree species in the Acadian Forest Region of the northeastern U.S., MOSVR performed well compared to other prediction methods including single-objective SVR (SOSVR) minimizing total error, Random Forest (RF), gradient nearest neighbor (GNN), and Random Forest nearest neighbor (RFNN) algorithms. SOSVR and RF yielded the lowest total prediction error but produced the greatest systematic error, consistent with strong attenuation bias. Underestimation at high relative abundance caused strong deviations between predicted patterns of species dominance/codominance and those observed at field plots. In contrast, GNN and RFNN produced dominance/codominance patterns that deviated little from observed patterns, but predicted species relative abundance with lower accuracy and substantial systematic error. MOSVR produced the least systematic error for all species with total error often comparable to SOSVR or RF. Predicted patterns of dominance/codominance matched observations well, though not quite as well as GNN or RFNN. Overall, MOSVR provides an effective machine learning approach to the reduction of systematic prediction error and should be fully generalizable to other remote sensing applications and prediction problems. 
    more » « less
  4. The role of intraspecific trait variation in functional ecology has gained traction in recent years as many papers have observed its importance in driving community diversity and ecology. Yet much of the work in this field relies on field-based trait surveys. Here, we used continuous canopy trait information derived from remote sensing data of a highly polymorphic tree species, Metrosideros polymorpha, to quantify environmental controls on intraspecific trait variation. M. polymorpha, an endemic, keystone tree species in Hawai’i, varies morphologically, chemically, and genetically across broad elevation and soil substrate age gradients, making it an ideal model organism to explore large-scale environmental drivers of intraspecific trait variation. M. polymorpha canopy reflectance (visible to shortwave infrared; 380–2510 nm) and light detection and ranging (LiDAR) data collected by the Global Airborne Observatory were modeled to canopy trait estimates of leaf mass per area, chlorophyll a and b, carotenoids, total carbon, nitrogen, phosphorus, phenols, cellulose, and top of canopy height using previously developed leaf chemometric equations. We explored how these derived traits varied across environmental gradients by extracting elevation, slope, aspect, precipitation, and soil substrate age data at canopy locations. We then obtained the feature importance values of the environmental factors in predicting each leaf trait by training random forest models to predict leaf traits individually. Of these environmental factors, elevation was the most important predictor for all canopy traits. Elevation not only affected canopy traits directly but also indirectly by influencing the relationships between soil substrate age and canopy traits as well as between nitrogen and other traits, as indicated by the change in slope between the variables at different elevation ranges. In conclusion, intraspecific variation in M. polymorpha traits derived from remote sensing adheres to known leaf economic spectrum (LES) patterns as well as interspecific LES traits previously mapped using imaging spectroscopy. 
    more » « less
  5. Unraveling the mechanisms underlying the maintenance of species diversity is a central pursuit in ecology. It has been hypothesized that ectomycorrhizal (EcM) in contrast to arbuscular mycorrhizal fungi can reduce tree species diversity in local communities, which remains to be tested at the global scale. To address this gap, we analyzed global forest inventory data and revealed that the relationship between tree species richness and EcM tree proportion varied along environmental gradients. Specifically, the relationship is more negative at low latitudes and in moist conditions but is unimodal at high latitudes and in arid conditions. The negative association of EcM tree proportion on species diversity at low latitudes and in humid conditions is likely due to more negative plant-soil microbial interactions in these regions. These findings extend our knowledge on the mechanisms shaping global patterns in plant species diversity from a belowground view. 
    more » « less