Environmental conditions are dynamic, and plants respond to those dynamics on multiple time scales. Disequilibrium occurs when a response occurs more slowly than the driving environmental changes. We review evidence regarding disequilibrium in plant distributions, including their responses to paleoclimate changes, recent climate change and new species introductions. There is strong evidence that plant species distributions are often in some disequilibrium with their environmental conditions.This disequilibrium poses a challenge when projecting future species distributions using species distribution models (SDMs). Classically, SDMs assume that the set of species occurrences is an unbiased sample of the suitable environmental conditions. However, a species in disequilibrium with the environment may have higher‐than‐expected occurrence probabilities (e.g. due to extinction debts) or lower‐than‐expected occurrence probabilities (e.g. due to dispersal limitation) in different areas. If unaccounted for, this will lead to biased estimates of the environmental suitability.We review methods for avoiding such biases in SDMs, ranging from simple thinning of the occurrence dataset to complex dynamic and process‐based models. Such models require large data inputs, natural history knowledge and technical expertise, so implementing them can be challenging. Despite this, we advocate for their increased use, since process‐based models provide the best potential to account for biases in model training data and to then represent the dynamics of species occupancy as ranges shift.Synthesis. Occurrence records for a species are often in disequilibrium with climate. SDMs trained on such data will produce biased estimates of a species' niche unless this disequilibrium is addressed in the modelling. A range of tools, spanning a wide gradient of complexity and realism, can resolve this bias.
more »
« less
Thinning occurrence points does not improve species distribution model performance
Abstract Spatial biases are an intrinsic feature of occurrence data used in species distribution models (SDMs). Thinning species occurrences, where records close in the geographic or environmental space are removed from the modeling procedure, is an approach often used to address these biases. However, thinning occurrence data can also negatively affect SDM performance, given that the benefits of removing spatial biases might be outweighed by the detrimental effects of data loss caused by this approach. We used real and virtual species to evaluate how spatial and environmental thinning affected different performance metrics of four SDM methods. The occurrence data of virtual species were sampled randomly, evenly spaced, and clustered in the geographic space to simulate different types of spatial biases, and several spatial and environmental thinning distances were used to thin the occurrence data. Null datasets were also generated for each thinning distance where we randomly removed the same number of occurrences by a thinning distance and compared the results of the thinned and null datasets. We found that spatially or environmentally thinned occurrence data is no better than randomly removing them, given that thinned datasets performed similarly to null datasets. Specifically, spatial and environmental thinning led to a general decrease in model performances across all SDM methods. These results were observed for real and virtual species, were positively associated with thinning distance, and were consistent across the different types of spatial biases. Our results suggest that thinning occurrence data usually fails to improve SDM performance and that the use of thinning approaches when modeling species distributions should be considered carefully.
more »
« less
- Award ID(s):
- 2213878
- PAR ID:
- 10480728
- Publisher / Repository:
- Wiley Blackwell (John Wiley & Sons)
- Date Published:
- Journal Name:
- Ecosphere
- Volume:
- 14
- Issue:
- 12
- ISSN:
- 2150-8925
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
ABSTRACT Species distribution modeling can be used to predict environmental suitability, and removing areas currently lacking appropriate vegetation can refine range estimates for conservation assessments. However, the uncertainty around geographic coordinates can exceed the fine resolution of remotely sensed habitat data. Here, we present a novel methodological approach to reflect this reality by processing habitat data to maintain its fine resolution, but with new values characterizing a larger surrounding area (the “neighborhood”). We implement its use for a forest‐dwelling species (Handleyomys chapmani) considered threatened by the IUCN. We determined deforestation tolerance threshold values by matching occurrence records with forest cover data using two methods: (1) extracting the exact pixel value where a record fell; and (2) using the neighborhood value (more likely to characterize conditions within the radius of actual sampling). We removed regions below these thresholds from the climatic suitability prediction, identifying areas of inferred habitat loss. We calculated Extent of Occurrence (EOO) and Area of Occupancy (AOO), two metrics used by the IUCN for threat level categorization. The values estimated here suggest removing the species from threatened categories. However, the results highlight spatial patterns of loss throughout the range not reflected in these metrics, illustrating drawbacks of EOO and showing how localized losses largely disappeared when resampling to the 2 × 2 km grid required for AOO. The neighborhood approach can be applied to various data sources (NDVI, soils, marine, etc.) to calculate trends over time and should prove useful to many terrestrial and aquatic species. It is particularly useful for species having high coordinate uncertainty in regions of low spatial autocorrelation (where small georeferencing errors can lead to great differences in habitat, misguiding conservation assessments used in policy decisions). More generally, this study illustrates and enhances the practicality of using habitat‐refined distribution maps for biogeography and conservation.more » « less
-
Species distribution and ecological niche models (hereafter SDMs) are popular tools with broad applications in ecology, biodiversity conservation, and environmental science. Many SDM applications require projecting models in environmental conditions non‐analog to those used for model training (extrapolation), giving predictions that may be statistically unsupported and biologically meaningless. We introduce a novel method, Shape, a model‐agnostic approach that calculates the extrapolation degree for a given projection data point by its multivariate distance to the nearest training data point. Such distances are relativized by a factor that reflects the dispersion of the training data in environmental space. Distinct from other approaches, Shape incorporates an adjustable threshold to control the binary discrimination between acceptable and unacceptable extrapolation degrees. We compared Shape's performance to five extrapolation metrics based on their ability to detect analog environmental conditions in environmental space and improve SDMs suitability predictions. To do so, we used 760 virtual species to define different modeling conditions determined by species niche tolerance, distribution equilibrium condition, sample size, and algorithm. All algorithms had trouble predicting species niches. However, we found a substantial improvement in model predictions when model projections were truncated independently of extrapolation metrics. Shape's performance was dependent on extrapolation threshold used to truncate models. Because of this versatility, our approach showed similar or better performance than the previous approaches and could better deal with all modeling conditions and algorithms. Our extrapolation metric is simple to interpret, captures the complex shapes of the data in environmental space, and can use any extrapolation threshold to define whether model predictions are retained based on the extrapolation degrees. These properties make this approach more broadly applicable than existing methods for creating and applying SDMs. We hope this method and accompanying tools support modelers to explore, detect, and reduce extrapolation errors to achieve more reliable models. Keywords: environmental novelty, extrapolation, Mahalanobis distance, model prediction, non‐analog environmental data, transferabilitymore » « less
-
Primary biodiversity data records that are open access and available in a standardised format are essential for conservation planning and research on policy-relevant time-scales. We created a dataset to document all known occurrence data for the Federally Endangered Poweshiek skipperling butterfly [ Oarismapoweshiek (Parker, 1870; Lepidoptera: Hesperiidae)]. The Poweshiek skipperling was a historically common species in prairie systems across the upper Midwest, United States and Manitoba, Canada. Rapid declines have reduced the number of verified extant sites to six. Aggregating and curating Poweshiek skipperling occurrence records documents and preserves all known distributional data, which can be used to address questions related to Poweshiek skipperling conservation, ecology and biogeography. Over 3500 occurrence records were aggregated over a temporal coverage from 1872 to present. Occurrence records were obtained from 37 data providers in the conservation and natural history collection community using both “HumanObservation” and “PreservedSpecimen” as an acceptable basisOfRecord. Data were obtained in different formats and with differing degrees of quality control. During the data aggregation and cleaning process, we transcribed specimen label data, georeferenced occurrences, adopted a controlled vocabulary, removed duplicates and standardised formatting. We examined the dataset for inconsistencies with known Poweshiek skipperling biogeography and phenology and we verified or removed inconsistencies by working with the original data providers. In total, 12 occurrence records were removed because we identified them to be the western congener Oarismagarita (Reakirt, 1866). This resulting dataset enhances the permanency of Poweshiek skipperling occurrence data in a standardised format. This is a validated and comprehensive dataset of occurrence records for the Poweshiek skipperling ( Oarismapoweshiek ) utilising both observation and specimen-based records. Occurrence data are preserved and available for continued research and conservation projects using standardised Darwin Core formatting where possible. Prior to this project, much of these occurrence records were not mobilised and were being stored in individual institutional databases, researcher datasets and personal records. This dataset aggregates presence data from state conservation agencies, natural heritage programmes, natural history collections, citizen scientists, researchers and the U.S. Fish & Wildlife Service. The data include opportunistic observations and collections, research vouchers, observations collected for population monitoring and observations collected using standardised research methodologies. The aggregated occurrence records underwent cleaning efforts that improved data interoperablitity, removed transcription errors and verified or removed uncertain data. This dataset enhances available information on the spatiotemporal distribution of this Federally Endangered species. As part of this aggregation process, we discovered and verified Poweshiek skipperling occurrence records from two previously unknown states, Nebraska and Ohio.more » « less
-
Climate change poses a threat to biodiversity, and it is unclear whether species can adapt to or tolerate new conditions, or migrate to areas with suitable habitats. Reconstructions of range shifts that occurred in response to environmental changes since the last glacial maximum (LGM) from species distribution models (SDMs) can provide useful data to inform conservation efforts. However, different SDM algorithms and climate reconstructions often produce contrasting patterns, and validation methods typically focus on accuracy in recreating current distributions, limiting their relevance for assessing predictions to the past or future. We modeled historically suitable habitat for the threatened North American tree green ashFraxinus pennsylvanicausing 24 SDMs built using two climate models, three calibration regions, and four modeling algorithms. We evaluated the SDMs using contemporary data with spatial block cross‐validation and compared the relative support for alternative models using a novel integrative method based on coupled demographic‐genetic simulations. We simulated genomic datasets using habitat suitability of each of the 24 SDMs in a spatially‐explicit model. Approximate Bayesian computation (ABC) was then used to evaluate the support for alternative SDMs through comparisons to an empirical population genomic dataset. Models had very similar performance when assessed with contemporary occurrences using spatial cross‐validation, but ABC model selection analyses consistently supported SDMs based on the CCSM climate model, an intermediate calibration extent, and the generalized linear modeling algorithm. Finally, we projected the future range of green ash under four climate change scenarios. Future projections using the SDMs selected via ABC suggest only minor shifts in suitable habitat for this species, while some of those that were rejected predicted dramatic changes. Our results highlight the different inferences that may result from the application of alternative distribution modeling algorithms and provide a novel approach for selecting among a set of competing SDMs with independent data.more » « less
An official website of the United States government
