skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Deep learning with citizen science data enables estimation of species diversity and composition at continental extents
Abstract Effective solutions to conserve biodiversity require accurate community‐ and species‐level information at relevant, actionable scales and across entire species' distributions. However, data and methodological constraints have limited our ability to provide such information in robust ways. Herein we employ a Deep‐Reasoning Network implementation of the Deep Multivariate Probit Model (DMVP‐DRNets), an end‐to‐end deep neural network framework, to exploit large observational and environmental data sets together and estimate landscape‐scale species diversity and composition at continental extents. We present results from a novel year‐round analysis of North American avifauna using data from over nine million eBird checklists and 72 environmental covariates. We highlight the utility of our information by identifying critical areas of high species diversity for a single group of conservation concern, the North American wood warblers, while capturing spatiotemporal variation in species' environmental associations and interspecific interactions. In so doing, we demonstrate the type of accurate, high‐resolution information on biodiversity that deep learning approaches such as DMVP‐DRNets can provide and that is needed to inform ecological research and conservation decision‐making at multiple scales.  more » « less
Award ID(s):
1939187
PAR ID:
10470576
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Ecology
Volume:
104
Issue:
12
ISSN:
0012-9658
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Abstract Biodiversity is rapidly changing due to changes in the climate and human related activities; thus, the accurate predictions of species composition and diversity are critical to developing conservation actions and management strategies. In this paper, using satellite remote sensing products as covariates, we constructed stacked species distribution models (S-SDMs) under a Bayesian framework to build next-generation biodiversity models. Model performance of these models was assessed using oak assemblages distributed across the continental United States obtained from the National Ecological Observatory Network (NEON). This study represents an attempt to evaluate the integrated predictions of biodiversity models—including assemblage diversity and composition—obtained by stacking next-generation SDMs. We found that applying constraints to assemblage predictions, such as using the probability ranking rule, does not improve biodiversity prediction models. Furthermore, we found that independent of the stacking procedure (bS-SDM versus pS-SDM versus cS-SDM), these kinds of next-generation biodiversity models do not accurately recover the observed species composition at the plot level or ecological-community scales (NEON plots are 400 m 2 ). However, these models do return reasonable predictions at macroecological scales, i.e., moderately to highly correct assignments of species identities at the scale of NEON sites (mean area ~ 27 km 2 ). Our results provide insights for advancing the accuracy of prediction of assemblage diversity and composition at different spatial scales globally. An important task for future studies is to evaluate the reliability of combining S-SDMs with direct detection of species using image spectroscopy to build a new generation of biodiversity models that accurately predict and monitor ecological assemblages through time and space. 
    more » « less
  2. Abstract Current understanding of ecological and evolutionary processes underlying island biodiversity is heavily shaped by empirical data from plants and birds, although arthropods comprise the overwhelming majority of known animal species, and as such can provide key insights into processes governing biodiversity. Novel high throughput sequencing (HTS) approaches are now emerging as powerful tools to overcome limitations in the availability of arthropod biodiversity data, and hence provide insights into these processes. Here, we explored how these tools might be most effectively exploited for comprehensive and comparable inventory and monitoring of insular arthropod biodiversity. We first reviewed the strengths, limitations and potential synergies among existing approaches of high throughput barcode sequencing. We considered how this could be complemented with deep learning approaches applied to image analysis to study arthropod biodiversity. We then explored how these approaches could be implemented within the framework of an island Genomic Observatories Network (iGON) for the advancement of fundamental and applied understanding of island biodiversity. To this end, we identified seven island biology themes at the interface of ecology, evolution and conservation biology, within which collective and harmonized efforts in HTS arthropod inventory could yield significant advances in island biodiversity research. 
    more » « less
  3. Abstract AimWe investigate geographic patterns across taxonomic, ecological and phylogenetic diversity to test for spatial (in)congruency and identify aggregate diversity hotspots in relationship to present land use and future climate. Simulating extinctions of imperilled species, we demonstrate where losses across diversity dimensions and geography are predicted. LocationNorth America. Time periodPresent day, future. Major taxa studiedRodentia. MethodsUsing geographic range maps for rodent species, we quantified spatial patterns for 11 dimensions of diversity: taxonomic (species, range weighted), ecological (body size, diet and habitat), phylogenetic (mean, variance, and nearest‐neighbour patristic distances, phylogenetic distance and genus‐to‐species ratio) and phyloendemism. We tested for correlations across dimensions and used spatial residual analyses to illustrate regions of pronounced diversity. We aggregated diversity hotspots in relationship to predictions of land‐use and climate change and recalculated metrics following extinctions of IUCN‐listed imperilled species. ResultsTopographically complex western North America hosts high diversity across multiple dimensions: phyloendemism and ecological diversity exceed predictions based on taxonomic richness, and phylogenetic variance patterns indicate steep gradients in phylogenetic turnover. An aggregate diversity hotspot emerges in the west, whereas spatial incongruence exists across diversity dimensions at the continental scale. Notably, phylogenetic metrics are uncorrelated with ecological diversity. Diversity hotspots overlap with land‐use and climate change, and extinctions predicted by IUCN status are unevenly distributed across space, phylogeny or ecological groups. Main conclusionsComparison of taxonomic, ecological and phylogenetic diversity patterns for North American rodents clearly shows the multifaceted nature of biodiversity. Testing for geographic patterns and (in)congruency across dimensions of diversity facilitates investigation into underlying ecological and evolutionary processes. The geographic scope of this analysis suggests that several explicit regional challenges face North American rodent fauna in the future. Simultaneous consideration of multi‐dimensional biodiversity allows us to assess what critical functions or evolutionary history we might lose with future extinctions and maximize the potential of our conservation efforts. 
    more » « less
  4. Abstract ContextShifts in climate and land use have dramatically reshaped ecosystems, impacting the distribution and status of wildlife populations. For many species, data gaps limit inference regarding population trends and links to environmental change. This deficiency hinders our ability to enact meaningful conservation measures to protect at risk species. ObjectivesWe investigated historical drivers of environmental niche change for three North American weasel species (American ermine, least weasel, and long-tailed weasel) to understand their response to environmental change. MethodsUsing species occurrence records and corresponding environmental data, we developed species-specific environmental niche models for the contiguous United States (1938–2021). We generated annual hindcasted predictions of the species’ environmental niche, assessing changes in distribution, area, and fragmentation in response to environmental change. ResultsWe identified a 54% decline in suitable habitat alongside high levels of fragmentation for least weasels and region-specific trends for American ermine and long-tailed weasels; declines in the West and increased suitability in the East. Climate and land use were important predictors of the environmental niche for all species. Changes in habitat amount and distribution reflected widespread land use changes over the past century while declines in southern and low-elevation areas are consistent with impacts from climatic change. ConclusionsOur models uncovered land use and climatic change as potential historic drivers of population change for North American weasels and provide a basis for management recommendations and targeted survey efforts. We identified potentially at-risk populations and a need for landscape-level planning to support weasel populations amid ongoing environmental changes. 
    more » « less
  5. Over 300 million arthropod specimens are housed in North American natural history collections. These collections represent a “vast hidden treasure trove” of biodiversity −95% of the specimen label data have yet to be transcribed for research, and less than 2% of the specimens have been imaged. Specimen labels contain crucial information to determine species distributions over time and are essential for understanding patterns of ecology and evolution, which will help assess the growing biodiversity crisis driven by global change impacts. Specimen images offer indispensable insight and data for analyses of traits, and ecological and phylogenetic patterns of biodiversity. Here, we review North American arthropod collections using two key metrics, specimen holdings and digitization efforts, to assess the potential for collections to provide needed biodiversity data. We include data from 223 arthropod collections in North America, with an emphasis on the United States. Our specific findings are as follows: (1) The majority of North American natural history collections (88%) and specimens (89%) are located in the United States. Canada has comparable holdings to the United States relative to its estimated biodiversity. Mexico has made the furthest progress in terms of digitization, but its specimen holdings should be increased to reflect the estimated higher Mexican arthropod diversity. The proportion of North American collections that has been digitized, and the number of digital records available per species, are both much lower for arthropods when compared to chordates and plants. (2) The National Science Foundation’s decade-long ADBC program (Advancing Digitization of Biological Collections) has been transformational in promoting arthropod digitization. However, even if this program became permanent, at current rates, by the year 2050 only 38% of the existing arthropod specimens would be digitized, and less than 1% would have associated digital images. (3) The number of specimens in collections has increased by approximately 1% per year over the past 30 years. We propose that this rate of increase is insufficient to provide enough data to address biodiversity research needs, and that arthropod collections should aim to triple their rate of new specimen acquisition. (4) The collections we surveyed in the United States vary broadly in a number of indicators. Collectively, there is depth and breadth, with smaller collections providing regional depth and larger collections providing greater global coverage. (5) Increased coordination across museums is needed for digitization efforts to target taxa for research and conservation goals and address long-term data needs. Two key recommendations emerge: collections should significantly increase both their specimen holdings and their digitization efforts to empower continental and global biodiversity data pipelines, and stimulate downstream research. 
    more » « less