Abstract Natural history collections (NHC) provide a wealth of information that can be used to understand the impacts of global change on biodiversity. As such, there is growing interest in using NHC data to estimate changes in species' distributions and abundance trends over historic time horizons when contemporary survey data are limited or unavailable.However, museum specimens were not collected with the purpose of estimating population trends and thus can exhibit spatiotemporal and collector‐specific biases that can impose severe limitations to using NHC data for evaluating population trajectories.Here we review the challenges associated with using museum records to track long‐term insect population trends, including spatiotemporal biases in sampling effort and sparse temporal coverage within and across years. We highlight recent methodological advancements that aim to overcome these challenges and discuss emerging research opportunities.Specifically, we examine the potential of integrating museum records and other contemporary data sources (e.g. collected via structured, designed surveys and opportunistic citizen science programs) in a unified analytical framework that accounts for the sampling biases associated with each data source. The emerging field of integrated modelling provides a promising framework for leveraging the wealth of collections data to accurately estimate long‐term trends of insect populations and identify cases where that is not possible using existing data sources.
more »
« less
This content will become publicly available on June 19, 2026
The collector practices that shape spatial, temporal, and taxonomic bias in herbaria
Summary Natural history collections (NHCs) are essential for studying biodiversity. Although spatial, temporal, and taxonomic biases in NHCs affect analyses, the influence of collector practices on biases remains largely unexplored.We utilized one million digitized specimens collected in the northeastern United States byc.10 000 collectors to investigate how collector practices shape spatial, temporal, and taxonomic biases in NHCs; and similarities and differences between practices of more‐ and less‐prolific collectors.We identified six common collector practices, or collection norms: collectors generally collected different species, from multiple locations, from sites sampled by others, during the principal growing season, species identifiable outside peak collecting months, and species from species‐poor families and genera. Some norms changed over decades, with different taxa favored during different periods. Collection norms have increased taxonomic coverage in NHCs; however, collectors typically avoided large, taxonomically complex groups, causing their underrepresentation in NHCs. Less‐prolific collectors greatly enhanced coverage by collecting during more months and from less‐sampled locations.We assert that overall collection biases are shaped by shared predictable collection norms rather than random practices of individual collectors. Predictable biases offer an opportunity to more effectively address biases in future biodiversity models.
more »
« less
- Award ID(s):
- 2101884
- PAR ID:
- 10643700
- Publisher / Repository:
- Wiley-Blackwell
- Date Published:
- Journal Name:
- New Phytologist
- ISSN:
- 0028-646X
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Although savanna woody encroachment has become a global phenomenon, relatively little is known about its effects on multiple dimensions and levels of savanna biodiversity.Using a combination of field surveys, a species‐level phylogeny, and functional metrics drawn from a morphological dataset, we evaluated how the progressive increase in tree cover in a fire‐suppressed savanna landscape affects the taxonomic, functional, and phylogenetic diversity of neotropical ant communities, at both the alpha and beta levels. Ants were sampled along an extensive tree cover gradient, ranging from open savannas to forests established in former savanna areas.Variation in tree cover had a significant influence on all facets of diversity at the beta level, whereas at the alpha level tree cover variation affected the taxonomic and functional but not the phylogenetic diversity of the ant communities.In general, ant community responses to variation in tree cover were largely non‐linear as differences in taxonomic alpha diversity and in the taxonomic, functional, and phylogenetic composition of the sampled communities were often much stronger at the savanna/forest transition than at any other part of the gradient. This indicates that savanna ant communities switch rapidly to an alternative state once the savanna turns into forest.Ant communities in the newly formed forest areas lacked many of the species typical of the savanna habitats, suggesting that the maintenance of a fire suppression policy is likely to result in a decrease in ant diversity and in the homogenisation of the ant fauna at the landscape scale.more » « less
-
Abstract Environmental and anthropogenic factors affect the population dynamics of migratory species throughout their annual cycles. However, identifying the spatiotemporal drivers of migratory species' abundances is difficult because of extensive gaps in monitoring data. The collection of unstructured opportunistic data by volunteer (citizen science) networks provides a solution to address data gaps for locations and time periods during which structured, design‐based data are difficult or impossible to collect.To estimate population abundance and distribution at broad spatiotemporal extents, we developed an integrated model that incorporates unstructured data during time periods and spatial locations when structured data are unavailable. We validated our approach through simulations and then applied the framework to the eastern North American migratory population of monarch butterflies during their spring breeding period in eastern Texas. Spring climate conditions have been identified as a key driver of monarch population sizes during subsequent summer and winter periods. However, low monarch densities during the spring combined with very few design‐based surveys in the region have limited the ability to isolate effects of spring weather variables on monarchs.Simulation results confirmed the ability of our integrated model to accurately and precisely estimate abundance indices and the effects of covariates during locations and time periods in which structured sampling are lacking. In our case study, we combined opportunistic monarch observations during the spring migration and breeding period with structured data from the summer Midwestern breeding grounds. Our model revealed a nonstationary relationship between weather conditions and local monarch abundance during the spring, driven by spatially varying vegetation and temperature conditions.Data for widespread and migratory species are often fragmented across multiple monitoring programs, potentially requiring the use of both structured and unstructured data sources to obtain complete geographic coverage. Our integrated model can estimate population abundance at broad spatiotemporal extents despite structured data gaps during the annual cycle by leveraging opportunistic data.more » « less
-
Abstract Compared to non‐urban environments, cities host ecological communities with altered taxonomic diversity and functional trait composition. However, we know little about how these urban changes take shape over time. Using historical bee (Apoidea: Anthophila) museum specimens supplemented with online repositories and researcher collections, we investigated whether bee species richness tracked urban and human population growth over the past 118 years. We also determined which species were no longer collected, whether those species shared certain traits, and if collector behavior changed over time. We focused on Wake County, North Carolina, United States where human population size has increased over 16 times over the last century along with the urban area within its largest city, Raleigh, which has increased over four times. We estimated bee species richness with occupancy models, and rarefaction and extrapolation curves to account for imperfect detection and sample coverage. To determine if bee traits correlated with when species were collected, we compiled information on native status, nesting habits, diet breadth, and sociality. We used non‐metric multidimensional scaling to determine if individual collectors contributed different bee assemblages over time. In total, there were 328 species collected in Wake County. We found that although bee species richness varied, there was no clear trend in bee species richness over time. However, recent collections (since 2003) were missing 195 species, and there was a shift in trait composition, particularly lost species were below‐ground nesters. The top collectors in the dataset differed in how often they collected bee species, but this was not consistent between historic and contemporary time periods; some contemporary collectors grouped closer together than others, potentially due to focusing on urban habitats. Use of historical collections and complimentary analyses can fill knowledge gaps to help understand temporal patterns of species richness in taxonomic groups that may not have planned long‐term data.more » « less
-
Abstract Contributory science—including citizen and community science—allows scientists to leverage participant‐generated data while providing an opportunity for engaging with local community members. Data yielded by participant‐generated biodiversity platforms allow professional scientists to answer ecological and evolutionary questions across both geographic and temporal scales, which is incredibly valuable for conservation efforts.The data reported to contributory biodiversity platforms, such as eBird and iNaturalist, can be driven by social and ecological variables, leading to biased data. Though empirical work has highlighted the biases in contributory data, little work has articulated how biases arise in contributory data and the societal consequences of these biases.We present a conceptual framework illustrating how social and ecological variables create bias in contributory science data. In this framework, we present four filters—participation,detectability,samplingandpreference—that ultimately shape the type and location of contributory biodiversity data. We leverage this framework to examine data from the largest contributory science platforms—eBird and iNaturalist—in St. Louis, Missouri, the United States, and discuss the potential consequences of biased data.Lastly, we conclude by providing several recommendations for researchers and institutions to move towards a more inclusive field. With these recommendations, we provide opportunities to ameliorate biases in contributory data and an opportunity to practice equitable biodiversity conservation. Read the freePlain Language Summaryfor this article on the Journal blog.more » « less
An official website of the United States government
