Abstract MotivationTraits are increasingly being used to quantify global biodiversity patterns, with trait databases growing in size and number, across diverse taxa. Despite growing interest in a trait‐based approach to the biodiversity of the deep sea, where the impacts of human activities (including seabed mining) accelerate, there is no single repository for species traits for deep‐sea chemosynthesis‐based ecosystems, including hydrothermal vents. Using an international, collaborative approach, we have compiled the first global‐scale trait database for deep‐sea hydrothermal‐vent fauna – sFDvent (sDiv‐funded trait database for theFunctionalDiversity ofvents). We formed a funded working group to select traits appropriate to: (a) capture the performance of vent species and their influence on ecosystem processes, and (b) compare trait‐based diversity in different ecosystems. Forty contributors, representing expertise across most known hydrothermal‐vent systems and taxa, scored species traits using online collaborative tools and shared workspaces. Here, we characterise the sFDvent database, describe our approach, and evaluate its scope. Finally, we compare the sFDvent database to similar databases from shallow‐marine and terrestrial ecosystems to highlight how the sFDvent database can inform cross‐ecosystem comparisons. We also make the sFDvent database publicly available online by assigning a persistent, unique DOI. Main types of variable containedSix hundred and forty‐six vent species names, associated location information (33 regions), and scores for 13 traits (in categories: community structure, generalist/specialist, geographic distribution, habitat use, life history, mobility, species associations, symbiont, and trophic structure). Contributor IDs, certainty scores, and references are also provided. Spatial location and grainGlobal coverage (grain size: ocean basin), spanning eight ocean basins, including vents on 12 mid‐ocean ridges and 6 back‐arc spreading centres. Time period and grainsFDvent includes information on deep‐sea vent species, and associated taxonomic updates, since they were first discovered in 1977. Time is not recorded. The database will be updated every 5 years. Major taxa and level of measurementDeep‐sea hydrothermal‐vent fauna with species‐level identification present or in progress. Software format.csv and MS Excel (.xlsx).
more »
« less
A review of the heterogeneous landscape of biodiversity databases: Opportunities and challenges for a synthesized biodiversity knowledge base
Abstract AimAddressing global environmental challenges requires access to biodiversity data across wide spatial, temporal and taxonomic scales. Availability of such data has increased exponentially recently with the proliferation of biodiversity databases. However, heterogeneous coverage, protocols, and standards have hampered integration among these databases. To stimulate the next stage of data integration, here we present a synthesis of major databases, and investigate (a) how the coverage of databases varies across taxonomy, space, and record type; (b) what degree of integration is present among databases; (c) how integration of databases can increase biodiversity knowledge; and (d) the barriers to database integration. LocationGlobal. Time periodContemporary. Major taxa studiedPlants and vertebrates. MethodsWe reviewed 12 established biodiversity databases that mainly focus on geographic distributions and functional traits at global scale. We synthesized information from these databases to assess the status of their integration and major knowledge gaps and barriers to full integration. We estimated how improved integration can increase the data coverage for terrestrial plants and vertebrates. ResultsEvery database reviewed had a unique focus of data coverage. Exchanges of biodiversity information were common among databases, although not always clearly documented. Functional trait databases were more isolated than those pertaining to species distributions. Variation and potential incompatibility of taxonomic systems used by different databases posed a major barrier to data integration. We found that integration of distribution databases could lead to increased taxonomic coverage that corresponds to 23 years’ advancement in data accumulation, and improvement in taxonomic coverage could be as high as 22.4% for trait databases. Main conclusionsRapid increases in biodiversity knowledge can be achieved through the integration of databases, providing the data necessary to address critical environmental challenges. Full integration across databases will require tackling the major impediments to data integration: taxonomic incompatibility, lags in data exchange, barriers to effective data synchronization, and isolation of individual initiatives.
more »
« less
- PAR ID:
- 10444024
- Author(s) / Creator(s):
- ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; more »
- Publisher / Repository:
- Wiley-Blackwell
- Date Published:
- Journal Name:
- Global Ecology and Biogeography
- Volume:
- 31
- Issue:
- 7
- ISSN:
- 1466-822X
- Page Range / eLocation ID:
- p. 1242-1260
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract MotivationBiodiversity in many areas is rapidly declining because of global change. As such, there is an urgent need for new tools and strategies to help identify, monitor and conserve biodiversity hotspots. This is especially true for frugivores, species consuming fruit, because of their important role in seed dispersal and maintenance of forest structure and health. One way to identify these areas is by quantifying functional diversity, which measures the unique roles of species within a community and is valuable for conservation because of its relationship with ecosystem functioning. Unfortunately, the functional trait information required for these studies can be sparse for certain taxa and specific traits and difficult to harmonize across disparate data sources, especially in biodiversity hotspots. To help fill this need, we compiled Frugivoria, a trait database containing ecological, life‐history, morphological and geographical traits for mammals and birds exhibiting frugivory. Frugivoria encompasses species in contiguous moist montane forests and adjacent moist lowland forests of Central and South America—the latter specifically focusing on the Andean states. Compared with existing trait databases, Frugivoria harmonizes existing trait databases, adds new traits, extends traits originally only available for mammals to birds also and fills gaps in trait categories from other databases. Furthermore, we create a cross‐taxa subset of shared traits to aid in analysis of mammals and birds. In total, Frugivoria adds 8662 new trait values for mammals and 14,999 for birds and includes a total of 45,216 trait entries with only 11.37% being imputed. Frugivoria also contains an open workflow that harmonizes trait and taxonomic data from disparate sources and enables users to analyse traits in space. As such, this open‐access database, which aligns with FAIR data principles, fills a major knowledge gap, enabling more comprehensive trait‐based studies of species in this ecologically important region. Main Types of Variable ContainedEcological, life‐history, morphological and geographical traits. Spatial Location and GrainNeotropical countries (Mexico, Guatemala, Costa Rica, Panama, El Salvador, Belize, Nicaragua, Ecuador, Colombia, Peru, Bolivia, Argentina, Venezuela and Chile) with contiguous montane regions. Time Period and GrainIUCN spatial data: obtained February 2023, spanning range maps collated from 1998 to 2022. IUCN species data: obtained June 2019–September 2022. Newly included traits: span 1924 to 2023. Major Taxa and Level of MeasurementClasses Mammalia and Aves; 40,074 species‐level traits; 5142 imputed traits for 1733 species (mammals: 582; birds: 1147) and 16 sub‐species (mammals). Software Format.csv; R.more » « less
-
Summary Biodiversity knowledge gaps and biases persist across low-income tropical regions. Genetic data are essential for addressing these issues, supporting biodiversity research and conservation planning. To assess progress in wildlife genetic sampling within the Philippines, I evaluated the scope, representativeness, and growth of publicly available genetic data and research on endemic vertebrates from the 1990s through 2024. Results showed that 82.3% of the Philippines’ 769 endemic vertebrates have genetic data, although major disparities remain. Reptiles had the least complete coverage but exhibited the highest growth, with birds, mammals, and amphibians following in that order. Species confined to smaller biogeographic subregions, with narrow geographic ranges, or classified as threatened or lacking threat assessments were disproportionately underrepresented. Research output on reptiles increased markedly, while amphibian research lagged behind. Although the number of non-unique authors in wildlife genetics studies involving Philippine specimens has grown steeply, Filipino involvement remains low. These results highlight the uneven and non-random distribution of wildlife genetic knowledge within this global biodiversity hotspot. Moreover, the limited participation of Global South researchers underscores broader inequities in wildlife genomics. Closing these gaps and addressing biases creates a more equitable and representative genetic knowledge base and supports its integration into national conservation efforts aligned with global biodiversity commitments.more » « less
-
Abstract PremisePlant trait data are essential for quantifying biodiversity and function across Earth, but these data are challenging to acquire for large studies. Diverse strategies are needed, including the liberation of heritage data locked within specialist literature such as floras and taxonomic monographs. Here we report FloraTraiter, a novel approach using rule‐based natural language processing (NLP) to parse computable trait data from biodiversity literature. MethodsFloraTraiter was implemented through collaborative work between programmers and botanical experts and customized for both online floras and scanned literature. We report a strategy spanning optical character recognition, recognition of taxa, iterative building of traits, and establishing linkages among all of these, as well as curational tools and code for turning these results into standard morphological matrices. ResultsOver 95% of treatment content was successfully parsed for traits with <1% error. Data for more than 700 taxa are reported, including a demonstration of common downstream uses. ConclusionsWe identify strategies, applications, tips, and challenges that we hope will facilitate future similar efforts to produce large open‐source trait data sets for broad community reuse. Largely automated tools like FloraTraiter will be an important addition to the toolkit for assembling trait data at scale.more » « less
-
ABSTRACT MotivationFreshwater ecosystems have been heavily impacted by land‐use changes, but data syntheses on these impacts are still limited. Here, we compiled a global database encompassing 241 studies with species abundance data (from multiple biological groups and geographic locations) across sites with different land‐use categories. This compilation will be useful for addressing questions regarding land‐use change and its impact on freshwater biodiversity. Main Types of Variables ContainedThe database includes metadata of each study, sites location, sample methods, sample time, land‐use category and abundance of each taxon. Spatial Location and GrainThe database contains data from across the globe, with 85% of the sites having well‐defined geographical coordinates. Major Taxa and Level of MeasurementThe database covers all major freshwater biological groups including algae, macrophytes, zooplankton, macroinvertebrates, fish and amphibians.more » « less
An official website of the United States government
