skip to main content

Title: A review of the heterogeneous landscape of biodiversity databases: Opportunities and challenges for a synthesized biodiversity knowledge base
Abstract Aim

Addressing global environmental challenges requires access to biodiversity data across wide spatial, temporal and taxonomic scales. Availability of such data has increased exponentially recently with the proliferation of biodiversity databases. However, heterogeneous coverage, protocols, and standards have hampered integration among these databases. To stimulate the next stage of data integration, here we present a synthesis of major databases, and investigate (a) how the coverage of databases varies across taxonomy, space, and record type; (b) what degree of integration is present among databases; (c) how integration of databases can increase biodiversity knowledge; and (d) the barriers to database integration.



Time period


Major taxa studied

Plants and vertebrates.


We reviewed 12 established biodiversity databases that mainly focus on geographic distributions and functional traits at global scale. We synthesized information from these databases to assess the status of their integration and major knowledge gaps and barriers to full integration. We estimated how improved integration can increase the data coverage for terrestrial plants and vertebrates.


Every database reviewed had a unique focus of data coverage. Exchanges of biodiversity information were common among databases, although not always clearly documented. Functional trait databases were more isolated than those pertaining to species distributions. Variation and potential incompatibility of taxonomic systems used by different databases posed a major barrier to data integration. We found that integration of distribution databases could lead to increased taxonomic coverage that corresponds to 23 years’ advancement in data accumulation, and improvement in taxonomic coverage could be as high as 22.4% for trait databases.

Main conclusions

Rapid increases in biodiversity knowledge can be achieved through the integration of databases, providing the data necessary to address critical environmental challenges. Full integration across databases will require tackling the major impediments to data integration: taxonomic incompatibility, lags in data exchange, barriers to effective data synchronization, and isolation of individual initiatives.

more » « less
Award ID(s):
2019470 2017949
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  more » ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;   « less
Publisher / Repository:
Date Published:
Journal Name:
Global Ecology and Biogeography
Page Range / eLocation ID:
p. 1242-1260
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Motivation

    Traits are increasingly being used to quantify global biodiversity patterns, with trait databases growing in size and number, across diverse taxa. Despite growing interest in a trait‐based approach to the biodiversity of the deep sea, where the impacts of human activities (including seabed mining) accelerate, there is no single repository for species traits for deep‐sea chemosynthesis‐based ecosystems, including hydrothermal vents. Using an international, collaborative approach, we have compiled the first global‐scale trait database for deep‐sea hydrothermal‐vent fauna – sFDvent (sDiv‐funded trait database for theFunctionalDiversity ofvents). We formed a funded working group to select traits appropriate to: (a) capture the performance of vent species and their influence on ecosystem processes, and (b) compare trait‐based diversity in different ecosystems. Forty contributors, representing expertise across most known hydrothermal‐vent systems and taxa, scored species traits using online collaborative tools and shared workspaces. Here, we characterise the sFDvent database, describe our approach, and evaluate its scope. Finally, we compare the sFDvent database to similar databases from shallow‐marine and terrestrial ecosystems to highlight how the sFDvent database can inform cross‐ecosystem comparisons. We also make the sFDvent database publicly available online by assigning a persistent, unique DOI.

    Main types of variable contained

    Six hundred and forty‐six vent species names, associated location information (33 regions), and scores for 13 traits (in categories: community structure, generalist/specialist, geographic distribution, habitat use, life history, mobility, species associations, symbiont, and trophic structure). Contributor IDs, certainty scores, and references are also provided.

    Spatial location and grain

    Global coverage (grain size: ocean basin), spanning eight ocean basins, including vents on 12 mid‐ocean ridges and 6 back‐arc spreading centres.

    Time period and grain

    sFDvent includes information on deep‐sea vent species, and associated taxonomic updates, since they were first discovered in 1977. Time is not recorded. The database will be updated every 5 years.

    Major taxa and level of measurement

    Deep‐sea hydrothermal‐vent fauna with species‐level identification present or in progress.

    Software format

    .csv and MS Excel (.xlsx).

    more » « less
  2. Abstract Motivation

    Biodiversity in many areas is rapidly declining because of global change. As such, there is an urgent need for new tools and strategies to help identify, monitor and conserve biodiversity hotspots. This is especially true for frugivores, species consuming fruit, because of their important role in seed dispersal and maintenance of forest structure and health. One way to identify these areas is by quantifying functional diversity, which measures the unique roles of species within a community and is valuable for conservation because of its relationship with ecosystem functioning. Unfortunately, the functional trait information required for these studies can be sparse for certain taxa and specific traits and difficult to harmonize across disparate data sources, especially in biodiversity hotspots. To help fill this need, we compiled Frugivoria, a trait database containing ecological, life‐history, morphological and geographical traits for mammals and birds exhibiting frugivory. Frugivoria encompasses species in contiguous moist montane forests and adjacent moist lowland forests of Central and South America—the latter specifically focusing on the Andean states. Compared with existing trait databases, Frugivoria harmonizes existing trait databases, adds new traits, extends traits originally only available for mammals to birds also and fills gaps in trait categories from other databases. Furthermore, we create a cross‐taxa subset of shared traits to aid in analysis of mammals and birds. In total, Frugivoria adds 8662 new trait values for mammals and 14,999 for birds and includes a total of 45,216 trait entries with only 11.37% being imputed. Frugivoria also contains an open workflow that harmonizes trait and taxonomic data from disparate sources and enables users to analyse traits in space. As such, this open‐access database, which aligns with FAIR data principles, fills a major knowledge gap, enabling more comprehensive trait‐based studies of species in this ecologically important region.

    Main Types of Variable Contained

    Ecological, life‐history, morphological and geographical traits.

    Spatial Location and Grain

    Neotropical countries (Mexico, Guatemala, Costa Rica, Panama, El Salvador, Belize, Nicaragua, Ecuador, Colombia, Peru, Bolivia, Argentina, Venezuela and Chile) with contiguous montane regions.

    Time Period and Grain

    IUCN spatial data: obtained February 2023, spanning range maps collated from 1998 to 2022. IUCN species data: obtained June 2019–September 2022. Newly included traits: span 1924 to 2023.

    Major Taxa and Level of Measurement

    Classes Mammalia and Aves; 40,074 species‐level traits; 5142 imputed traits for 1733 species (mammals: 582; birds: 1147) and 16 sub‐species (mammals).

    Software Format

    .csv; R.

    more » « less
  3. BACKGROUND The Republic of Madagascar is home to a unique assemblage of taxa and a diverse set of ecosystems. These high levels of diversity have arisen over millions of years through complex processes of speciation and extinction. Understanding this extraordinary diversity is crucial for highlighting its global importance and guiding urgent conservation efforts. However, despite the detailed knowledge that exists on some taxonomic groups, there are large knowledge gaps that remain to be filled. ADVANCES Our comprehensive analysis of major taxonomic groups in Madagascar summarizes information on the origin and evolution of terrestrial and freshwater biota, current species richness and endemism, and the utilization of this biodiversity by humans. The depth and breadth of Madagascar’s biodiversity—the product of millions of years of evolution in relative isolation —is still being uncovered. We report a recent acceleration in the scientific description of species but many remain relatively unknown, particularly fungi and most invertebrates. DIGITIZATION Digitization efforts are already increasing the resolution of species richness patterns and we highlight the crucial role of field- and collections-based research for advancing biodiversity knowledge in Madagascar. Phylogenetic diversity patterns mirror that of species richness and endemism in most of the analyzed groups. Among the new data presented, our update on plant numbers estimates 11,516 described vascular plant species native to Madagascar, of which 82% are endemic, in addition to 1215 bryophyte species, of which 28% are endemic. Humid forests are highlighted as centers of diversity because of their role as refugia and centers of recent and rapid radiations, but the distinct endemism of other areas such as the grassland-woodland mosaic of the Central Highlands and the spiny forest of the southwest is also important despite lower species richness. Endemism in Malagasy fungi remains poorly known given the lack of data on the total diversity and global distribution of species. However, our analysis has shown that ~75% of the fungal species detected by environmental sequencing have not been reported as occurring outside of Madagascar. Among the 1314 species of native terrestrial and freshwater vertebrates, levels of endemism are extremely high (90% overall)—all native nonflying terrestrial mammals and native amphibians are found nowhere else on Earth; further, 56% of the island’s birds, 81% of freshwater fishes, 95% of mammals, and 98% of reptile species are endemic. Little is known about endemism in insects, but data from the few well-studied groups on the island suggest that it is similarly high. The uses of Malagasy species are many, with much potential for the uncovering of useful traits for food, medicine, and climate mitigation. OUTLOOK Considerable work remains to be done to fully characterize Madagascar’s biodiversity and evolutionary history. The multitudes of known and potential uses of Malagasy species reported here, in conjunction with the inherent value of this unique and biodiverse region, reinforce the importance of conserving this unique biota in the face of major threats such as habitat loss and overexploitation. The gathering and analysis of data on Madagascar’s remarkable biota must continue and accelerate if we are to safeguard this unique and highly threatened subset of Earth’s biodiversity. Emergence and composition of Madagascar’s extraordinary biodiversity. Madagascar’s biota is the result of over 160 million years of evolution, mostly in geographic isolation, combined with sporadic long distance immigration events and local extinctions. (Left) We show the age of the oldest endemic Malagasy clade for major groups (from bottom to top): arthropods, bony fishes, reptiles, flatworms, birds, amphibians, flowering plants, mammals, non-flowering vascular plants, and mollusks). Humans arrived recently, some 10,000 to 2000 years (top right) and have directly or indirectly caused multiple extinctions (including hippopotamus, elephant birds, giant tortoises, and giant lemurs) and introduced many new species (such as dogs, zebu, rats, African bushpigs, goats, sheep, rice). Endemism is extremely high and unevenly distributed across the island (the heat map depicts Malagasy palm diversity, a group characteristic of the diverse humid forest). Human use of biodiversity is widespread, including 1916 plant species with reported uses. The scientific description of Malagasy biodiversity has accelerated greatly in recent years (bottom right), yet the diversity and evolution of many groups remain practically unknown, and many discoveries await. 
    more » « less
  4. BACKGROUND Madagascar is one of the world’s foremost biodiversity hotspots. Its unique assemblage of plants, animals, and fungi—the majority of which evolved on the island and occur nowhere else—is both diverse and threatened. After human arrival, the island’s entire megafauna became extinct, and large portions of the current flora and fauna may be on track for a similar fate. Conditions for the long-term survival of many Malagasy species are not currently met because of multiple anthropogenic threats. ADVANCES We review the extinction risk and threats to biodiversity in Madagascar, using available international assessment data as well as a machine learning analysis to predict the extinction risks and threats to plant species lacking assessments. Our compilation of global International Union for Conservation of Nature (IUCN) Red List assessments shows that overexploitation alongside unsustainable agricultural practices affect 62.1 and 56.8% of vertebrate species, respectively, and each affects nearly 90% of all plant species. Other threats have a relatively minor effect today but are expected to increase in coming decades. Because only one-third (4652) of all Malagasy plant species have been formally assessed, we carried out a neural network analysis to predict the putative status and threats for 5887 unassessed species and to evaluate biases in current assessments. The percentage of plant species currently assessed as under threat is probably representative of actual numbers, except in the case of the ferns and lycophytes, where significantly more species are estimated to be threatened. We find that Madagascar is home to a disproportionately high number of Evolutionarily Distinct and Globally Endangered (EDGE) species. This further highlights the urgency for evidence-based and effective in situ and ex situ conservation. Despite these alarming statistics and trends, we find that 10.4% of Madagascar’s land area is protected and that the network of protected areas (PAs) covers at least part of the range of 97.1% of terrestrial and freshwater vertebrates with known distributions (amphibians, freshwater fishes, reptiles, birds, and mammal species combined) and 67.7% of plant species (for threatened species, the percentages are 97.7% for vertebrates and 79.6% for plants). Complementary to this, ex situ collections hold 18% of vertebrate species and 23% of plant species. Nonetheless, there are still many threatened species that do not occur within PAs and are absent from ex situ collections, including one amphibian, three mammals, and seven reptiles, as well as 559 plants and more yet to be assessed. Based on our updated vegetation map, we find that the current PA network provides good coverage of the major habitats, particularly mangroves, spiny forest, humid forest, and tapia, but subhumid forest and grassland-woodland mosaic have very low areas under protection (5.7 and 1.8% respectively). OUTLOOK Madagascar is among the world’s poorest countries, and its biodiversity is a key resource for the sustainable future and well-being of its citizens. Current threats to Madagascar’s biodiversity are deeply rooted in historical and present social contexts, including widespread inequalities. We therefore propose five opportunities for action to further conservation in a just and equitable way. First, investment in conservation and restoration must be based on evidence and effectiveness and be tailored to meet future challenges through inclusive solutions. Second, expanded biodiversity monitoring, including increased dataset production and availability, is key. Third, improving the effectiveness of existing PAs—for example through community engagement, training, and income opportunities—is more important than creating new ones. Fourth, conservation and restoration should not focus solely on the PA network but should also include the surrounding landscapes and communities. And finally, conservation actions must address the root causes of biodiversity loss, including poverty and food insecurity. In the eyes of much of the world, Madagascar’s biodiversity is a unique global asset that needs saving; in the daily lives of many of the Malagasy people, it is a rapidly diminishing source of the most basic needs for subsistence. Protecting Madagascar’s biodiversity while promoting social development for its people is a matter of the utmost urgency Visual representation of five key opportunities for conserving and restoring Madagascar’s rapidly declining biodiversity identified in this Review. The dashed lines point to representative vegetation types where these recommendations could have tangible effects, but the opportunities are applicable across Madagascar. ILLUSTRATION: INESSA VOET 
    more » « less
  5. Abstract Aim

    Plant trait databases often contain traits that are correlated, but for whom direct (undirected statistical dependency) and indirect (mediated by other traits) connections may be confounded. The confounding of correlation and connection hinders our understanding of plant strategies, and how these vary among growth forms and climate zones. We identified the direct and indirect connections across plant traits relevant to competition, resource acquisition and reproductive strategies using a global database and explored whether connections within and between traits from different tissue types vary across climates and growth forms.



    Major taxa studied


    Time period



    We used probabilistic graphical models and a database of 10 plant traits (leaf area, specific leaf area, mass‐ and area‐based leaf nitrogen and phosphorous content, leaf life span, plant height, stem specific density and seed mass) with 16,281 records to describe direct and indirect connections across woody and non‐woody plants across tropical, temperate, arid, cold and polar regions.


    Trait networks based on direct connections are sparser than those based on correlations. Land plants had high connectivity across traits within and between tissue types; leaf life span and stem specific density shared direct connections with all other traits. For both growth forms, two groups of traits form modules of more highly connected traits; one related to resource acquisition, the other to plant architecture and reproduction. Woody species had higher trait network modularity in polar compared to temperate and tropical climates, while non‐woody species did not show significant differences in modularity across climate regions.

    Main conclusions

    Plant traits are highly connected both within and across tissue types, yet traits segregate into persistent modules of traits. Variation in the modularity of trait networks suggests that trait connectivity is shaped by prevailing environmental conditions and demonstrates that plants of different growth forms use alternative strategies to cope with local conditions.

    more » « less