This content will become publicly available on December 1, 2023
- Authors:
- ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; more »
- Award ID(s):
- 1836353
- Publication Date:
- NSF-PAR ID:
- 10377913
- Journal Name:
- Scientific Data
- Volume:
- 9
- Issue:
- 1
- ISSN:
- 2052-4463
- Sponsoring Org:
- National Science Foundation
More Like this
-
Making the most of biodiversity data requires linking observations of biological species from multiple sources both efficiently and accurately (Bisby 2000, Franz et al. 2016). Aggregating occurrence records using taxonomic names and synonyms is computationally efficient but known to experience significant limitations on accuracy when the assumption of one-to-one relationships between names and biological entities breaks down (Remsen 2016, Franz and Sterner 2018). Taxonomic treatments and checklists provide authoritative information about the correct usage of names for species, including operational representations of the meanings of those names in the form of range maps, reference genetic sequences, or diagnostic traits. They increasingly provide taxonomic intelligence in the form of precise description of the semantic relationships between different published names in the literature. Making this authoritative information Findable, Accessible, Interoperable, and Reusable (FAIR; Wilkinson et al. 2016) would be a transformative advance for biodiversity data sharing and help drive adoption and novel extensions of existing standards such as the Taxonomic Concept Schema and the OpenBiodiv Ontology (Kennedy et al. 2006, Senderov et al. 2018). We call for the greater, global Biodiversity Information Standards (TDWG) and taxonomy community to commit to extending and expanding on how FAIR applies to biodiversity data and includemore »
-
Abstract Understanding biodiversity patterns as well as drivers of population declines, and range losses provides crucial baselines for monitoring and conservation. However, the information needed to evaluate such trends remains unstandardised and sparsely available for many taxonomic groups and habitats, including the cave-dwelling bats and cave ecosystems. We developed the DarkCideS 1.0 ( https://darkcides.org/ ), a global database of bat caves and species synthesised from publicly available information and datasets. The DarkCideS 1.0 is by far the largest database for cave-dwelling bats, which contains information for geographical location, ecological status, species traits, and parasites and hyperparasites for 679 bat species are known to occur in caves or use caves in part of their life histories. The database currently contains 6746 georeferenced occurrences for 402 cave-dwelling bat species from 2002 cave sites in 46 countries and 12 terrestrial biomes. The database has been developed to be collaborative and open-access, allowing continuous data-sharing among the community of bat researchers and conservation biologists to advance bat research and comparative monitoring and prioritisation for conservation.
-
While bees are critical to sustaining a large proportion of global food production, as well as pollinating both wild and cultivated plants, they are decreasing in both numbers and diversity. Our understanding of the factors driving these declines is limited, in part, because we lack sufficient data on the distribution of bee species to predict changes in their geographic range under climate change scenarios. Additionally lacking is adequate data on the behavioral and anatomical traits that may make bees either vulnerable or resilient to human-induced environmental changes, such as habitat loss and climate change. Fortunately, a wealth of associated attributes can be extracted from the specimens deposited in natural history collections for over 100 years. Extending Anthophila Research Through Image and Trait Digitization (Big-Bee) is a newly funded US National Science Foundation Advancing Digitization of Biodiversity Collections project. Over the course of three years, we will create over one million high-resolution 2D and 3D images of bee specimens (Fig. 1), representing over 5,000 worldwide bee species, including most of the major pollinating species. We will also develop tools to measure bee traits from images and generate comprehensive bee trait and image datasets to measure changes through time. The Big-Bee networkmore »
-
Abstract Background Statistical geneticists employ simulation to estimate the power of proposed studies, test new analysis tools, and evaluate properties of causal models. Although there are existing trait simulators, there is ample room for modernization. For example, most phenotype simulators are limited to Gaussian traits or traits transformable to normality, while ignoring qualitative traits and realistic, non-normal trait distributions. Also, modern computer languages, such as Julia, that accommodate parallelization and cloud-based computing are now mainstream but rarely used in older applications. To meet the challenges of contemporary big studies, it is important for geneticists to adopt new computational tools.
Results We present , an open-source Julia package that makes it trivial to quickly simulate phenotypes under a variety of genetic architectures. This package is integrated into our OpenMendel suite for easy downstream analyses. Julia was purpose-built for scientific programming and provides tremendous speed and memory efficiency, easy access to multi-CPU and GPU hardware, and to distributed and cloud-based parallelization. is designed to encourage flexible trait simulation, including via the standard devices of applied statistics, generalized linear models (GLMs) and generalized linear mixed models (GLMMs). also accommodates many study designs: unrelateds, sibships, pedigrees, or a mixture of all three. (Of course, for datamore »
Conclusion The package has three main advantages. (1) It leverages the computational efficiency and ease of use of Julia to provide extremely fast, straightforward simulation of even the most complex genetic models, including GLMs and GLMMs. (2) It can be operated entirely within, but is not limited to, the integrated analysis pipeline of OpenMendel. And finally (3), by allowing a wider range of more realistic phenotype models, brings power calculations and diagnostic tools closer to what investigators might see in real-world analyses.
-
If we better understand how fungal responses to global change are governed by their traits, we can improve predictions of fungal community composition and ecosystem function. Specifically, we can examine trade-offs among traits, in which the allocation of finite resources toward one trait reduces the investment in others. We hypothesized that trade-offs among fungal traits relating to rapid growth, resource capture, and stress tolerance sort fungal species into discrete life history strategies. We used the Biolog Filamentous Fungi database to calculate maximum growth rates of 37 fungal species and then compared them to their functional traits from the fun fun database. In partial support of our hypothesis, maximum growth rate displayed a negative relationship with traits related to resource capture. Moreover, maximum growth rate displayed a positive relationship with amino acid permease, forming a putative Fast Growth life history strategy. A second putative life history strategy is characterized by a positive relationship between extracellular enzymes, including cellobiohydrolase 6, cellobiohydrolase 7, crystalline cellulase AA9, and lignin peroxidase. These extracellular enzymes were negatively related to chitosanase 8, an enzyme that can break down a derivative of chitin. Chitosanase 8 displayed a positive relationship with many traits that were hypothesized to cluster separately,more »