skip to main content


Title: rphenoscate: An R package for semantics‐aware evolutionary analyses of anatomical traits
Abstract

Organismal anatomy is a hierarchical system of anatomical entities often imposing dependencies among multiple morphological characters. Ontologies provide a formal and computable framework for incorporating prior biological knowledge about anatomical dependencies in models of trait evolution. They also offer new opportunities for working with semantic representations of morphological data.

In this work, we present a new R package—rphenoscate—that enables incorporating ontological knowledge in evolutionary analyses and exploring semantic patterns of morphological data. In conjunction withrphenoscape, it allows for assembling synthetic phylogenetic character matrices from semantic phenotypes of morphological data. We showcase the package functionality with data sets from bees and fishes.

We demonstrate that ontologies can be employed to automatically set up evolutionary models accounting for trait dependencies in stochastic character mapping. We also demonstrate how ontology annotations can be explored to interrogate patterns of morphological evolution. Finally, we demonstrate that synthetic character matrices assembled from semantic phenotypes retain most of the phylogenetic information from their original data sets.

Ontologies will become important tools for integrating anatomical knowledge into phylogenetic methods and making morphological data FAIR compliant—a critical step of the ongoing ‘phenomics’ revolution. Our new package offers key advancements towards this goal.

 
more » « less
NSF-PAR ID:
10467602
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  ;  ;  ;  
Publisher / Repository:
Wiley-Blackwell
Date Published:
Journal Name:
Methods in Ecology and Evolution
Volume:
14
Issue:
10
ISSN:
2041-210X
Format(s):
Medium: X Size: p. 2531-2540
Size(s):
["p. 2531-2540"]
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Reconstructing ancestral states for discrete characters is essential for understanding trait evolution in organisms. However, most existing methods are limited to individual characters and often overlook the hierarchical and interactive nature of traits. Recent advances in phylogenetics now offer the possibility of integrating knowledge from anatomy ontologies to reconstruct multiple discrete character histories. Nonetheless, practical applications that fully harness the potential of these new approaches are still lacking.

    This paper introducesontophylo, an R package that extends the PARAMO pipeline to address these limitations.Ontophyloenables the reconstruction of phenotypic entities composed of amalgamated characters, such as anatomical regions or entire phenomes. It offers three new applications: (1) reconstruction of evolutionary rates of amalgamated characters using phylogenetic non‐homogeneous Poisson process (pNHPP) that allows modelling rate variation across tree branches and time; (2) reconstruction of morphospace dynamics; and (3) visualization of evolutionary rates on vector images of organisms.Ontophyloincorporates ontological knowledge to facilitate these applications.

    Benchmarking confirms the accuracy of pNHPP in estimating character rates under different evolutionary scenarios, and example applications demonstrate the utility ofontophyloin studying morphological evolution in Hymenoptera using simulated data.

    Ontophylocan be easily integrated with other ontology‐oriented and general‐purpose R packages and offers new opportunities to examine morphological evolution on a phenomic scale using new and legacy data.

     
    more » « less
  2. Abstract

    Phenotypic data are crucial for understanding genotype–phenotype relationships, assessing the tree of life and revealing trends in trait diversity over time. Large‐scale description of whole organisms for quantitative analyses (phenomics) presents several challenges, and technological advances in the collection of genomic data outpace those for phenomic data. Reasons for this disparity include the time‐consuming and expensive nature of collecting discrete phenotypic data and mining previously published data on a given species (both often requiring anatomical expertise across taxa), and computational challenges involved with analysing high‐dimensional datasets.

    One approach to building approximations of organismal phenomes is to combine published datasets of discrete characters assembled for phylogenetic analyses into a phenomic dataset. Despite a wealth of legacy datasets in the literature for many groups, relatively few methods exist for automating the assembly, analysis, and visualization of phenomic datasets in phylogenetic contexts. Here, we introduce a newrpackagephenotoolsfor integrating (fusing original or legacy datasets), curating (finding and removing duplicates) and visualizing phenomic datasets.

    We demonstrate the utility of the proposed toolkit with a morphological dataset for flightless birds and two morphological datasets for theropod dinosaurs and provide recommendations for character construction to maximize accessibility in future workflows. Visualization tools allow rapid identification of anatomical subregions with difficult or problematic histories of homology.

    We anticipate these tools aiding automation of the assembly and visualization of phenomic datasets to inform evolutionary relationships and rates of phenotypic evolution.

     
    more » « less
  3. Abstract

    Morphology remains a primary source of phylogenetic information for many groups of organisms, and the only one for most fossil taxa. Organismal anatomy is not a collection of randomly assembled and independent “parts”, but instead a set of dependent and hierarchically nested entities resulting from ontogeny and phylogeny. How do we make sense of these dependent and at times redundant characters? One promising approach is using ontologies—structured controlled vocabularies that summarize knowledge about different properties of anatomical entities, including developmental and structural dependencies. Here, we assess whether evolutionary patterns can explain the proximity of ontology-annotated characters within an ontology. To do so, we measure phylogenetic information across characters and evaluate if it matches the hierarchical structure given by ontological knowledge—in much the same way as across-species diversity structure is given by phylogeny. We implement an approach to evaluate the Bayesian phylogenetic information (BPI) content and phylogenetic dissonance among ontology-annotated anatomical data subsets. We applied this to data sets representing two disparate animal groups: bees (Hexapoda: Hymenoptera: Apoidea, 209 chars) and characiform fishes (Actinopterygii: Ostariophysi: Characiformes, 463 chars). For bees, we find that BPI is not substantially explained by anatomy since dissonance is often high among morphologically related anatomical entities. For fishes, we find substantial information for two clusters of anatomical entities instantiating concepts from the jaws and branchial arch bones, but among-subset information decreases and dissonance increases substantially moving to higher-level subsets in the ontology. We further applied our approach to address particular evolutionary hypotheses with an example of morphological evolution in miniature fishes. While we show that phylogenetic information does match ontology structure for some anatomical entities, additional relationships and processes, such as convergence, likely play a substantial role in explaining BPI and dissonance, and merit future investigation. Our work demonstrates how complex morphological data sets can be interrogated with ontologies by allowing one to access how information is spread hierarchically across anatomical concepts, how congruent this information is, and what sorts of processes may play a role in explaining it: phylogeny, development, or convergence. [Apidae; Bayesian phylogenetic information; Ostariophysi; Phenoscape; phylogenetic dissonance; semantic similarity.]

     
    more » « less
  4. Abstract

    Plant functional strategies change considerably as plants develop, driven by intraindividual variability in anatomical, morphological, physiological and architectural traits.

    Developmental trait variation arises through the complex interplay among genetically regulated phase change (i.e. ontogeny), increases in plant age and size, and phenotypic plasticity to changing environmental conditions. Although spatial drivers of intraspecific trait variation have received extensive research attention, developmentally driven intraspecific trait variation is largely overlooked, despite widespread occurrence.

    Ontogenetic trait variation is genetically regulated, leads to dramatic changes in plant phenotypes and evolves in response to predictable changes in environmental conditions as plants develop.

    Evidence has accumulated to support a general shift from fast to slow relative growth rates and from shade to sun leaves as plants develop from the highly competitive but shady juvenile niche to the stressful adult niche in the systems studied to date.

    Nonetheless, there are major gaps in our knowledge due to examination of only a few environmental factors selecting for the evolution of ontogenetic trajectories, variability in how ontogeny is assigned, biogeographic sampling biases on trees in temperate biomes, dependencies on a few broadly sampled leaf morphological traits and a lack of longitudinal studies that track ontogeny within individuals. Filling these gaps will enhance our understanding of plant functional ecology and provide a framework for predicting the effects of global change threats that target specific ontogenetic stages.

    Read the freePlain Language Summaryfor this article on the Journal blog.

     
    more » « less
  5. Abstract Existing phenotype ontologies were originally developed to represent phenotypes that manifest as a character state in relation to a wild-type or other reference. However, these do not include the phenotypic trait or attribute categories required for the annotation of genome-wide association studies (GWAS), Quantitative Trait Loci (QTL) mappings or any population-focussed measurable trait data. The integration of trait and biological attribute information with an ever increasing body of chemical, environmental and biological data greatly facilitates computational analyses and it is also highly relevant to biomedical and clinical applications. The Ontology of Biological Attributes (OBA) is a formalised, species-independent collection of interoperable phenotypic trait categories that is intended to fulfil a data integration role. OBA is a standardised representational framework for observable attributes that are characteristics of biological entities, organisms, or parts of organisms. OBA has a modular design which provides several benefits for users and data integrators, including an automated and meaningful classification of trait terms computed on the basis of logical inferences drawn from domain-specific ontologies for cells, anatomical and other relevant entities. The logical axioms in OBA also provide a previously missing bridge that can computationally link Mendelian phenotypes with GWAS and quantitative traits. The term components in OBA provide semantic links and enable knowledge and data integration across specialised research community boundaries, thereby breaking silos. 
    more » « less