Title: Confusion will be my epitaph: genome-scale discordance stifles phylogenetic resolution of Holothuroidea
Sea cucumbers (Holothuroidea) are a diverse clade of echinoderms found from intertidal waters to the bottom of the deepest oceanic trenches. Their reduced skeletons and limited number of phylogenetically informative traits have long obfuscated morphological classifications. Sanger-sequenced molecular datasets have also failed to constrain the position of major lineages. Notably, topological uncertainty has hindered a resolution for Neoholothuriida, a highly diverse clade of Permo-Triassic age. We perform the first phylogenomic analysis of Holothuroidea, combining existing datasets with 13 novel transcriptomes. Using a highly curated dataset of 1100 orthologues, our efforts recapitulate previous results, struggling to resolve interrelationships among neoholothuriid clades. Three approaches to phylogenetic reconstruction (concatenation under both site-homogeneous and site-heterogeneous models, and coalescent-aware inference) result in alternative resolutions, all of which are recovered with strong support and across a range of datasets filtered for phylogenetic usefulness. We explore this intriguing result using gene-wise log-likelihood scores and attempt to correlate these with a large set of gene properties. While presenting novel ways of exploring and visualizing support for alternative trees, we are unable to discover significant predictors of topological preference, and our efforts fail to favour one topology. Neoholothuriid genomes seem to retain an amalgam of signals derived from multiple phylogenetic histories.
Award ID(s):
2036186
PAR ID:
10498069
Publisher / Repository:
https://royalsocietypublishing.org/doi/epdf/10.1098/rspb.2023.0988
Journal Name:
Proceedings of the Royal Society B: Biological Sciences
Volume:
290
Issue:
2002
ISSN:
0962-8452
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Gene tree discordance is expected in phylogenomic trees and biological processes are often invoked to explain it. However, heterogeneous levels of phylogenetic signal among individuals within data sets may cause artifactual sources of topological discordance. We examined how the information content in tips and subclades impacts topological discordance in the parrots (Order: Psittaciformes), a diverse and highly threatened clade of nearly 400 species. Using ultraconserved elements from 96% of the clade’s species-level diversity, we estimated concatenated and species trees for 382 ingroup taxa. We found that discordance among tree topologies was most common at nodes dating between the late Miocene and Pliocene, and often at the taxonomic level of the genus. Accordingly, we used two metrics to characterize information content in tips and assess the degree to which conflict between trees was being driven by lower-quality samples. Most instances of topological conflict and nonmonophyletic genera in the species tree could be objectively identified using these metrics. For subclades still discordant after tip-based filtering, we used a machine learning approach to determine whether phylogenetic signal or noise was the more important predictor of metrics supporting the alternative topologies. We found that when signal favored one of the topologies, the noise was the most important variable in poorly performing models that favored the alternative topology. In sum, we show that artifactual sources of gene tree discordance, which are likely a common phenomenon in many data sets, can be distinguished from biological sources by quantifying the information content in each tip and modeling which factors support each topology. [Historical DNA; machine learning; museomics; Psittaciformes; species tree.] 
  2. Ruane, Sara (Ed.)
    Abstract Genome-scale data have the potential to clarify phylogenetic relationships across the tree of life but have also revealed extensive gene tree conflict. This seeming paradox, whereby larger data sets both increase statistical confidence and uncover significant discordance, suggests that understanding sources of conflict is important for accurate reconstruction of evolutionary history. We explore this paradox in squamate reptiles, the vertebrate clade comprising lizards, snakes, and amphisbaenians. We collected an average of 5103 loci for 91 species of squamates that span higher-level diversity within the clade, which we augmented with publicly available sequences for an additional 17 taxa. Using a locus-by-locus approach, we evaluated support for alternative topologies at 17 contentious nodes in the phylogeny. We identified shared properties of conflicting loci, finding that rate and compositional heterogeneity drives discordance between gene trees and species tree and that conflicting loci rarely overlap across contentious nodes. Finally, by comparing our tests of nodal conflict to previous phylogenomic studies, we confidently resolve 9 of the 17 problematic nodes. We suggest this locus-by-locus and node-by-node approach can build consensus on which topological resolutions remain uncertain in phylogenomic studies of other contentious groups. [Anchored hybrid enrichment (AHE); gene tree conflict; molecular evolution; phylogenomic concordance; target capture; ultraconserved elements (UCE).] 
  3. Abstract Medicago truncatula is a model legume that has been extensively investigated in diverse subdisciplines of plant science. Medicago littoralis can interbreed with M. truncatula and M. italica; these three closely related species form a clade, i.e. TLI clade. Genetic studies have indicated that M. truncatula accessions are heterogeneous but their taxonomic identities have not been verified. To elucidate the phylogenetic position of diverse M. truncatula accessions within the genus, we assembled 54 plastid genomes (plastomes) using publicly available next-generation sequencing data and conducted phylogenetic analyses using maximum likelihood. Five accessions showed high levels of plastid DNA polymorphism. Three of these highly polymorphic accessions contained sequences from both M. truncatula and M. littoralis. Phylogenetic analyses of sequences placed some accessions closer to distantly related species suggesting misidentification of source material. Most accessions were placed within the TLI clade and maximally supported the interrelationships of three subclades. Two Medicago accessions were placed within a M. italica subclade of the TLI clade. Plastomes with a 45-kb (rpl20-ycf1) inversion were placed within the M. littoralis subclade. Our results suggest that the M. truncatula accession genome pool represents more than one species due to possible mistaken identities and gene flow among closely related species.
  4. Cnidarians are critical members of aquatic communities and have been an experimental system for a diversity of research areas ranging from development to biomechanics to global change biology. Yet, we still lack a well-resolved, taxonomically balanced cnidarian tree of life to place this research in appropriate phylogenetic context. To move towards this goal, we combined data from 26 new anthozoan transcriptomes with 86 previously published cnidarian and outgroup datasets to generate two 748-locus alignments containing 123,051 (trimmed) and 449,935 (untrimmed) amino acids. We estimated maximum likelihood phylogenies for both matrices under partitioned and unpartitioned site-homogeneous and site-heterogeneous models of substitution. We used the resulting topology to constrain a phylogenetic analysis of 1,814 small subunit ribosomal (18S) gene sequences from GenBank. Our results confirm the position of Ceriantharia (tube-dwelling anemones), a historically recalcitrant group, as sister to the rest of Hexacorallia across all phylogenies regardless of data matrix or model choice. We find unanimous support for the sister relationships of Scleractinia and Corallimorpharia and of Endocnidozoa and Medusozoa. We propose the name Coralliformes for the clade uniting scleractinians and corallimorpharians and the name Operculozoa for the clade uniting endocnidozoans and medusozoans. Of the 229 genera with more than a single representative in our 18S hybrid phylogeny, 47 (21%) were identified as monophyletic, providing a starting point for a number of taxonomic revisions. Together, these data are an invaluable resource for comparative cnidarian research and provide perspective to guide future refinement of cnidarian systematics.
  5. Abstract Rapid species radiations present difficulties for phylogenetic reconstruction due to lack of phylogenetic information and processes such as deep coalescence/incomplete lineage sorting and hybridization. Phylogenomic data can overcome some of these difficulties. In this study, we use anchored hybrid enrichment (AHE) nuclear phylogenomic data and mitochondrial genomes recovered from AHE bycatch with several concatenated and coalescent approaches to reconstruct the poorly resolved radiation of the New Zealand cicada species in the genera Kikihia Dugdale and Maoricicada Dugdale. Compared with previous studies using only three to five Sanger‐sequenced genes, we find increased resolution across our phylogenies, but several branches remain unresolved due to topological conflict among genes. Some nodes that are strongly supported by traditional support measures like bootstraps and posterior probabilities still show significant gene and site concordance conflict. In addition, we find strong mito‐nuclear discordance; likely the result of interspecific hybridization events in the evolutionary history of Kikihia and Maoricicada.