Scientists worldwide are making massive efforts to understand how the biodiversity we see on Earth evolved from single-cell organisms at the origin of life; this diversification process is represented by the Tree of Life. Low sampling rates and high heterogeneity in the rate of evolution across sites and lineages produce a phenomenon known as “long branch attraction” (LBA), in which long non-sister lineages are estimated to be sisters regardless of their true evolutionary relationship. LBA has been a pervasive problem in phylogenetic inference, affecting methodologies that range from distance-based to likelihood-based. Here, we present a novel neural network model that outperforms standard phylogenetic methods and other neural network implementations under LBA settings. Furthermore, unlike existing neural network models in phylogenetics, our model naturally accounts for tree isomorphisms via permutation-invariant functions, which ultimately lowers memory use and allows seamless extension to larger trees.
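As a rough illustration of what "accounting for tree isomorphisms via permutation-invariant functions" can mean, the sketch below scores a four-taxon tree ((a,b),(c,d)) so that every relabeling of the same topology gets the same value. It is a toy under stated assumptions, not the architecture described in the paper: the functions `embed`, `pool`, and `quartet_score`, the weight matrix `W`, and the feature vectors are all hypothetical and chosen only for illustration.

```python
import math

def embed(features, weights):
    """Toy per-taxon embedding: one linear layer followed by tanh."""
    return [math.tanh(sum(w * x for w, x in zip(row, features))) for row in weights]

def pool(u, v):
    """Symmetric pooling: the elementwise sum does not depend on argument order."""
    return [a + b for a, b in zip(u, v)]

def quartet_score(a, b, c, d, weights):
    """Score the unrooted quartet ((a,b),(c,d)) from per-taxon feature vectors."""
    cherry_ab = pool(embed(a, weights), embed(b, weights))
    cherry_cd = pool(embed(c, weights), embed(d, weights))
    # The dot product is symmetric too, so swapping the two cherries changes nothing.
    return sum(x * y for x, y in zip(cherry_ab, cherry_cd))

# Made-up weights and taxon features, for illustration only.
W = [[0.5, -0.2], [0.1, 0.3]]
a, b, c, d = [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, -1.0]

base = quartet_score(a, b, c, d, W)
# Relabelings that describe the same unrooted topology ((a,b),(c,d)).
relabelings = [(b, a, c, d), (a, b, d, c), (c, d, a, b), (d, c, b, a)]
print(all(math.isclose(base, quartet_score(*r, W)) for r in relabelings))
# A genuinely different topology, ((a,c),(b,d)), is not forced to share the score.
print(math.isclose(base, quartet_score(a, c, b, d, W)))
```

Because the symmetry is built into the function, a model of this kind would not need a separate training example or output slot for each equivalent labeling of a tree, which is one plausible reading of the memory saving mentioned in the abstract.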
- NSF-PAR ID: 10216898
- Editor(s): Pupko, Tal
- Date Published:
- Journal Name: Molecular Biology and Evolution
- ISSN: 0737-4038
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
-
Ouangraoua, Aida (Ed.)
Abstract -
Abstract Scorpions are ancient and historically renowned for their potent venom. Traditionally, the systematics of this group of arthropods was supported by morphological characters, until recent phylogenomic analyses (using RNAseq data) revealed most of the higher‐level taxa to be non‐monophyletic. While these phylogenomic hypotheses are stable for almost all lineages, some nodes have been hard to resolve due to minimal taxonomic sampling (e.g. family Chactidae). Similarly, some nodes in the Arachnid Tree of Life have been shown to disagree between hypotheses generated using transcriptomes and those generated using other genomic sources such as ultraconserved elements (UCEs). Here, we compared the phylogenetic signal of transcriptomes vs. UCEs by retrieving UCEs from new and previously published scorpion transcriptomes and genomes, and reconstructed phylogenies using both datasets independently. We reexamined the monophyly and phylogenetic placement of Chactidae, sampling an additional chactid species using both datasets. Our results showed that both genome‐scale datasets recovered highly similar topologies, with Chactidae rendered paraphyletic owing to the placement of Nullibrotheas allenii. As a first step toward redressing the systematics of Chactidae, we establish the family Anuroctonidae (new family) to accommodate the genus Anuroctonus.
-
Abstract Phylogenomic analysis of large genome-wide sequence data sets can resolve phylogenetic tree topologies for large species groups, help test the accuracy of and improve resolution for earlier multi-locus studies, and reveal the level of agreement or concordance within partitions of the genome for various tree topologies. Here we used a target-capture approach to sequence 1088 single-copy exons for more than 200 labrid fishes together with more than 100 outgroup taxa to generate a new data-rich phylogeny for the family Labridae. Our time-calibrated phylogenetic analysis of exon-capture data pushes the root node age of the family Labridae back into the Cretaceous, to about 79 Ma. The monotypic Centrogenys vaigiensis and the order Uranoscopiformes (stargazers) are identified as the sister lineages of Labridae. The phylogenetic relationships among major labrid subfamilies and within these clades were largely congruent with prior analyses of select mitochondrial and nuclear datasets. However, the position of the tribe Cirrhilabrini (fairy and flame wrasses) showed discordance, resolving either as the sister to a crown julidine clade or alternatively as sister to a group formed by the labrines, cheilines and scarines. Exploration of this pattern using multiple approaches leads to slightly higher support for the latter hypothesis, highlighting the importance of genome-level data sets for resolving short internodes at key phylogenetic positions in a large, economically important group of coral reef fishes. More broadly, we demonstrate how accounting for sources of biological variability from incomplete lineage sorting and exploring systematic error at conflicting nodes can aid in evaluating alternative phylogenetic hypotheses. [coral reefs; divergence time estimation; exon-capture; fossil calibration; incomplete lineage sorting.]
-
Cerretti, Pierfilippo (Ed.)
The schizophoran superfamily Ephydroidea (Diptera: Cyclorrhapha) includes eight families, ranging from the well-known vinegar flies (Drosophilidae) and shore flies (Ephydridae) to several small, relatively unusual groups, the phylogenetic placement of which has been particularly challenging for systematists. An extraordinary diversity in life histories, feeding habits and morphology is a hallmark of fly biology, and the Ephydroidea are no exception. Extreme specialization can lead to “orphaned” taxa with no clear evidence for their phylogenetic position. To resolve relationships among a diverse sample of Ephydroidea, including the highly modified flies in the families Braulidae and Mormotomyiidae, we conducted phylogenomic sampling. Using exon capture from Anchored Hybrid Enrichment and transcriptomics to obtain 320 orthologous nuclear genes sampled for 32 species of Ephydroidea and 11 outgroups, we evaluate a new phylogenetic hypothesis for representatives of the superfamily. These data strongly support monophyly of Ephydroidea, with Ephydridae as an early branching radiation and Mormotomyiidae placed as a family-level lineage sister to all remaining families. We confirm placement of Cryptochetidae as sister taxon to a large clade containing both Drosophilidae and Braulidae, the latter a family of honeybee ectoparasites. Our results reaffirm that sampling of both taxa and characters is critical in hyperdiverse clades and that these factors have a major influence on phylogenomic reconstruction of the history of the schizophoran fly radiation.
-
Morphological characters and nuclear ribosomal DNA (rDNA) phylogenies have so far been the basis of the current classifications of arbuscular mycorrhizal (AM) fungi. Improved understanding of the evolutionary history of AM fungi requires extensive ortholog sampling and analyses of genome and transcriptome data from a wide range of taxa. To circumvent the need for axenic culturing of AM fungi, we gathered and combined genomic data from single nuclei to generate de novo genome assemblies covering seven families of AM fungi. We successfully sequenced the genomes of 15 AM fungal species for which genome data were not previously available. Comparative analysis of the previously published Rhizophagus irregularis DAOM197198 assembly confirms that our novel workflow generates genome assemblies suitable for phylogenomic analysis. Predicted genes of our assemblies, together with published protein sequences of AM fungi and their sister clades, were used for phylogenomic analyses. We evaluated the phylogenetic placement of Glomeromycota in relation to its sister phyla (Mucoromycota and Mortierellomycota), and found no support to reject a polytomy. Finally, we explored the phylogenetic relationships within Glomeromycota. Our results support family-level classification from previous phylogenetic studies, and the polyphyly of the order Glomerales with Claroideoglomeraceae as the sister group to Glomeraceae and Diversisporales.