Title: Phylogenomic Analyses of 2,786 Genes in 158 Lineages Support a Root of the Eukaryotic Tree of Life between Opisthokonts and All Other Lineages
Abstract Advances in phylogenomics and high-throughput sequencing have allowed the reconstruction of deep phylogenetic relationships in the evolution of eukaryotes. Yet, the root of the eukaryotic tree of life remains elusive. The most popular hypothesis in textbooks and reviews is a root between Unikonta (Opisthokonta + Amoebozoa) and Bikonta (all other eukaryotes), which emerged from analyses of a single-gene fusion. Subsequent, highly cited studies based on concatenation of genes supported this hypothesis with some variations or proposed a root within Excavata. However, concatenation of genes does not consider phylogenetically informative events like gene duplications and losses. A recent study using gene tree parsimony (GTP) suggested the root lies between Opisthokonta and all other eukaryotes, but it included only 59 taxa and 20 genes. Here we use GTP with a duplication-loss model in a gene-rich and taxon-rich dataset (i.e., 2,786 gene families from two sets of 155 and 158 diverse eukaryotic lineages) to assess the root, and we iterate each analysis 100 times to quantify tree space uncertainty. We also contrasted our results and discarded alternative hypotheses from the literature using GTP and the likelihood-based method SpeciesRax. Our estimates suggest a root between Fungi or Opisthokonta and all other eukaryotes, but based on further analysis of genome size, we propose that the root between Opisthokonta and all other eukaryotes is the most likely.
Award ID(s):
1924570
PAR ID:
10356835
Editor(s):
Phadke, Sujal
Journal Name:
Genome Biology and Evolution
Volume:
14
Issue:
8
ISSN:
1759-6653
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Archibald, John (Ed.)
    Abstract Epigenetic processes in eukaryotes play important roles through regulation of gene expression, chromatin structure, and genome rearrangements. The roles of chromatin modification (e.g., DNA methylation and histone modification) and non-protein-coding RNAs have been well studied in animals and plants. With the exception of a few model organisms (e.g., Saccharomyces and Plasmodium), much less is known about epigenetic toolkits across the remainder of the eukaryotic tree of life. Even with limited data, previous work suggested the existence of an ancient epigenetic toolkit in the last eukaryotic common ancestor. We use PhyloToL, our taxon-rich phylogenomic pipeline, to detect homologs of epigenetic genes and evaluate their macroevolutionary patterns among eukaryotes. In addition to data from GenBank, we increase taxon sampling from understudied clades of SAR (Stramenopila, Alveolata, and Rhizaria) and Amoebozoa by adding new single-cell transcriptomes from ciliates, foraminifera, and testate amoebae. We focus on 118 gene families, 94 involved in chromatin modification and 24 involved in non-protein-coding RNA processes based on the epigenetics literature. Our results indicate 1) the presence of a large number of epigenetic gene families in the last eukaryotic common ancestor; 2) differential conservation among major eukaryotic clades, with a notable paucity of genes within Excavata; and 3) punctate distribution of epigenetic gene families between species consistent with rapid evolution leading to gene loss. Together these data demonstrate the power of taxon-rich phylogenomic studies for illuminating evolutionary patterns at scales of >1 billion years of evolution and suggest that macroevolutionary phenomena, such as genome conflict, have shaped the evolution of the eukaryotic epigenetic toolkit. 
  2. Eukaryotic diversity is largely microbial, with macroscopic lineages (plants, animals and fungi) nesting among a plethora of diverse protists. Understanding the evolutionary relationships among eukaryotes is rapidly advancing through omics analyses, but phylogenomics are challenging for microeukaryotes, particularly uncultivable lineages, as single-cell sequencing approaches generate a mixture of sequences from hosts, associated microbiomes, and contaminants. Moreover, many analyses of eukaryotic gene families and phylogenies rely on boutique datasets and methods that are challenging for other research groups to replicate. To address these challenges, we present EukPhylo v1.0, a modular, user-friendly pipeline that enables effective data curation through phylogeny-informed contamination removal, estimation of homologous gene families (GFs), and generation of both multisequence alignments and gene trees. Analyses can use a hook database of ~15k ancient GFs or users can easily replace this hook with a set of gene families of interest. We demonstrate the power of EukPhylo, including a suite of stand-alone utilities, through analyses of 500 conserved GFs sampled from 1,000 diverse species of eukaryotes, bacteria and archaea. We show improvements in estimates of the eukaryotic tree of life, recovering clades that are well established in the literature, through successive rounds of curation using the EukPhylo contamination loop. The final trees corroborate numerous hypotheses in the literature (e.g. Opisthokonta, Rhizaria, Amoebozoa) while challenging others (e.g. CRuMs, Obazoa, Diaphoretickes). We believe that the flexibility and transparency of EukPhylo set standards for curation of omics data for future studies.
  3. Abstract The details surrounding the early evolution of eukaryotes and their viruses are largely unknown. Several key enzymes involved in DNA synthesis and transcription are shared between eukaryotes and large DNA viruses in the phylum Nucleocytoviricota, but the evolutionary relationships between these genes remain unclear. In particular, previous studies of eukaryotic DNA and RNA polymerases often show deep-branching clades of eukaryotes and viruses indicative of ancient gene exchange. Here, we performed updated phylogenetic analysis of eukaryotic and viral family B DNA polymerases, multimeric RNA polymerases, and mRNA-capping enzymes to explore their evolutionary relationships. Our results show that viral enzymes form clades that are typically adjacent to eukaryotes, suggesting that they originate prior to the emergence of the Last Eukaryotic Common Ancestor (LECA). The machinery for viral DNA replication, transcription, and mRNA capping are all key processes needed for the maintenance of virus factories, which are complex structures formed by many nucleocytoviruses during infection, indicating that viruses capable of making these structures are ancient. These findings hint at a diverse and complex pre-LECA virosphere and indicate that large DNA viruses may encode proteins that are relics of extinct proto-eukaryotic lineages.
  4. Animals use geomagnetic fields for navigational cues, yet the sensory mechanism underlying magnetic perception remains poorly understood. One idea is that geomagnetic fields are physically transduced by magnetite crystals contained inside specialized receptor cells, but evidence for intracellular, biogenic magnetite in eukaryotes is scant. Certain bacteria produce magnetite crystals inside intracellular compartments, representing the most ancient form of biomineralization known and having evolved prior to emergence of the crown group of eukaryotes, raising the question of whether magnetite biomineralization in eukaryotes and prokaryotes might share a common evolutionary history. Here, we discover that salmonid olfactory epithelium contains magnetite crystals arranged in compact clusters and determine that genes differentially expressed in magnetic olfactory cells, contrasted to nonmagnetic olfactory cells, share ancestry with an ancient prokaryote magnetite biomineralization system, consistent with exaptation for use in eukaryotic magnetoreception. We also show that 11 prokaryote biomineralization genes are universally present among a diverse set of eukaryote taxa and that nine of those genes are present within the Asgard clade of archaea Lokiarchaeota that affiliates with eukaryotes in phylogenomic analysis. Consistent with deep homology, we present an evolutionary genetics hypothesis for magnetite formation among eukaryotes to motivate convergent approaches for examining magnetite-based magnetoreception, molecular origins of matrix-associated biomineralization processes, and eukaryogenesis. 
  5. Laub, Michael T (Ed.)
    Animals use a variety of cell-autonomous innate immune proteins to detect viral infections and prevent replication. Recent studies have discovered that a subset of mammalian antiviral proteins have homology to antiphage defense proteins in bacteria, implying that there are aspects of innate immunity that are shared across the Tree of Life. While the majority of these studies have focused on characterizing the diversity and biochemical functions of the bacterial proteins, the evolutionary relationships between animal and bacterial proteins are less clear. This ambiguity is partly due to the long evolutionary distances separating animal and bacterial proteins, which obscures their relationships. Here, we tackle this problem for 3 innate immune families (CD-NTases [including cGAS], STINGs, and viperins) by deeply sampling protein diversity across eukaryotes. We find that viperins and OAS family CD-NTases are ancient immune proteins, likely inherited since the earliest eukaryotes first arose. In contrast, we find other immune proteins that were acquired via at least 4 independent events of horizontal gene transfer (HGT) from bacteria. Two of these events allowed algae to acquire new bacterial viperins, while 2 more HGT events gave rise to distinct superfamilies of eukaryotic CD-NTases: the cGLR superfamily (containing cGAS) that has since diversified via a series of animal-specific duplications and a previously undefined eSMODS superfamily, which more closely resembles bacterial CD-NTases. Finally, we found that cGAS and STING proteins have substantially different histories, with STING protein domains undergoing convergent domain shuffling in bacteria and eukaryotes. Overall, our findings paint a picture of eukaryotic innate immunity as highly dynamic, where eukaryotes build upon their ancient antiviral repertoires through the reuse of protein domains and by repeatedly sampling a rich reservoir of bacterial antiphage genes. 