skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Phylogenomics of the Epigenetic Toolkit Reveals Punctate Retention of Genes across Eukaryotes
Abstract Epigenetic processes in eukaryotes play important roles through regulation of gene expression, chromatin structure, and genome rearrangements. The roles of chromatin modification (e.g., DNA methylation and histone modification) and non-protein-coding RNAs have been well studied in animals and plants. With the exception of a few model organisms (e.g., Saccharomyces and Plasmodium), much less is known about epigenetic toolkits across the remainder of the eukaryotic tree of life. Even with limited data, previous work suggested the existence of an ancient epigenetic toolkit in the last eukaryotic common ancestor. We use PhyloToL, our taxon-rich phylogenomic pipeline, to detect homologs of epigenetic genes and evaluate their macroevolutionary patterns among eukaryotes. In addition to data from GenBank, we increase taxon sampling from understudied clades of SAR (Stramenopila, Alveolata, and Rhizaria) and Amoebozoa by adding new single-cell transcriptomes from ciliates, foraminifera, and testate amoebae. We focus on 118 gene families, 94 involved in chromatin modification and 24 involved in non-protein-coding RNA processes based on the epigenetics literature. Our results indicate 1) the presence of a large number of epigenetic gene families in the last eukaryotic common ancestor; 2) differential conservation among major eukaryotic clades, with a notable paucity of genes within Excavata; and 3) punctate distribution of epigenetic gene families between species consistent with rapid evolution leading to gene loss. Together these data demonstrate the power of taxon-rich phylogenomic studies for illuminating evolutionary patterns at scales of >1 billion years of evolution and suggest that macroevolutionary phenomena, such as genome conflict, have shaped the evolution of the eukaryotic epigenetic toolkit.  more » « less
Award ID(s):
1924570 1651908
PAR ID:
10206838
Author(s) / Creator(s):
; ; ;
Editor(s):
Archibald, John
Date Published:
Journal Name:
Genome Biology and Evolution
Volume:
12
Issue:
12
ISSN:
1759-6653
Page Range / eLocation ID:
2196 to 2210
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Ouangraoua, Aida (Ed.)
    Abstract Previous evolutionary reconstructions have concluded that early eukaryotic ancestors including both the last common ancestor of eukaryotes and of all fungi had intron-rich genomes. By contrast, some extant eukaryotes have few introns, underscoring the complex histories of intron–exon structures, and raising the question as to why these few introns are retained. Here, we have used recently available fungal genomes to address a variety of questions related to intron evolution. Evolutionary reconstruction of intron presence and absence using 263 diverse fungal species supports the idea that massive intron reduction through intron loss has occurred in multiple clades. The intron densities estimated in various fungal ancestors differ from zero to 7.6 introns per 1 kb of protein-coding sequence. Massive intron loss has occurred not only in microsporidian parasites and saccharomycetous yeasts, but also in diverse smuts and allies. To investigate the roles of the remaining introns in highly-reduced species, we have searched for their special characteristics in eight intron-poor fungi. Notably, the introns of ribosome-associated genes RPL7 and NOG2 have conserved positions; both intron-containing genes encoding snoRNAs. Furthermore, both the proteins and snoRNAs are involved in ribosome biogenesis, suggesting that the expression of the protein-coding genes and noncoding snoRNAs may be functionally coordinated. Indeed, these introns are also conserved in three-quarters of fungi species. Our study shows that fungal introns have a complex evolutionary history and underappreciated roles in gene expression. 
    more » « less
  2. Abstract In contrast to the typified view of genome cycling only between haploidy and diploidy, there is evidence from across the tree of life of genome dynamics that alter both copy number (i.e. ploidy) and chromosome complements. Here, we highlight examples of such processes, including endoreplication, aneuploidy, inheritance of extrachromosomal DNA, and chromatin extrusion. Synthesizing data on eukaryotic genome dynamics in diverse extant lineages suggests the possibility that such processes were present before the last eukaryotic common ancestor. While present in some prokaryotes, these features appear exaggerated in eukaryotes where they are regulated by eukaryote-specific innovations including the nucleus, complex cytoskeleton, and synaptonemal complex. Based on these observations, we propose a model by which genome conflict drove the transformation of genomes during eukaryogenesis: from the origin of eukaryotes (i.e. first eukaryotic common ancestor) through the evolution of last eukaryotic common ancestor. 
    more » « less
  3. Orive, Maria (Ed.)
    Abstract Through analyses of diverse microeukaryotes, we have previously argued that eukaryotic genomes are dynamic systems that rely on epigenetic mechanisms to distinguish germline (i.e., DNA to be inherited) from soma (i.e., DNA that undergoes polyploidization, genome rearrangement, etc.), even in the context of a single nucleus. Here, we extend these arguments by including two well-documented observations: (1) eukaryotic genomes interact frequently with mobile genetic elements (MGEs) like viruses and transposable elements (TEs), creating genetic conflict, and (2) epigenetic mechanisms regulate MGEs. Synthesis of these ideas leads to the hypothesis that genetic conflict with MGEs contributed to the evolution of a dynamic eukaryotic genome in the last eukaryotic common ancestor (LECA), and may have contributed to eukaryogenesis (i.e., may have been a driver in the evolution of FECA, the first eukaryotic common ancestor). Sex (i.e., meiosis) may have evolved within the context of the development of germline–soma distinctions in LECA, as this process resets the germline genome by regulating/eliminating somatic (i.e., polyploid, rearranged) genetic material. Our synthesis of these ideas expands on hypotheses of the origin of eukaryotes by integrating the roles of MGEs and epigenetics. 
    more » « less
  4. Phadke, Sujal (Ed.)
    Abstract Advances in phylogenomics and high-throughput sequencing have allowed the reconstruction of deep phylogenetic relationships in the evolution of eukaryotes. Yet, the root of the eukaryotic tree of life remains elusive. The most popular hypothesis in textbooks and reviews is a root between Unikonta (Opisthokonta + Amoebozoa) and Bikonta (all other eukaryotes), which emerged from analyses of a single-gene fusion. Subsequent, highly cited studies based on concatenation of genes supported this hypothesis with some variations or proposed a root within Excavata. However, concatenation of genes does not consider phylogenetically-informative events like gene duplications and losses. A recent study using gene tree parsimony (GTP) suggested the root lies between Opisthokonta and all other eukaryotes, but only including 59 taxa and 20 genes. Here we use GTP with a duplication-loss model in a gene-rich and taxon-rich dataset (i.e., 2,786 gene families from two sets of 155 and 158 diverse eukaryotic lineages) to assess the root, and we iterate each analysis 100 times to quantify tree space uncertainty. We also contrasted our results and discarded alternative hypotheses from the literature using GTP and the likelihood-based method SpeciesRax. Our estimates suggest a root between Fungi or Opisthokonta and all other eukaryotes; but based on further analysis of genome size, we propose that the root between Opisthokonta and all other eukaryotes is the most likely. 
    more » « less
  5. Across eukaryotes, gene regulation is manifested via chromatin states roughly distinguished as heterochromatin and euchromatin. The establishment, maintenance, and modulation of the chromatin states is mediated using several factors including chromatin modifiers. However, factors that avoid the intrusion of silencing signals into protein-coding genes are poorly understood. Here we show that a plant specific paralog of RNA polymerase (Pol) II, named Pol IV, is involved in avoidance of facultative heterochromatic marks in protein-coding genes, in addition to its well-established functions in silencing repeats and transposons. In its absence, H3K27 trimethylation (me3) mark intruded the protein-coding genes, more profoundly in genes embedded with repeats. In a subset of genes, spurious transcriptional activity resulted in small(s) RNA production, leading to post-transcriptional gene silencing. We show that such effects are significantly pronounced in rice, a plant with a larger genome with distributed heterochromatin compared withArabidopsis. Our results indicate the division of labor among plant-specific polymerases, not just in establishing effective silencing via sRNAs and DNA methylation but also in influencing chromatin boundaries. 
    more » « less