skip to main content


Title: Opinion: Genetic Conflict With Mobile Elements Drives Eukaryotic Genome Evolution, and Perhaps Also Eukaryogenesis
Abstract Through analyses of diverse microeukaryotes, we have previously argued that eukaryotic genomes are dynamic systems that rely on epigenetic mechanisms to distinguish germline (i.e., DNA to be inherited) from soma (i.e., DNA that undergoes polyploidization, genome rearrangement, etc.), even in the context of a single nucleus. Here, we extend these arguments by including two well-documented observations: (1) eukaryotic genomes interact frequently with mobile genetic elements (MGEs) like viruses and transposable elements (TEs), creating genetic conflict, and (2) epigenetic mechanisms regulate MGEs. Synthesis of these ideas leads to the hypothesis that genetic conflict with MGEs contributed to the evolution of a dynamic eukaryotic genome in the last eukaryotic common ancestor (LECA), and may have contributed to eukaryogenesis (i.e., may have been a driver in the evolution of FECA, the first eukaryotic common ancestor). Sex (i.e., meiosis) may have evolved within the context of the development of germline–soma distinctions in LECA, as this process resets the germline genome by regulating/eliminating somatic (i.e., polyploid, rearranged) genetic material. Our synthesis of these ideas expands on hypotheses of the origin of eukaryotes by integrating the roles of MGEs and epigenetics.  more » « less
Award ID(s):
1924570 1651908 1541511
NSF-PAR ID:
10249336
Author(s) / Creator(s):
;
Editor(s):
Orive, Maria
Date Published:
Journal Name:
Journal of Heredity
Volume:
112
Issue:
1
ISSN:
0022-1503
Page Range / eLocation ID:
140 to 144
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Mobile genetic elements (MGEs) are transient genetic material that can move either within a single organism's genome or between individuals or species. While historically considered “junk” DNA (i.e., deleterious or at best neutral), more recent studies reveal the potential adaptive advantages MGEs provide in lineages across the tree of life. Ciliates, a group of single‐celled microbial eukaryotes characterized by nuclear dimorphism, exemplify how epigenetic influences from MGEs shape genome architecture and patterns of molecular evolution. Ciliate nuclear dimorphism may have evolved as a response to transposon invasion and ciliates have since co‐opted transposons to carry out programmed DNA deletion. Another example of the effect of MGEs is in providing mechanisms for lateral gene transfer (LGT) from bacteria, which introduces genetic diversity and, in several cases, may drive ecological specialization in ciliates. As a third example, the integration of viral DNA, likely through transduction, provides new genetic materials and can change the way host cells defend themselves against other viral pathogens. We argue that the acquisition of MGEs through non‐Mendelian patterns of inheritance, coupled with their effects on ciliate genome architecture and persistence throughout evolutionary history, exemplify how the transmission of mobile elements should be considered a mechanism of transgenerational epigenetic inheritance.

     
    more » « less
  2. Archibald, John (Ed.)
    Abstract Epigenetic processes in eukaryotes play important roles through regulation of gene expression, chromatin structure, and genome rearrangements. The roles of chromatin modification (e.g., DNA methylation and histone modification) and non-protein-coding RNAs have been well studied in animals and plants. With the exception of a few model organisms (e.g., Saccharomyces and Plasmodium), much less is known about epigenetic toolkits across the remainder of the eukaryotic tree of life. Even with limited data, previous work suggested the existence of an ancient epigenetic toolkit in the last eukaryotic common ancestor. We use PhyloToL, our taxon-rich phylogenomic pipeline, to detect homologs of epigenetic genes and evaluate their macroevolutionary patterns among eukaryotes. In addition to data from GenBank, we increase taxon sampling from understudied clades of SAR (Stramenopila, Alveolata, and Rhizaria) and Amoebozoa by adding new single-cell transcriptomes from ciliates, foraminifera, and testate amoebae. We focus on 118 gene families, 94 involved in chromatin modification and 24 involved in non-protein-coding RNA processes based on the epigenetics literature. Our results indicate 1) the presence of a large number of epigenetic gene families in the last eukaryotic common ancestor; 2) differential conservation among major eukaryotic clades, with a notable paucity of genes within Excavata; and 3) punctate distribution of epigenetic gene families between species consistent with rapid evolution leading to gene loss. Together these data demonstrate the power of taxon-rich phylogenomic studies for illuminating evolutionary patterns at scales of >1 billion years of evolution and suggest that macroevolutionary phenomena, such as genome conflict, have shaped the evolution of the eukaryotic epigenetic toolkit. 
    more » « less
  3. INTRODUCTION Transposable elements (TEs), repeat expansions, and repeat-mediated structural rearrangements play key roles in chromosome structure and species evolution, contribute to human genetic variation, and substantially influence human health through copy number variants, structural variants, insertions, deletions, and alterations to gene transcription and splicing. Despite their formative role in genome stability, repetitive regions have been relegated to gaps and collapsed regions in human genome reference GRCh38 owing to the technological limitations during its development. The lack of linear sequence in these regions, particularly in centromeres, resulted in the inability to fully explore the repeat content of the human genome in the context of both local and regional chromosomal environments. RATIONALE Long-read sequencing supported the complete, telomere-to-telomere (T2T) assembly of the pseudo-haploid human cell line CHM13. This resource affords a genome-scale assessment of all human repetitive sequences, including TEs and previously unknown repeats and satellites, both within and outside of gaps and collapsed regions. Additionally, a complete genome enables the opportunity to explore the epigenetic and transcriptional profiles of these elements that are fundamental to our understanding of chromosome structure, function, and evolution. Comparative analyses reveal modes of repeat divergence, evolution, and expansion or contraction with locus-level resolution. RESULTS We implemented a comprehensive repeat annotation workflow using previously known human repeats and de novo repeat modeling followed by manual curation, including assessing overlaps with gene annotations, segmental duplications, tandem repeats, and annotated repeats. Using this method, we developed an updated catalog of human repetitive sequences and refined previous repeat annotations. We discovered 43 previously unknown repeats and repeat variants and characterized 19 complex, composite repetitive structures, which often carry genes, across T2T-CHM13. Using precision nuclear run-on sequencing (PRO-seq) and CpG methylated sites generated from Oxford Nanopore Technologies long-read sequencing data, we assessed RNA polymerase engagement across retroelements genome-wide, revealing correlations between nascent transcription, sequence divergence, CpG density, and methylation. These analyses were extended to evaluate RNA polymerase occupancy for all repeats, including high-density satellite repeats that reside in previously inaccessible centromeric regions of all human chromosomes. Moreover, using both mapping-dependent and mapping-independent approaches across early developmental stages and a complete cell cycle time series, we found that engaged RNA polymerase across satellites is low; in contrast, TE transcription is abundant and serves as a boundary for changes in CpG methylation and centromere substructure. Together, these data reveal the dynamic relationship between transcriptionally active retroelement subclasses and DNA methylation, as well as potential mechanisms for the derivation and evolution of new repeat families and composite elements. Focusing on the emerging T2T-level assembly of the HG002 X chromosome, we reveal that a high level of repeat variation likely exists across the human population, including composite element copy numbers that affect gene copy number. Additionally, we highlight the impact of repeats on the structural diversity of the genome, revealing repeat expansions with extreme copy number differences between humans and primates while also providing high-confidence annotations of retroelement transduction events. CONCLUSION The comprehensive repeat annotations and updated repeat models described herein serve as a resource for expanding the compendium of human genome sequences and reveal the impact of specific repeats on the human genome. In developing this resource, we provide a methodological framework for assessing repeat variation within and between human genomes. The exhaustive assessment of the transcriptional landscape of repeats, at both the genome scale and locally, such as within centromeres, sets the stage for functional studies to disentangle the role transcription plays in the mechanisms essential for genome stability and chromosome segregation. Finally, our work demonstrates the need to increase efforts toward achieving T2T-level assemblies for nonhuman primates and other species to fully understand the complexity and impact of repeat-derived genomic innovations that define primate lineages, including humans. Telomere-to-telomere assembly of CHM13 supports repeat annotations and discoveries. The human reference T2T-CHM13 filled gaps and corrected collapsed regions (triangles) in GRCh38. Combining long read–based methylation calls, PRO-seq, and multilevel computational methods, we provide a compendium of human repeats, define retroelement expression and methylation profiles, and delineate locus-specific sites of nascent transcription genome-wide, including previously inaccessible centromeres. SINE, short interspersed element; SVA, SINE–variable number tandem repeat– Alu ; LINE, long interspersed element; LTR, long terminal repeat; TSS, transcription start site; pA, xxxxxxxxxxxxxxxx. 
    more » « less
  4. Bomblies, K (Ed.)
    Abstract Transposable elements (TEs) have the potential to create regulatory variation both through the disruption of existing DNA regulatory elements and through the creation of novel DNA regulatory elements. In a species with a large genome, such as maize, many TEs interspersed with genes create opportunities for significant allelic variation due to TE presence/absence polymorphisms among individuals. We used information on putative regulatory elements in combination with knowledge about TE polymorphisms in maize to identify TE insertions that interrupt existing accessible chromatin regions (ACRs) in B73 as well as examples of polymorphic TEs that contain ACRs among four inbred lines of maize including B73, Mo17, W22, and PH207. The TE insertions in three other assembled maize genomes (Mo17, W22, or PH207) that interrupt ACRs that are present in the B73 genome can trigger changes to the chromatin, suggesting the potential for both genetic and epigenetic influences of these insertions. Nearly 20% of the ACRs located over 2 kb from the nearest gene are located within an annotated TE. These are regions of unmethylated DNA that show evidence for functional importance similar to ACRs that are not present within TEs. Using a large panel of maize genotypes, we tested if there is an association between the presence of TE insertions that interrupt, or carry, an ACR and the expression of nearby genes. While most TE polymorphisms are not associated with expression for nearby genes, the TEs that carry ACRs exhibit enrichment for being associated with higher expression of nearby genes, suggesting that these TEs may contribute novel regulatory elements. These analyses highlight the potential for a subset of TEs to rewire transcriptional responses in eukaryotic genomes. 
    more » « less
  5. Haplodiploidy and paternal genome elimination (HD/PGE) are common in invertebrates, having evolved at least two dozen times, all from male heterogamety (i.e., systems with X chromosomes). However, why X chromosomes are important for the evolution of HD/PGE remains debated. The Haploid Viability Hypothesis posits that X-linked genes promote the evolution of male haploidy by facilitating purging recessive deleterious mutations. The Intragenomic Conflict Hypothesis holds that conflict between genes drives genetic system turnover; under this model, X-linked genes could promote the evolution of male haploidy due to conflicts with autosomes over sex ratios and genetic transmission. We studied lineages where we can distinguish these hypotheses: species with germline PGE that retain an XX/X0 sex determination system (gPGE+X). Because evolving PGE in these cases involves changes in transmission without increases in male hemizygosity, a high degree of X linkage in these systems is predicted by the Intragenomic Conflict Hypothesis but not the Haploid Viability Hypothesis. To quantify the degree of X linkage, we sequenced and compared 7 gPGE+X species’ genomes with 11 related species with typical XX/XY or XX/X0 genetic systems, representing three transitions to gPGE. We find highly increased X linkage in both modern and ancestral genomes of gPGE+X species compared to non-gPGE relatives and recover a significant positive correlation between percent X linkage and the evolution of gPGE. These empirical results substantiate longstanding proposals for a role for intragenomic conflict in the evolution of genetic systems such as HD/PGE. 
    more » « less