skip to main content


Title: Recent loss of the Dim2 DNA methyltransferase decreases mutation rate in repeats and changes evolutionary trajectory in a fungal pathogen
DNA methylation is found throughout all domains of life, yet the extent and function of DNA methylation differ among eukaryotes. Strains of the plant pathogenic fungus Zymoseptoria tritici appeared to lack cytosine DNA methylation (5mC) because gene amplification followed by Repeat-Induced Point mutation (RIP) resulted in the inactivation of the dim2 DNA methyltransferase gene. 5mC is, however, present in closely related sister species. We demonstrate that inactivation of dim2 occurred recently as some Z . tritici isolates carry a functional dim2 gene. Moreover, we show that dim2 inactivation occurred by a different path than previously hypothesized. We mapped the genome-wide distribution of 5mC in strains with or without functional dim2 alleles. Presence of functional dim2 correlates with high levels of 5mC in transposable elements (TEs), suggesting a role in genome defense. We identified low levels of 5mC in strains carrying non-functional dim2 alleles, suggesting that 5mC is maintained over time, presumably by an active Dnmt5 DNA methyltransferase. Integration of a functional dim2 allele in strains with mutated dim2 restored normal 5mC levels, demonstrating de novo cytosine methylation activity of Dim2. To assess the importance of 5mC for genome evolution, we performed an evolution experiment, comparing genomes of strains with high levels of 5mC to genomes of strains lacking functional dim2 . We found that presence of a functional dim2 allele alters nucleotide composition by promoting C to T transitions (C→T) specifically at CpA (CA) sites during mitosis, likely contributing to TE inactivation. Our results show that 5mC density at TEs is a polymorphic trait in Z . tritici populations that can impact genome evolution.  more » « less
Award ID(s):
1818006
NSF-PAR ID:
10240288
Author(s) / Creator(s):
; ; ; ; ; ; ; ;
Editor(s):
Krasileva, Ksenia
Date Published:
Journal Name:
PLOS Genetics
Volume:
17
Issue:
3
ISSN:
1553-7404
Page Range / eLocation ID:
e1009448
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. SUMMARY

    The DOMAINS REARRANGED METHYLTRANSFERASEs (DRMs) are crucial for RNA‐directed DNA methylation (RdDM) in plant species.Setaria viridisis a model monocot species with a relatively compact genome that has limited transposable element (TE) content. CRISPR‐based genome editing approaches were used to create loss‐of‐function alleles for the two putative functional DRM genes inS. viridisto probe the role of RdDM. Double mutant (drm1ab)plants exhibit some morphological abnormalities but are fully viable. Whole‐genome methylation profiling provided evidence for the widespread loss of methylation in CHH sequence contexts, particularly in regions with high CHH methylation in wild‐type plants. Evidence was also found for the locus‐specific loss of CG and CHG methylation, even in some regions that lack CHH methylation. Transcriptome profiling identified genes with altered expression in thedrm1abmutants. However, the majority of genes with high levels of CHH methylation directly surrounding the transcription start site or in nearby promoter regions in wild‐type plants do not have altered expression in thedrm1abmutant, even when this methylation is lost, suggesting limited regulation of gene expression by RdDM. Detailed analysis of the expression of TEs identified several transposons that are transcriptionally activated indrm1abmutants. These transposons are likely to require active RdDM for the maintenance of transcriptional repression.

     
    more » « less
  2. INTRODUCTION Transposable elements (TEs), repeat expansions, and repeat-mediated structural rearrangements play key roles in chromosome structure and species evolution, contribute to human genetic variation, and substantially influence human health through copy number variants, structural variants, insertions, deletions, and alterations to gene transcription and splicing. Despite their formative role in genome stability, repetitive regions have been relegated to gaps and collapsed regions in human genome reference GRCh38 owing to the technological limitations during its development. The lack of linear sequence in these regions, particularly in centromeres, resulted in the inability to fully explore the repeat content of the human genome in the context of both local and regional chromosomal environments. RATIONALE Long-read sequencing supported the complete, telomere-to-telomere (T2T) assembly of the pseudo-haploid human cell line CHM13. This resource affords a genome-scale assessment of all human repetitive sequences, including TEs and previously unknown repeats and satellites, both within and outside of gaps and collapsed regions. Additionally, a complete genome enables the opportunity to explore the epigenetic and transcriptional profiles of these elements that are fundamental to our understanding of chromosome structure, function, and evolution. Comparative analyses reveal modes of repeat divergence, evolution, and expansion or contraction with locus-level resolution. RESULTS We implemented a comprehensive repeat annotation workflow using previously known human repeats and de novo repeat modeling followed by manual curation, including assessing overlaps with gene annotations, segmental duplications, tandem repeats, and annotated repeats. Using this method, we developed an updated catalog of human repetitive sequences and refined previous repeat annotations. We discovered 43 previously unknown repeats and repeat variants and characterized 19 complex, composite repetitive structures, which often carry genes, across T2T-CHM13. Using precision nuclear run-on sequencing (PRO-seq) and CpG methylated sites generated from Oxford Nanopore Technologies long-read sequencing data, we assessed RNA polymerase engagement across retroelements genome-wide, revealing correlations between nascent transcription, sequence divergence, CpG density, and methylation. These analyses were extended to evaluate RNA polymerase occupancy for all repeats, including high-density satellite repeats that reside in previously inaccessible centromeric regions of all human chromosomes. Moreover, using both mapping-dependent and mapping-independent approaches across early developmental stages and a complete cell cycle time series, we found that engaged RNA polymerase across satellites is low; in contrast, TE transcription is abundant and serves as a boundary for changes in CpG methylation and centromere substructure. Together, these data reveal the dynamic relationship between transcriptionally active retroelement subclasses and DNA methylation, as well as potential mechanisms for the derivation and evolution of new repeat families and composite elements. Focusing on the emerging T2T-level assembly of the HG002 X chromosome, we reveal that a high level of repeat variation likely exists across the human population, including composite element copy numbers that affect gene copy number. Additionally, we highlight the impact of repeats on the structural diversity of the genome, revealing repeat expansions with extreme copy number differences between humans and primates while also providing high-confidence annotations of retroelement transduction events. CONCLUSION The comprehensive repeat annotations and updated repeat models described herein serve as a resource for expanding the compendium of human genome sequences and reveal the impact of specific repeats on the human genome. In developing this resource, we provide a methodological framework for assessing repeat variation within and between human genomes. The exhaustive assessment of the transcriptional landscape of repeats, at both the genome scale and locally, such as within centromeres, sets the stage for functional studies to disentangle the role transcription plays in the mechanisms essential for genome stability and chromosome segregation. Finally, our work demonstrates the need to increase efforts toward achieving T2T-level assemblies for nonhuman primates and other species to fully understand the complexity and impact of repeat-derived genomic innovations that define primate lineages, including humans. Telomere-to-telomere assembly of CHM13 supports repeat annotations and discoveries. The human reference T2T-CHM13 filled gaps and corrected collapsed regions (triangles) in GRCh38. Combining long read–based methylation calls, PRO-seq, and multilevel computational methods, we provide a compendium of human repeats, define retroelement expression and methylation profiles, and delineate locus-specific sites of nascent transcription genome-wide, including previously inaccessible centromeres. SINE, short interspersed element; SVA, SINE–variable number tandem repeat– Alu ; LINE, long interspersed element; LTR, long terminal repeat; TSS, transcription start site; pA, xxxxxxxxxxxxxxxx. 
    more » « less
  3. Abstract Background The function of DNA methyltransferase genes of insects is a puzzle, because an association between gene expression and methylation is not universal for insects. If the genes normally involved in cytosine methylation are not influencing gene expression, what might be their role? We previously demonstrated that gametogenesis of Oncopeltus fasciatus is interrupted at meiosis following knockdown of DNA methyltransferase 1 ( Dnmt1 ) and this is unrelated to changes in levels of cytosine methylation. Here, using transcriptomics, we tested the hypothesis that Dmnt1 is a part of the meiotic gene pathway. Testes, which almost exclusively contain gametes at varying stages of development, were sampled at 7 days and 14 days following knockdown of Dmnt1 using RNAi. Results Using microscopy, we found actively dividing spermatocysts were reduced at both timepoints. However, as with other studies, we saw Dnmt1 knockdown resulted in condensed nuclei after mitosis–meiosis transition, and then cellular arrest. We found limited support for a functional role for Dnmt1 in our predicted cell cycle and meiotic pathways. An examination of a priori Gene Ontology terms showed no enrichment for meiosis. We then used the full data set to reveal further candidate pathways influenced by Dnmt1 for further hypotheses. Very few genes were differentially expressed at 7 days, but nearly half of all transcribed genes were differentially expressed at 14 days. We found no strong candidate pathways for how Dnmt1 knockdown was achieving its effect through Gene Ontology term overrepresentation analysis. Conclusions We, therefore, suggest that Dmnt1 plays a role in chromosome dynamics based on our observations of condensed nuclei and cellular arrest with no specific molecular pathways disrupted. 
    more » « less
  4. Lerat, Emmanuelle (Ed.)
    Abstract Methylated CHH (mCHH) islands are peaks of CHH methylation that occur primarily upstream to genes. These regions are actively targeted by the methylation machinery, occur at boundaries between heterochromatin and euchromatin, and tend to be near highly expressed genes. Here we took an evolutionary perspective by studying upstream mCHH islands across a sample of eight grass species. Using a statistical approach to define mCHH islands as regions that differ from genome-wide background CHH methylation levels, we demonstrated that mCHH islands are common and associate with 39% of genes, on average. We hypothesized that islands should be more frequent in genomes of large size, because they have more heterochromatin and hence more need for defined boundaries. We found, however, that smaller genomes tended to have a higher proportion of genes associated with 5′ mCHH islands. Consistent with previous work suggesting that islands reflect the silencing of the edge of transposable elements (TEs), genes with nearby TEs were more likely to have mCHH islands. However, the presence of mCHH islands was not a function solely of TEs, both because the underlying sequences of islands were often not homologous to TEs and because genic properties also predicted the presence of 5′ mCHH islands. These genic properties included length and gene-body methylation (gbM); in fact, in three of eight species, the absence of gbM was a stronger predictor of a 5′ mCHH island than TE proximity. In contrast, gene expression level was a positive but weak predictor of the presence of an island. Finally, we assessed whether mCHH islands were evolutionarily conserved by focusing on a set of 2,720 orthologs across the eight species. They were generally not conserved across evolutionary time. Overall, our data establish additional genic properties that are associated with mCHH islands and suggest that they are not just a consequence of the TE silencing machinery. 
    more » « less
  5. Restriction–modification (RM) systems in bacteria are implicated in multiple biological roles ranging from defense against parasitic genetic elements, to selfish addiction cassettes, and barriers to gene transfer and lineage homogenization. In bacteria, DNA-methylation without cognate restriction also plays important roles in DNA replication, mismatch repair, protein expression, and in biasing DNA uptake. Little is known about archaeal RM systems and DNA methylation. To elucidate further understanding for the role of RM systems and DNA methylation in Archaea, we undertook a survey of the presence of RM system genes and related genes, including orphan DNA methylases, in the halophilic archaeal class Halobacteria. Our results reveal that some orphan DNA methyltransferase genes were highly conserved among lineages indicating an important functional constraint, whereas RM systems demonstrated patchy patterns of presence and absence. This irregular distribution is due to frequent horizontal gene transfer and gene loss, a finding suggesting that the evolution and life cycle of RM systems may be best described as that of a selfish genetic element. A putative target motif (CTAG) of one of the orphan methylases was underrepresented in all of the analyzed genomes, whereas another motif (GATC) was overrepresented in most of the haloarchaeal genomes, particularly in those that encoded the cognate orphan methylase. 
    more » « less