skip to main content


Title: Natural variation in DNA methylation homeostasis and the emergence of epialleles

In plants and mammals, DNA methylation plays a critical role in transcriptional silencing by delineating heterochromatin from transcriptionally active euchromatin. A homeostatic balance between heterochromatin and euchromatin is essential to genomic stability. This is evident in many diseases and mutants for heterochromatin maintenance, which are characterized by global losses of DNA methylation coupled with localized ectopic gains of DNA methylation that alter transcription. Furthermore, we have shown that genome-wide methylation patterns inArabidopsis thalianaare highly stable over generations, with the exception of rare epialleles. However, the extent to which natural variation in the robustness of targeting DNA methylation to heterochromatin exists, and the phenotypic consequences of such variation, remain to be fully explored. Here we describe the finding that heterochromatin and genic DNA methylation are highly variable among 725A. thalianaaccessions. We found that genic DNA methylation is inversely correlated with that in heterochromatin, suggesting that certain methylation pathway(s) may be redirected to genes upon the loss of heterochromatin. This redistribution likely involves a feedback loop involving the DNA methyltransferase, CHROMOMETHYLASE 3 (CMT3), H3K9me2, and histone turnover, as highly expressed, long genes with a high density of CMT3-preferred CWG sites are more likely to be methylated. Importantly, although the presence of CG methylation in genes alone may not affect transcription, genes containing CG methylation are more likely to become methylated at non-CG sites and silenced. These findings are consistent with the hypothesis that natural variation in DNA methylation homeostasis may underlie the evolution of epialleles that alter phenotypes.

 
more » « less
Award ID(s):
1856143
NSF-PAR ID:
10135233
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Proceedings of the National Academy of Sciences
Date Published:
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
117
Issue:
9
ISSN:
0027-8424
Page Range / eLocation ID:
p. 4874-4884
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Gene duplication is a source of evolutionary novelty. DNA methylation may play a role in the evolution of duplicate genes (paralogs) through its association with gene expression. While this relationship has been examined to varying extents in a few individual species, the generalizability of these results at either a broad phylogenetic scale with species of differing duplication histories or across a population remains unknown. We applied a comparative epigenomic approach to 43 angiosperm species across the phylogeny and a population of 928 Arabidopsis (Arabidopsis thaliana) accessions, examining the association of DNA methylation with paralog evolution. Genic DNA methylation was differentially associated with duplication type, the age of duplication, sequence evolution, and gene expression. Whole-genome duplicates were typically enriched for CG-only gene body methylated or unmethylated genes, while single-gene duplications were typically enriched for non-CG methylated or unmethylated genes. Non-CG methylation, in particular, was a characteristic of more recent single-gene duplicates. Core angiosperm gene families were differentiated into those which preferentially retain paralogs and “duplication-resistant” families, which convergently reverted to singletons following duplication. Duplication-resistant families that still have paralogous copies were, uncharacteristically for core angiosperm genes, enriched for non-CG methylation. Non-CG methylated paralogs had higher rates of sequence evolution, higher frequency of presence–absence variation, and more limited expression. This suggests that silencing by non-CG methylation may be important to maintaining dosage following duplication and be a precursor to fractionation. Our results indicate that genic methylation marks differing evolutionary trajectories and fates between paralogous genes and have a role in maintaining dosage following duplication.

     
    more » « less
  2. In many plant species, a subset of transcribed genes are characterized by strictly CG-context DNA methylation, referred to as gene body methylation (gbM). The mechanisms that establish gbM are unclear, yet flowering plant species naturally without gbM lack the DNA methyltransferase, CMT3, which maintains CHG (H = A, C, or T) and not CG methylation at constitutive heterochromatin. Here, we identify the mechanistic basis for gbM establishment by expressing CMT3 in a species naturally lacking CMT3. CMT3 expression reconstituted gbM through a progression of de novo CHG methylation on expressed genes, followed by the accumulation of CG methylation that could be inherited even following loss of the CMT3 transgene. Thus, gbM likely originates from the simultaneous targeting of loci by pathways that promote euchromatin and heterochromatin, which primes genes for the formation of stably inherited epimutations in the form of CG DNA methylation. 
    more » « less
  3. Abstract Epialleles are meiotically heritable variations in expression states that are independent from changes in DNA sequence. Although they are common in plant genomes, their molecular origins are unknown. Here we show, using mutant and experimental populations, that epialleles in Arabidopsis thaliana that result from ectopic hypermethylation are due to feedback regulation of pathways that primarily function to maintain DNA methylation at heterochromatin. Perturbations to maintenance of heterochromatin methylation leads to feedback regulation of DNA methylation in genes. Using single base resolution methylomes from epigenetic recombinant inbred lines (epiRIL), we show that epiallelic variation is abundant in euchromatin, yet, associates with QTL primarily in heterochromatin regions. Mapping three-dimensional chromatin contacts shows that genes that are hotspots for ectopic hypermethylation have increases in contact frequencies with regions possessing H3K9me2. Altogether, these data show that feedback regulation of pathways that have evolved to maintain heterochromatin silencing leads to the origins of spontaneous hypermethylated epialleles. 
    more » « less
  4. Lerat, Emmanuelle (Ed.)
    Abstract Methylated CHH (mCHH) islands are peaks of CHH methylation that occur primarily upstream to genes. These regions are actively targeted by the methylation machinery, occur at boundaries between heterochromatin and euchromatin, and tend to be near highly expressed genes. Here we took an evolutionary perspective by studying upstream mCHH islands across a sample of eight grass species. Using a statistical approach to define mCHH islands as regions that differ from genome-wide background CHH methylation levels, we demonstrated that mCHH islands are common and associate with 39% of genes, on average. We hypothesized that islands should be more frequent in genomes of large size, because they have more heterochromatin and hence more need for defined boundaries. We found, however, that smaller genomes tended to have a higher proportion of genes associated with 5′ mCHH islands. Consistent with previous work suggesting that islands reflect the silencing of the edge of transposable elements (TEs), genes with nearby TEs were more likely to have mCHH islands. However, the presence of mCHH islands was not a function solely of TEs, both because the underlying sequences of islands were often not homologous to TEs and because genic properties also predicted the presence of 5′ mCHH islands. These genic properties included length and gene-body methylation (gbM); in fact, in three of eight species, the absence of gbM was a stronger predictor of a 5′ mCHH island than TE proximity. In contrast, gene expression level was a positive but weak predictor of the presence of an island. Finally, we assessed whether mCHH islands were evolutionarily conserved by focusing on a set of 2,720 orthologs across the eight species. They were generally not conserved across evolutionary time. Overall, our data establish additional genic properties that are associated with mCHH islands and suggest that they are not just a consequence of the TE silencing machinery. 
    more » « less
  5. Summary

    Processes affecting rates of sequence polymorphism are fundamental to the evolution of gene duplicates. The relationship between gene activity and sequence polymorphism can influence the likelihood that functionally redundant gene copies are co‐maintained in stable evolutionary equilibria vs other outcomes such as neofunctionalization.

    Here, we investigate genic variation in epigenome‐associated polymorphism rates inArabidopsis thalianaand consider whether these affect the evolution of gene duplicates. We compared the frequency of sequence polymorphism and patterns of genetic differentiation between genes classified by exon methylation patterns: unmethylated (unM), gene‐body methylated (gbM), and transposon‐like methylated (teM) states, which reflect divergence in gene expression.

    We found that the frequency of polymorphism was higher in teM (transcriptionally repressed, tissue‐specific) genes and lower in gbM (active, constitutively expressed) genes. Comparisons of gene duplicates were largely consistent with genome‐wide patterns – gene copies that exhibit teM accumulate more variation, evolve faster, and are in chromatin states associated with reduced DNA repair.

    This relationship between expression, the epigenome, and polymorphism may lead to the breakdown of equilibrium states that would otherwise maintain genetic redundancies. Epigenome‐mediated polymorphism rate variation may facilitate the evolution of novel gene functions in duplicate paralogs maintained over evolutionary time.

     
    more » « less