skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Convergent evolution of epigenome recruited DNA repair across the Tree of Life
Abstract Mutations fuel evolution while also causing diseases like cancer. Epigenome-targeted DNA repair can help organisms protect important genomic regions from mutation. However, the adaptive value, mechanistic diversity, and evolution of epigenome-targeted DNA repair systems across the tree of life remain unresolved. Here, we investigated the evolution of histone reader domains fused to the DNA repair protein MSH6 (MutS Homolog 6) across over 4,000 eukaryotes. We uncovered a paradigmatic example of convergent evolution: MSH6 has independently acquired distinct histone reader domains; PWWP (metazoa) and Tudor (plants), previously shown to target histone modifications in active genes in humans (H3K36me3) and Arabidopsis (H3K4me1). Conservation in MSH6 histone reader domains shows signatures of natural selection, particularly for amino acids that bind specific histone modifications. Species that have gained or retained MSH6 histone readers tend to have larger genome sizes, especially marked by significantly more introns in genic regions. These patterns support previous theoretical predictions about the co-evolution of genome architectures and mutation rate heterogeneity. The evolution of epigenome-targeted DNA repair has implications for genome evolution, health, and the mutational origins of genetic diversity across the tree of life.  more » « less
Award ID(s):
2338236
PAR ID:
10665863
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
eLife
Date Published:
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract DNA repair proteins can be recruited by their histone reader domains to specific epigenomic features, with consequences on intragenomic mutation rate variation. Here, we investigated H3K4me1-associated hypomutation in plants. We first examined 2 proteins which, in plants, contain Tudor histone reader domains: PRECOCIOUS DISSOCIATION OF SISTERS 5 (PDS5C), involved in homology-directed repair, and MUTS HOMOLOG 6 (MSH6), a mismatch repair protein. The MSH6 Tudor domain of Arabidopsis (Arabidopsis thaliana) binds to H3K4me1 as previously demonstrated for PDS5C, which localizes to H3K4me1-rich gene bodies and essential genes. Mutations revealed by ultradeep sequencing of wild-type and msh6 knockout lines in Arabidopsis show that functional MSH6 is critical for the reduced rate of single-base substitution (SBS) mutations in gene bodies and H3K4me1-rich regions. We explored the breadth of these mechanisms among plants by examining a large rice (Oryza sativa) mutation data set. H3K4me1-associated hypomutation is conserved in rice as are the H3K4me1-binding residues of MSH6 and PDS5C Tudor domains. Recruitment of DNA repair proteins by H3K4me1 in plants reveals convergent, but distinct, epigenome-recruited DNA repair mechanisms from those well described in humans. The emergent model of H3K4me1-recruited repair in plants is consistent with evolutionary theory regarding mutation modifier systems and offers mechanistic insight into intragenomic mutation rate variation in plants. 
    more » « less
  2. Komeili, Arash (Ed.)
    ABSTRACT Histone proteins are found across diverse lineages of Archaea , many of which package DNA and form chromatin. However, previous research has led to the hypothesis that the histone-like proteins of high-salt-adapted archaea, or halophiles, function differently. The sole histone protein encoded by the model halophilic species Halobacterium salinarum , HpyA, is nonessential and expressed at levels too low to enable genome-wide DNA packaging. Instead, HpyA mediates the transcriptional response to salt stress. Here we compare the features of genome-wide binding of HpyA to those of HstA, the sole histone of another model halophile, Haloferax volcanii . hstA , like hpyA , is a nonessential gene. To better understand HpyA and HstA functions, protein-DNA binding data (chromatin immunoprecipitation sequencing [ChIP-seq]) of these halophilic histones are compared to publicly available ChIP-seq data from DNA binding proteins across all domains of life, including transcription factors (TFs), nucleoid-associated proteins (NAPs), and histones. These analyses demonstrate that HpyA and HstA bind the genome infrequently in discrete regions, which is similar to TFs but unlike NAPs, which bind a much larger genomic fraction. However, unlike TFs that typically bind in intergenic regions, HpyA and HstA binding sites are located in both coding and intergenic regions. The genome-wide dinucleotide periodicity known to facilitate histone binding was undetectable in the genomes of both species. Instead, TF-like and histone-like binding sequence preferences were detected for HstA and HpyA, respectively. Taken together, these data suggest that halophilic archaeal histones are unlikely to facilitate genome-wide chromatin formation and that their function defies categorization as a TF, NAP, or histone. IMPORTANCE Most cells in eukaryotic species—from yeast to humans—possess histone proteins that pack and unpack DNA in response to environmental cues. These essential proteins regulate genes necessary for important cellular processes, including development and stress protection. Although the histone fold domain originated in the domain of life Archaea , the function of archaeal histone-like proteins is not well understood relative to those of eukaryotes. We recently discovered that, unlike histones of eukaryotes, histones in hypersaline-adapted archaeal species do not package DNA and can act as transcription factors (TFs) to regulate stress response gene expression. However, the function of histones across species of hypersaline-adapted archaea still remains unclear. Here, we compare hypersaline histone function to a variety of DNA binding proteins across the tree of life, revealing histone-like behavior in some respects and specific transcriptional regulatory function in others. 
    more » « less
  3. Abstract Owing to advances in genome sequencing, genome stability has become one of the most scrutinized cellular traits across the Tree of Life. Despite its centrality to all things biological, the mutation rate (per nucleotide site per generation) ranges over three orders of magnitude among species and several‐fold within individual phylogenetic lineages. Within all major organismal groups, mutation rates scale negatively with the effective population size of a species and with the amount of functional DNA in the genome. This relationship is most parsimoniously explained by the drift‐barrier hypothesis, which postulates that natural selection typically operates to reduce mutation rates until further improvement is thwarted by the power of random genetic drift. Despite this constraint, the molecular mechanisms underlying DNA replication fidelity and repair are free to wander, provided the performance of the entire system is maintained at the prevailing level. The evolutionary flexibility of the mutation rate bears on the resolution of several prior conundrums in phylogenetic and population‐genetic analysis and raises challenges for future applications in these areas. 
    more » « less
  4. null (Ed.)
    Epigenetic information affects gene function by interacting with chromatin, while not changing the DNA sequence itself. However, it has become apparent that the interactions between epigenetic information and chromatin can, in fact, indirectly lead to DNA mutations and ultimately influence genome evolution. This review evaluates the ways in which epigenetic information affects genome sequence and evolution. We discuss how DNA methylation has strong and pervasive effects on DNA sequence evolution in eukaryotic organisms. We also review how the physical interactions arising from the connections between histone proteins and DNA affect DNA mutation and repair. We then discuss how a variety of epigenetic mechanisms exert substantial effects on genome evolution by suppressing the movement of transposable elements. Finally, we examine how genome expansion through gene duplication is also partially controlled by epigenetic information. Overall, we conclude that epigenetic information has widespread indirect effects on DNA sequences in eukaryotes and represents a potent cause and constraint of genome evolution. This article is part of the theme issue ‘How does epigenetics influence the course of evolution?’ 
    more » « less
  5. Abstract DNA double-strand breaks require repair or risk corrupting the language of life. To ensure genome integrity and viability, multiple DNA double-strand break repair pathways function in eukaryotes. Two such repair pathways, canonical non-homologous end joining and homologous recombination, have been extensively studied, while other pathways such as microhomology-mediated end joint and single-strand annealing, once thought to serve as back-ups, now appear to play a fundamental role in DNA repair. Here, we review the molecular details and hierarchy of these four DNA repair pathways, and where possible, a comparison for what is known between animal and fungal models. We address the factors contributing to break repair pathway choice, and aim to explore our understanding and knowledge gaps regarding mechanisms and regulation in filamentous pathogens. We additionally discuss how DNA double-strand break repair pathways influence genome engineering results, including unexpected mutation outcomes. Finally, we review the concept of biased genome evolution in filamentous pathogens, and provide a model, termed Biased Variation, that links DNA double-strand break repair pathways with properties of genome evolution. Despite our extensive knowledge for this universal process, there remain many unanswered questions, for which the answers may improve genome engineering and our understanding of genome evolution. 
    more » « less