Title: Metagenome mining and functional analysis reveal oxidized guanine DNA repair at the Lost City Hydrothermal Field
The GO DNA repair system protects against GC → TA mutations by finding and removing oxidized guanine. The system is mechanistically well understood but its origins are unknown. We searched metagenomes and abundantly found the genes encoding GO DNA repair at the Lost City Hydrothermal Field (LCHF). We recombinantly expressed the final enzyme in the system to show MutY homologs function to suppress mutations. Microbes at the LCHF thrive without sunlight, fueled by the products of geochemical transformations of seafloor rocks, under conditions believed to resemble a young Earth. High levels of the reductant H2 and low levels of O2 in this environment raise the question, why are resident microbes equipped to repair damage caused by oxidative stress? MutY genes could be assigned to metagenome-assembled genomes (MAGs), and thereby associate GO DNA repair with metabolic pathways that generate reactive oxygen, nitrogen and sulfur species. Our results indicate that cell-based life was under evolutionary pressure to cope with oxidized guanine well before O2 levels rose following the great oxidation event.
Award ID(s):
2204229; 1905249
PAR ID:
10608478
Author(s) / Creator(s):
; ; ; ; ;
Editor(s):
Gupta, Pramodkumar Pyarelal
Publisher / Repository:
Public Library of Science (PLOS)
Date Published:
Journal Name:
PLOS ONE
Volume:
19
Issue:
5
ISSN:
1932-6203
Page Range / eLocation ID:
e0284642
Subject(s) / Keyword(s):
DNA repair; molecular evolution; great oxidation event; BER; base excision repair; MutY
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The GO DNA repair system protects against GC → TA mutations by finding and removing oxidized guanine. The system is mechanistically well understood but its origins are unknown. We searched metagenomes and abundantly found the genes encoding GO DNA repair at the Lost City Hydrothermal Field (LCHF). We recombinantly expressed the final enzyme in the system to show MutY homologs function to suppress mutations. Microbes at the LCHF thrive without sunlight, fueled by the products of geochemical transformations of seafloor rocks, under conditions believed to resemble a young Earth. High levels of the reductant H2 and low levels of O2 in this environment raise the question, why are resident microbes equipped to repair damage caused by oxidative stress? MutY genes could be assigned to metagenome assembled genomes (MAGs), and thereby associate GO DNA repair with metabolic pathways that generate reactive oxygen, nitrogen and sulfur species. Our results indicate that cell-based life was under evolutionary pressure to cope with oxidized guanine well before O2 levels rose following the great oxidation event.
  2. Greening, Chris (Ed.)
    ABSTRACT Aerobes require dioxygen (O2) to grow; anaerobes do not. However, nearly all microbes (aerobes, anaerobes, and facultative organisms alike) express enzymes whose substrates include O2, if only for detoxification. This presents a challenge when trying to assess which organisms are aerobic from genomic data alone. This challenge can be overcome by noting that O2 utilization has wide-ranging effects on microbes: aerobes typically have larger genomes encoding distinctive O2-utilizing enzymes, for example. These effects permit high-quality prediction of O2 utilization from annotated genome sequences, with several models displaying ≈80% accuracy on a ternary classification task for which blind guessing is only 33% accurate. Since genome annotation is compute-intensive and relies on many assumptions, we asked if annotation-free methods also perform well. We discovered that simple and efficient models based entirely on genomic sequence content (e.g., triplets of amino acids) perform as well as intensive annotation-based classifiers, enabling rapid processing of genomes. We further show that amino acid trimers are useful because they encode information about protein composition and phylogeny. To showcase the utility of rapid prediction, we estimated the prevalence of aerobes and anaerobes in diverse natural environments cataloged in the Earth Microbiome Project. Focusing on a well-studied O2 gradient in the Black Sea, we found quantitative correspondence between local chemistry (O2:sulfide concentration ratio) and the composition of microbial communities. We, therefore, suggest that statistical methods like ours might be used to estimate, or “sense,” pivotal features of the chemical environment using DNA sequencing data.
    IMPORTANCE We now have access to sequence data from a wide variety of natural environments. These data document a bewildering diversity of microbes, many known only from their genomes. Physiology (an organism’s capacity to engage metabolically with its environment) may provide a more useful lens than taxonomy for understanding microbial communities. As an example of this broader principle, we developed algorithms that accurately predict microbial dioxygen utilization directly from genome sequences without annotating genes, e.g., by considering only the amino acids in protein sequences. Annotation-free algorithms enable rapid characterization of natural samples, highlighting quantitative correspondence between sequences and local O2 levels in a data set from the Black Sea. This example suggests that DNA sequencing might be repurposed as a multi-pronged chemical sensor, estimating concentrations of O2 and other key facets of complex natural settings.
  3. Abstract We present a new class of DNA‐based nanoswitches that, upon enzymatic repair, could undergo a conformational change mechanism leading to a change in fluorescent signal. Such folding‐upon‐repair DNA nanoswitches are synthetic DNA sequences containing O6‐methyl‐guanine (O6‐MeG) nucleobases and labelled with a fluorophore/quencher optical pair. The nanoswitches are rationally designed so that only upon enzymatic demethylation of the O6‐MeG nucleobases they can form stable intramolecular Hoogsteen interactions and fold into an optically active triplex DNA structure. We have first characterized the folding mechanism induced by the enzymatic repair activity through fluorescent experiments and Molecular Dynamics simulations. We then demonstrated that the folding‐upon‐repair DNA nanoswitches are suitable and specific substrates for different methyltransferase enzymes including the human homologue (hMGMT) and they allow the screening of novel potential methyltransferase inhibitors.
  4. Abstract Lung cancer sequencing efforts have uncovered mutational signatures that are attributed to exposure to the cigarette smoke carcinogen benzo[a]pyrene. Benzo[a]pyrene metabolizes in cells to benzo[a]pyrene diol epoxide (BPDE) and reacts with guanine nucleotides to form bulky BPDE adducts. These DNA adducts block transcription and replication, compromising cell function and survival, and are repaired in human cells by the nucleotide excision repair pathway. Here, we applied high-resolution genomic assays to measure BPDE-induced damage formation and mutagenesis in human cells. We integrated the new damage and mutagenesis data with previous repair, DNA methylation, RNA expression, DNA replication, and chromatin component measurements in the same cell lines, along with lung cancer mutagenesis data. BPDE damage formation is significantly enhanced by DNA methylation and in accessible chromatin regions, including transcribed and early-replicating regions. Binding of transcription factors is associated primarily with reduced, but also enhanced damage formation, depending on the factor. While DNA methylation does not appear to influence repair efficiency, this repair was significantly elevated in accessible chromatin regions, which accumulated fewer mutations. Thus, when damage and repair drive mutagenesis in opposing directions, the final mutational patterns appear to be dictated by the efficiency of repair rather than the frequency of underlying damages. 
  5. Abstract DNA repair proteins can be recruited by their histone reader domains to specific epigenomic features, with consequences on intragenomic mutation rate variation. Here, we investigated H3K4me1-associated hypomutation in plants. We first examined 2 proteins which, in plants, contain Tudor histone reader domains: PRECOCIOUS DISSOCIATION OF SISTERS 5 (PDS5C), involved in homology-directed repair, and MUTS HOMOLOG 6 (MSH6), a mismatch repair protein. The MSH6 Tudor domain of Arabidopsis (Arabidopsis thaliana) binds to H3K4me1 as previously demonstrated for PDS5C, which localizes to H3K4me1-rich gene bodies and essential genes. Mutations revealed by ultradeep sequencing of wild-type and msh6 knockout lines in Arabidopsis show that functional MSH6 is critical for the reduced rate of single-base substitution (SBS) mutations in gene bodies and H3K4me1-rich regions. We explored the breadth of these mechanisms among plants by examining a large rice (Oryza sativa) mutation data set. H3K4me1-associated hypomutation is conserved in rice as are the H3K4me1-binding residues of MSH6 and PDS5C Tudor domains. Recruitment of DNA repair proteins by H3K4me1 in plants reveals convergent, but distinct, epigenome-recruited DNA repair mechanisms from those well described in humans. The emergent model of H3K4me1-recruited repair in plants is consistent with evolutionary theory regarding mutation modifier systems and offers mechanistic insight into intragenomic mutation rate variation in plants. 
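
As an illustration of the annotation-free featurization mentioned in item 2 above (amino acid trimers as sequence-content features), the following is a minimal Python sketch, not the authors' implementation. The function name `trimer_frequencies` and the toy sequences are hypothetical. The sketch only computes relative frequencies of overlapping three-residue windows; how the cited work normalizes such features or passes them to a classifier is not stated in this record.

```python
from collections import Counter


def trimer_frequencies(protein_sequences):
    """Return relative frequencies of overlapping amino acid trimers.

    protein_sequences: iterable of protein strings in one-letter code.
    Hypothetical helper for illustration only.
    """
    counts = Counter()
    for seq in protein_sequences:
        seq = seq.upper()
        # Slide a window of three residues along each sequence.
        for i in range(len(seq) - 2):
            counts[seq[i:i + 3]] += 1
    total = sum(counts.values())
    if total == 0:
        # No sequence was long enough to contain a trimer.
        return {}
    return {trimer: n / total for trimer, n in counts.items()}


# Toy input (made up): two short peptides sharing the trimer "MKV".
example = ["MKVLA", "MKV"]
freqs = trimer_frequencies(example)
```

A vector of such frequencies, computed over all predicted proteins of a genome, is one plausible input for a simple classifier of the kind the abstract describes.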