skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Modeling RNA:DNA Hybrids with Formal Grammars
R-loops are nucleic acid structures consisting of a DNA:RNA hybrid and a DNA single strand. They form naturally during transcription when the nascent RNA hybridizes to the template DNA, forcing the coding DNA strand to wrap around the RNA:DNA duplex. Although formation of R-loops can have deleterious effects on genome integrity, there is evidence of their role as potential regulators of gene expression and DNA repair. Here we initiate an abstract model based on formal grammars to describe RNA:DNA interactions and the formation of R-loops. Separately we use a sliding window approach that accounts for properties of the DNA nucleotide sequence, such as C-richness and CG-skew, to identify segments favoring R-loops. We evaluate these properties on two DNA plasmids that are known to form R-loops and compare results with a recent energetics model from the Chédin Lab. Our abstract approach for R-loops is an initial step toward a more sophisticated framework which can take into account the effect of DNA topology on R-loop formation.  more » « less
Award ID(s):
1815832
PAR ID:
10292297
Author(s) / Creator(s):
; ; ; ; ;
Editor(s):
Segal, R.; Shtylla, B.; Sindi, S
Date Published:
Journal Name:
Using Mathematics to Understand Biological Complexity
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Segal, R.; Shtylla, B.; Sindi, S (Ed.)
    R-loops are nucleic acid structures consisting of a DNA:RNA hybrid and a DNA single strand. They form naturally during transcription when the nascent RNA hybridizes to the template DNA, forcing the coding DNA strand to wrap around the RNA:DNA duplex. Although formation of R-loops can have deleterious effects on genome integrity, there is evidence of their role as potential regulators of gene expression and DNA repair. Here we initiate an abstract model based on formal grammars to describe RNA:DNA interactions and the formation of R-loops. Separately we use a sliding window approach that accounts for properties of the DNA nucleotide sequence, such as C-richness and CG-skew, to identify segments favoring R-loops. We evaluate these properties on two DNA plasmids that are known to form R-loops and compare results with a recent energetics model from the Chédin Lab. Our abstract approach for R-loops is an initial step toward a more sophisticated framework which can take into account the effect of DNA topology on R-loop formation. 
    more » « less
  2. Flap endonuclease 1 (FEN1) is an essential enzyme that removes RNA primers and base lesions during DNA lagging strand maturation and long-patch base excision repair (BER). It plays a crucial role in maintaining genome stability and integrity. FEN1 is also implicated in RNA processing and biogenesis. A recent study from our group has shown that FEN1 is involved in trinucleotide repeat deletion by processing the RNA strand in R-loops through BER, further suggesting that the enzyme can modulate genome stability by facilitating the resolution of R-loops. However, it remains unknown how FEN1 can process RNA to resolve an R-loop. In this study, we examined the FEN1 cleavage activity on the RNA:DNA hybrid intermediates generated during DNA lagging strand processing and BER in R-loops. We found that both human and yeast FEN1 efficiently cleaved an RNA flap in the intermediates using its endonuclease activity. We further demonstrated that FEN1 was recruited to R-loops in normal human fibroblasts and senataxin-deficient (AOA2) fibroblasts, and its R-loop recruitment was significantly increased by oxidative DNA damage. We showed that FEN1 specifically employed its endonucleolytic cleavage activity to remove the RNA strand in an R-loop during BER. We found that FEN1 coordinated its DNA and RNA endonucleolytic cleavage activity with the 3′-5′ exonuclease of APE1 to resolve the R-loop. Our results further suggest that FEN1 employed its unique tracking mechanism to endonucleolytically cleave the RNA strand in an R-loop by coordinating with other BER enzymes and cofactors during BER. Our study provides the first evidence that FEN1 endonucleolytic cleavage can result in the resolution of R-loops via the BER pathway, thereby maintaining genome integrity. 
    more » « less
  3. Kursula, Petri (Ed.)
    Short segments of RNA displace one strand of a DNA duplex during diverse processes including transcription and CRISPR-mediated immunity and genome editing. These strand exchange events involve the intersection of two geometrically distinct helix types—an RNA:DNA hybrid (A-form) and a DNA:DNA homoduplex (B-form). Although previous evidence suggests that these two helices can stack on each other, it is unknown what local geometric adjustments could enable A-on-B stacking. Here we report the X-ray crystal structure of an RNA-5′/DNA-3′ strand exchange junction at an anisotropic resolution of 1.6 to 2.2 Å. The structure reveals that the A-to-B helical transition involves a combination of helical axis misalignment, helical axis tilting and compression of the DNA strand within the RNA:DNA helix, where nucleotides exhibit a mixture of A- and B-form geometry. These structural principles explain previous observations of conformational stability in RNA/DNA exchange junctions, enabling a nucleic acid architecture that is repeatedly populated during biological strand exchange events. 
    more » « less
  4. Chen, Shi-Jie (Ed.)
    R-loops are a class of non-canonical nucleic acid structures that typically form during transcription when the nascent RNA hybridizes the DNA template strand, leaving the non-template DNA strand unpaired. These structures are abundant in nature and play important physiological and pathological roles. Recent research shows that DNA sequence and topology affect R-loops, yet it remains unclear how these and other factors contribute to R-loop formation. In this work, we investigate the link between nascent RNA folding and the formation of R-loops. We introduce tree-polynomials, a new class of representations of RNA secondary structures. A tree-polynomial representation consists of a rooted tree associated with an RNA secondary structure together with a polynomial that is uniquely identified with the rooted tree. Tree-polynomials enable accurate, interpretable and efficient data analysis of RNA secondary structures without pseudoknots. We develop a computational pipeline for investigating and predicting R-loop formation from a genomic sequence. The pipeline obtains nascent RNA secondary structures from a co-transcriptional RNA folding software, and computes the tree-polynomial representations of the structures. By applying this pipeline to plasmid sequences that contain R-loop forming genes, we establish a strong correlation between the coefficient sums of tree-polynomials and the experimental probability of R-loop formation. Such strong correlation indicates that the pipeline can be used for accurate R-loop prediction. Furthermore, the interpretability of tree-polynomials allows us to characterize the features of RNA secondary structure associated with R-loop formation. In particular, we identify that branches with short stems separated by bulges and interior loops are associated with R-loops. 
    more » « less
  5. Abstract Accumulation of DNA–RNA hybrids in the form of R-loops can result in replication–transcription conflict that leads to the formation of DNA double strand breaks (DSBs). Using null mutants for the two Caenorhabditis elegans genes encoding for RNaseH1 and RNaseH2, we identify novel effects of R-loop accumulation in the germline. R-loop accumulation leads, as expected, to replication stress, followed by the formation of DSBs. A subset of these DSBs are irreparable. However, unlike irreparable DSBs generated in other systems, which trigger permanent cell cycle arrest, germline irreparable DSBs are propagated to oocytes. Despite DNA damage checkpoint activation in the stem cell niche, the signaling cannot be sustained and nuclei with irreparable DNA damage progress into meiosis. Moreover, unlike other forms of DNA damage that increase germline apoptosis, R-loop-generated DSBs remain undetected by the apoptotic checkpoint. This coincides with attenuation of ATM/ATR signaling in mid-to-late meiotic prophase I. These data altogether indicate that in the germline, DSBs that are generated by R-loops can lead to irreparable DSBs that evade cellular machineries designed for damage recognition. These studies implicate germline R-loops as an especially dangerous driver of germline mutagenesis. 
    more » « less