skip to main content


Title: Structural coordination between active sites of a CRISPR reverse transcriptase-integrase complex
Abstract

CRISPR-Cas systems provide adaptive immunity in bacteria and archaea, beginning with integration of foreign sequences into the host CRISPR genomic locus and followed by transcription and maturation of CRISPR RNAs (crRNAs). In some CRISPR systems, a reverse transcriptase (RT) fusion to the Cas1 integrase and Cas6 maturase creates a single protein that enables concerted sequence integration and crRNA production. To elucidate how the RT-integrase organizes distinct enzymatic activities, we present the cryo-EM structure of a Cas6-RT-Cas1—Cas2 CRISPR integrase complex. The structure reveals a heterohexamer in which the RT directly contacts the integrase and maturase domains, suggesting functional coordination between all three active sites. Together with biochemical experiments, our data support a model of sequential enzymatic activities that enable CRISPR sequence acquisition from RNA and DNA substrates. These findings highlight an expanded capacity of some CRISPR systems to acquire diverse sequences that direct CRISPR-mediated interference.

 
more » « less
Award ID(s):
1817593
NSF-PAR ID:
10226618
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Nature Communications
Volume:
12
Issue:
1
ISSN:
2041-1723
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    CRISPR–Cas adaptive immune systems capture DNA fragments from invading mobile genetic elements and integrate them into the host genome to provide a template for RNA-guided immunity1. CRISPR systems maintain genome integrity and avoid autoimmunity by distinguishing between self and non-self, a process for which the CRISPR/Cas1–Cas2 integrase is necessary but not sufficient2–5. In some microorganisms, the Cas4 endonuclease assists CRISPR adaptation6,7, but many CRISPR–Cas systems lack Cas48. Here we show here that an elegant alternative pathway in a type I-E system uses an internal DnaQ-like exonuclease (DEDDh) to select and process DNA for integration using the protospacer adjacent motif (PAM). The natural Cas1–Cas2/exonuclease fusion (trimmer-integrase) catalyses coordinated DNA capture, trimming and integration. Five cryo-electron microscopy structures of the CRISPR trimmer-integrase, visualized both before and during DNA integration, show how asymmetric processing generates size-defined, PAM-containing substrates. Before genome integration, the PAM sequence is released by Cas1 and cleaved by the exonuclease, marking inserted DNA as self and preventing aberrant CRISPR targeting of the host. Together, these data support a model in which CRISPR systems lacking Cas4 use fused or recruited9,10exonucleases for faithful acquisition of new CRISPR immune sequences.

     
    more » « less
  2. null (Ed.)
    CRISPR-Cas9 is an RNA-guided DNA endonuclease involved in bacterial adaptive immunity and widely repurposed for genome editing in human cells, animals and plants. In bacteria, RNA molecules that guide Cas9′s activity derive from foreign DNA fragments that are captured and integrated into the host CRISPR genomic locus by the Cas1-Cas2 CRISPR integrase. How cells generate the specific lengths of DNA required for integrase capture is a central unanswered question of type II-A CRISPR-based adaptive immunity. Here, we show that an integrase supercomplex comprising guide RNA and the proteins Cas1, Cas2, Csn2 and Cas9 generates precisely trimmed 30-base pair DNA molecules required for genome integration. The HNH active site of Cas9 catalyzes exonucleolytic DNA trimming by a mechanism that is independent of the guide RNA sequence. These results show that Cas9 possesses a distinct catalytic capacity for generating immunological memory in prokaryotes. 
    more » « less
  3. ABSTRACT Viral infection exerts selection pressure on marine microbes, as virus-induced cell lysis causes 20 to 50% of cell mortality, resulting in fluxes of biomass into oceanic dissolved organic matter. Archaeal and bacterial populations can defend against viral infection using the clustered regularly interspaced short palindromic repeat (CRISPR)-associated (Cas) system, which relies on specific matching between a spacer sequence and a viral gene. If a CRISPR spacer match to any gene within a viral genome is equally effective in preventing lysis, no viral genes should be preferentially matched by CRISPR spacers. However, if there are differences in effectiveness, certain viral genes may demonstrate a greater frequency of CRISPR spacer matches. Indeed, homology search analyses of bacterioplankton CRISPR spacer sequences against virioplankton sequences revealed preferential matching of replication proteins, nucleic acid binding proteins, and viral structural proteins. Positive selection pressure for effective viral defense is one parsimonious explanation for these observations. CRISPR spacers from virioplankton metagenomes preferentially matched methyltransferase and phage integrase genes within virioplankton sequences. These virioplankton CRISPR spacers may assist infected host cells in defending against competing phage. Analyses also revealed that half of the spacer-matched viral genes were unknown, some genes matched several spacers, and some spacers matched multiple genes, a many-to-many relationship. Thus, CRISPR spacer matching may be an evolutionary algorithm, agnostically identifying those genes under stringent selection pressure for sustaining viral infection and lysis. Investigating this subset of viral genes could reveal those genetic mechanisms essential to virus-host interactions and provide new technologies for optimizing CRISPR defense in beneficial microbes. IMPORTANCE The CRISPR-Cas system is one means by which bacterial and archaeal populations defend against viral infection which causes 20 to 50% of cell mortality in the ocean. We tested the hypothesis that certain viral genes are preferentially targeted for the initial attack of the CRISPR-Cas system on a viral genome. Using CASC, a pipeline for CRISPR spacer discovery, and metagenome data from oceanic microbes and viruses, we found a clear subset of viral genes with high match frequencies to CRISPR spacers. Moreover, we observed a many-to-many relationship of spacers and viral genes. These high-match viral genes were involved in nucleotide metabolism, DNA methylation, and viral structure. It is possible that CRISPR spacer matching is an evolutionary algorithm pointing to those viral genes most important to sustaining infection and lysis. Studying these genes may advance the understanding of virus-host interactions in nature and provide new technologies for leveraging CRISPR-Cas systems in beneficial microbes. 
    more » « less
  4. Abstract Background

    CRISPR-Cas (clustered regularly interspaced short palindromic repeats—CRISPR-associated proteins) systems are adaptive immune systems commonly found in prokaryotes that provide sequence-specific defense against invading mobile genetic elements (MGEs). The memory of these immunological encounters are stored in CRISPR arrays, where spacer sequences record the identity and history of past invaders. Analyzing such CRISPR arrays provide insights into the dynamics of CRISPR-Cas systems and the adaptation of their host bacteria to rapidly changing environments such as the human gut.

    Results

    In this study, we utilized 601 publicly availableBacteroides fragilisgenome isolates from 12 healthy individuals, 6 of which include longitudinal observations, and 222 availableB. fragilisreference genomes to update the understanding ofB. fragilisCRISPR-Cas dynamics and their differential activities. Analysis of longitudinal genomic data showed that some CRISPR array structures remained relatively stable over time whereas others involved radical spacer acquisition during some periods, and diverse CRISPR arrays (associated with multiple isolates) co-existed in the same individuals with some persisted over time. Furthermore, features of CRISPR adaptation, evolution, and microdynamics were highlighted through an analysis of host-MGE network, such as modules of multiple MGEs and hosts, reflecting complex interactions betweenB. fragilisand its invaders mediated through the CRISPR-Cas systems.

    Conclusions

    We made available of all annotated CRISPR-Cas systems and their target MGEs, and their interaction network as a web resource athttps://omics.informatics.indiana.edu/CRISPRone/Bfragilis. We anticipate it will become an important resource for studying ofB. fragilis, its CRISPR-Cas systems, and its interaction with mobile genetic elements providing insights into evolutionary dynamics that may shape the species virulence and lead to its pathogenicity.

     
    more » « less
  5. Abstract

    CRISPR-associated transposases (CASTs) direct DNA integration downstream of target sites using the RNA-guided DNA binding activity of nuclease-deficient CRISPR-Cas systems. Transposition relies on several key protein-protein and protein-DNA interactions, but little is known about the explicit sequence requirements governing efficient transposon DNA integration activity. Here, we exploit pooled library screening and high-throughput sequencing to reveal novel sequence determinants during transposition by the Type I-F Vibrio cholerae CAST system (VchCAST). On the donor DNA, large transposon end libraries revealed binding site nucleotide preferences for the TnsB transposase, as well as an additional conserved region that encoded a consensus binding site for integration host factor (IHF). Remarkably, we found that VchCAST requires IHF for efficient transposition, thus revealing a novel cellular factor involved in CRISPR-associated transpososome assembly. On the target DNA, we uncovered preferred sequence motifs at the integration site that explained previously observed heterogeneity with single-base pair resolution. Finally, we exploited our library data to design modified transposon variants that enable in-frame protein tagging. Collectively, our results provide new clues about the assembly and architecture of the paired-end complex formed between TnsB and the transposon DNA, and inform the design of custom payload sequences for genome engineering applications with CAST systems.

     
    more » « less