Title: Mechanistic details of CRISPR-associated transposon recruitment and integration revealed by cryo-EM
CRISPR-associated transposons (CASTs) are Tn7-like elements that are capable of RNA-guided DNA integration. Although structural data are known for nearly all core transposition components, the transposase component, TnsB, remains uncharacterized. Using cryo-electron microscopy (cryo-EM) structure determination, we reveal the conformation of TnsB during transposon integration for the type V-K CAST system from Scytonema hofmanni (ShCAST). Our structure of TnsB is a tetramer, revealing strong mechanistic relationships with the overall architecture of RNaseH transposases/integrases in general, and in particular the MuA transposase from bacteriophage Mu. However, key structural differences in the C-terminal domains indicate that TnsB’s tetrameric architecture is stabilized by a different set of protein–protein interactions compared with MuA. We describe the base-specific interactions along the TnsB binding site, which explain how different CAST elements can function on cognate mobile elements independent of one another. We observe that melting of the 5′ nontransferred strand of the transposon end is a structural feature stabilized by TnsB and furthermore is crucial for donor–DNA integration. Although not observed in the TnsB strand-transfer complex, the C-terminal end of TnsB serves a crucial role in transposase recruitment to the target site. The C-terminal end of TnsB adopts a short, structured 15-residue “hook” that decorates TnsC filaments. Unlike full-length TnsB, C-terminal fragments do not appear to stimulate filament disassembly using two different assays, suggesting that additional interactions between TnsB and TnsC are required for redistributing TnsC to appropriate targets. The structural information presented here will help guide future work in modifying these important systems as programmable gene integration tools.
Award ID(s):
1719875
NSF-PAR ID:
10411614
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
119
Issue:
32
ISSN:
0027-8424
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    CRISPR-associated transposases (CASTs) direct DNA integration downstream of target sites using the RNA-guided DNA binding activity of nuclease-deficient CRISPR-Cas systems. Transposition relies on several key protein-protein and protein-DNA interactions, but little is known about the explicit sequence requirements governing efficient transposon DNA integration activity. Here, we exploit pooled library screening and high-throughput sequencing to reveal novel sequence determinants during transposition by the Type I-F Vibrio cholerae CAST system (VchCAST). On the donor DNA, large transposon end libraries revealed binding site nucleotide preferences for the TnsB transposase, as well as an additional conserved region that encoded a consensus binding site for integration host factor (IHF). Remarkably, we found that VchCAST requires IHF for efficient transposition, thus revealing a novel cellular factor involved in CRISPR-associated transpososome assembly. On the target DNA, we uncovered preferred sequence motifs at the integration site that explained previously observed heterogeneity with single-base pair resolution. Finally, we exploited our library data to design modified transposon variants that enable in-frame protein tagging. Collectively, our results provide new clues about the assembly and architecture of the paired-end complex formed between TnsB and the transposon DNA, and inform the design of custom payload sequences for genome engineering applications with CAST systems.

     
  2. Abstract

    A multimer of retroviral integrase (IN) synapses viral DNA ends within a stable intasome nucleoprotein complex for integration into a host cell genome. Reconstitution of the intasome from the maedi-visna virus (MVV), an ovine lentivirus, revealed a large assembly containing sixteen IN subunits [1]. Herein, we report cryo-EM structures of the lentiviral intasome prior to engagement of target DNA and following strand transfer, refined at 3.4 and 3.5 Å resolution, respectively. The structures elucidate details of the protein-protein and protein-DNA interfaces involved in lentiviral intasome formation. We show that the homomeric interfaces involved in IN hexadecamer formation and the α-helical configuration of the linker connecting the C-terminal and catalytic core domains are critical for MVV IN strand transfer activity in vitro and for virus infectivity. Single-molecule microscopy in conjunction with photobleaching reveals that the MVV intasome can bind a variable number, up to sixteen molecules, of the lentivirus-specific host factor LEDGF/p75. Concordantly, ablation of endogenous LEDGF/p75 results in gross redistribution of MVV integration sites in human and ovine cells. Our data confirm the importance of the expanded architecture observed in cryo-EM studies of lentiviral intasomes and suggest that this organization underlies multivalent interactions with chromatin for integration targeting to active genes.

     
  3. Abstract Transposable elements represent the largest components of many eukaryotic genomes and different genomes harbor different combinations of elements. Here, we discovered a novel DNA transposon in the genome of the clubmoss Selaginella lepidophylla. Further searching for related sequences to the conserved DDE region uncovered the presence of this superfamily of elements in fish, coral, sea anemone, and other animal species. However, this element appears restricted to Bryophytes and Lycophytes in plants. This transposon, named GingerRoot, is associated with a 6 bp (base pair) target site duplication, and 100–150 bp terminal inverted repeats. Analysis of transposase sequences identified the DDE motif, a catalytic domain, which shows similarity to the integrase of Gypsy-like long terminal repeat retrotransposons, the most abundant component in plant genomes. A total of 77 intact and several hundred truncated copies of GingerRoot elements were identified in S. lepidophylla. Like Gypsy retrotransposons, GingerRoots show a lack of insertion preference near genes, which contrasts with the compact genome size of about 100 Mb. Nevertheless, a considerable portion of GingerRoot elements was found to carry gene fragments, suggesting that the capacity to duplicate gene sequences is unlikely to be attributable to proximity to genes. Elements carrying gene fragments appear to be less methylated, more diverged, and more distal to genes than those without gene fragments, indicating they are preferentially retained in gene-poor regions. This study has identified a broadly dispersed, novel DNA transposon, and the first plant DNA transposon with an integrase-related transposase, suggesting the possibility of de novo formation of Gypsy-like elements in plants.
  4. Frappier, Lori (Ed.)
    ABSTRACT Ubiquitous and abundant in ecosystems and microbiomes, gokushoviruses constitute a Microviridae subfamily, distantly related to bacteriophages ΦX174, α3, and G4. A high-resolution cryo-EM structure of gokushovirus ΦEC6098 was determined, and the atomic model was built de novo. Although gokushoviruses lack external scaffolding and spike proteins, which extensively interact with the ΦX174 capsid protein, the core of the ΦEC6098 coat protein (VP1) displayed a similar structure. There are, however, key differences. At each ΦEC6098 icosahedral 3-fold axis, a long insertion loop formed mushroom-like protrusions, which have been noted in lower-resolution gokushovirus structures. Hydrophobic interfaces at the bottom of these protrusions may confer stability to the capsid shell. In ΦX174, the N-terminus of the capsid protein resides directly atop the 3-fold axes of symmetry; however, the ΦEC6098 N-terminus stretched across the inner surface of the capsid shell, reaching nearly to the 5-fold axis of the neighboring pentamer. Thus, this extended N-terminus interconnected pentamers on the inside of the capsid shell, presumably promoting capsid assembly, a function performed by the ΦX174 external scaffolding protein. There were also key differences between the ΦX174-like DNA-binding J proteins and their ΦEC6098 homologue VP8. As seen with the J proteins, C-terminal VP8 residues were bound into a pocket within the major capsid protein; however, its N-terminal residues were disordered, likely due to flexibility. We show that the combined location and interaction of VP8’s C-terminus and a portion of VP1’s N-terminus are reminiscent of those seen with the ΦX174 and α3 J proteins. IMPORTANCE There is a dramatic structural and morphogenetic divide within the Microviridae. The well-studied ΦX174-like viruses have prominent spikes at their icosahedral vertices, which are absent in gokushoviruses. Instead, gokushovirus major coat proteins form extensive mushroom-like protrusions at the 3-fold axes of symmetry. In addition, gokushoviruses lack an external scaffolding protein, the more critical of the two ΦX174 assembly proteins, but retain an internal scaffolding protein. The ΦEC6098 virion suggests that key external scaffolding functions are likely performed by coat protein domains unique to gokushoviruses. Thus, within one family, different assembly paths have been taken, demonstrating how a two-scaffolding protein system can evolve into a one-scaffolding protein system, or vice versa.
  5. Heterochromatin is most often associated with eukaryotic organisms. Yet, bacteria also contain areas with densely protein-occupied chromatin that appear to silence gene expression. One nucleoid-associated silencing factor is the conserved protein Hfq. Although seemingly nonspecific in its DNA binding properties, Hfq is strongly enriched at AT-rich DNA regions, characteristic of prophages and mobile genetic elements. Here, we demonstrate that polyphosphate (polyP), an ancient and highly conserved polyanion, is essential for the site-specific DNA binding properties of Hfq in bacteria. Absence of polyP markedly alters the DNA binding profile of Hfq, causes unsolicited prophage and transposon mobilization, and increases mutagenesis rates and DNA damage–induced cell death. In vitro reconstitution of the system revealed that Hfq and polyP interact with AT-rich DNA sequences and form phase-separated condensates, a process that is mediated by the intrinsically disordered C-terminal extensions of Hfq. We propose that polyP serves as a newly identified driver of heterochromatin formation in bacteria. 