Title: Structural domains of SARS-CoV-2 nucleocapsid protein coordinate to compact long nucleic acid substrates
Abstract: The SARS-CoV-2 nucleocapsid (N) protein performs several functions including binding, compacting, and packaging the ∼30 kb viral genome into the viral particle. N protein consists of two ordered domains, with the N-terminal domain (NTD) primarily associated with RNA binding and the C-terminal domain (CTD) primarily associated with dimerization/oligomerization, and three intrinsically disordered regions: an N-arm, a C-tail, and a linker that connects the NTD and CTD. We utilize an optical tweezers system to isolate a long single-stranded nucleic acid substrate and directly measure the binding and packaging function of N protein at the single-molecule level in real time. We find that N protein binds the nucleic acid substrate with high affinity before oligomerizing and forming a highly compact structure. By comparing the activities of truncated protein variants missing the NTD, CTD, and/or linker, we attribute specific steps in this process to the structural domains of N protein, with the NTD driving initial binding to the substrate and ensuring high localized protein density that triggers interprotein interactions mediated by the CTD, which forms a compact and stable protein-nucleic acid complex suitable for packaging into the virion.
Award ID(s):
1817712
PAR ID:
10386253
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Nucleic Acids Research
Volume:
51
Issue:
1
ISSN:
0305-1048
Page Range / eLocation ID:
p. 290-303
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Human γD-crystallin, a monomeric protein abundant in the eye lens nucleus, must remain stably folded for an individual’s entire lifetime to avoid aggregation and protein deposition-associated cataract formation. γD-crystallin contains two homologous domains, an N-terminal domain (NTD) and a C-terminal domain (CTD), which interact via a hydrophobic interface. Several familial mutations in the gamma crystallin gene are linked to congenital early-onset cataract, most of which affect the NTD. Some of these, including V75D and W42R, are known to populate intermediates under partially denaturing conditions possessing a natively folded CTD and a completely unfolded NTD. We employed hydrogen–deuterium exchange mass spectrometry to probe the structural and energetic features of variants of γD-crystallin under both native and partially denaturing conditions. For V75D and W42R, we identify a species under native conditions that retains partial structure in the NTD and is structurally and energetically distinct from the intermediate populated under partially denaturing conditions. Residues at the NTD–CTD interface play crucial roles in stabilizing this intermediate, and disruption of interface contacts either by amino acid substitution or partial denaturation permits direct observation of two intermediates simultaneously. These data suggest that the intermediate identified under native conditions is accessed from the native state and not on the folding pathway. The intermediate we have identified here exposes hydrophobic amino acids that are buried in both the folded full-length protein and in the protein’s stable isolated domains. Such nonnative exposure of a hydrophobic patch may play an important role in cataract formation. 
  2. The nucleocapsid protein (N) of SARS-CoV-2 is essential for virus replication, genome packaging, evading host immunity, and virus maturation. N is a multidomain protein composed of an independently folded monomeric N-terminal domain that is the primary site for RNA binding and a dimeric C-terminal domain that is essential for efficient phase separation and condensate formation with RNA. The domains are separated by a disordered Ser/Arg-rich region preceding a self-associating Leu-rich helix. Phosphorylation in the Ser/Arg region in infected cells decreases the viscosity of N:RNA condensates, promoting viral replication and host immune evasion. The molecular-level effect of phosphorylation, however, is missing from our current understanding. Using NMR spectroscopy and analytical ultracentrifugation, we show that phosphorylation destabilizes the self-associating Leu-rich helix 30 amino acids distant from the phosphorylation site. NMR and gel shift assays demonstrate that RNA binding by the linker is dampened by phosphorylation, whereas RNA binding to the full-length protein is not significantly affected, presumably due to retained strong interactions with the primary RNA-binding domain. Introducing a switchable self-associating domain to replace the Leu-rich helix confirms the importance of linker self-association to droplet formation and suggests that phosphorylation not only increases solubility of the positively charged elongated Ser/Arg region, as observed in other RNA-binding proteins, but can also inhibit self-association of the Leu-rich helix. These data highlight the effect of phosphorylation both at local sites and at a distant self-associating hydrophobic helix in regulating liquid-liquid phase separation of the entire protein.
  3. T-Cell Intracellular Antigen-1 (TIA1) is a 43 kDa multi-domain RNA-binding protein involved in stress granule formation during eukaryotic stress response, and has been implicated in neurodegenerative diseases including Welander distal myopathy and amyotrophic lateral sclerosis. TIA1 contains three RNA recognition motifs (RRMs), which are capable of binding nucleic acids, and a C-terminal Q/N-rich prion-related domain (PRD), which has been variously described as intrinsically disordered or prion inducing and is believed to play a role in promoting the liquid-liquid phase separation connected with the assembly of stress granules. Motivated by the fact that our prior work shows RRMs 2 and 3 are well-ordered in an oligomeric full-length form, while RRM1 and the PRD appear to phase separate, the present work addresses whether the oligomeric form is functional and competent for binding, and probes the consequences of nucleic acid binding for oligomerization and protein conformation change. New SSNMR data show that ssDNA binds to full-length oligomeric TIA1 primarily at the RRM2 domain, but also weakly at the RRM3 domain, and Zn2+ binds primarily to RRM3. Binding of Zn2+ and DNA was reversible for the full-length wild-type oligomeric form, and did not lead to formation of amyloid fibrils, despite the presence of the C-terminal prion-related domain. While TIA1:DNA complexes appear as long “daisy chained” structures, the addition of Zn2+ caused the structures to collapse. We surmise that this points to a regulatory role for Zn2+. By occupying various “half” binding sites on RRM3, Zn2+ may shift the nucleic acid binding off RRM3 and onto RRM2. More importantly, the use of different half sites on different monomers may introduce a mesh of crosslinks in the supramolecular complex, rendering it compact and markedly reducing the access to the nucleic acids (including transcripts) from solution.
  4. Accurate gene transcription in eukaryotes depends on isomerization of serine-proline bonds within the carboxy-terminal domain (CTD) of RNA polymerase II. Isomerization is part of the “CTD code” that regulates recruitment of proteins required for transcription and co-transcriptional RNA processing. Saccharomyces cerevisiae Ess1 and its human ortholog, Pin1, are prolyl isomerases that engage the long heptad repeat (YSPTSPS)₂₆ of the CTD by an unknown mechanism. Here, we used an integrative structural approach to decipher Ess1 interactions with the CTD. Ess1 has a rigid linker between its WW and catalytic domains that enforces a distance constraint for bivalent interaction with the ends of long CTD substrates (≥4–5 heptad repeats). Our binding results suggest that the Ess1 WW domain anchors the proximal end of the CTD substrate during isomerization, and that linker divergence may underlie evolution of substrate specificity.
  5. Bacteriophage T4 gene 32 protein (gp32) is a single-stranded DNA (ssDNA) binding protein essential for DNA replication. gp32 forms stable protein filaments on ssDNA through cooperative interactions between its core and N-terminal domain. gp32's C-terminal domain (CTD) is believed to primarily help coordinate DNA replication via direct interactions with constituents of the replisome. However, the exact mechanisms of these interactions are not known, and it is unclear how tightly bound gp32 filaments are readily displaced from ssDNA as required for genomic processing. Here, we utilized truncated gp32 variants to demonstrate a key role of the CTD in regulating gp32 dissociation. Using optical tweezers, we probed the binding and dissociation dynamics of CTD-truncated gp32, *I, to an 8.1 knt ssDNA molecule and compared these measurements with those for full-length gp32. The *I-ssDNA helical filament becomes progressively unwound with increased protein concentration but remains significantly more stable than that of full-length, wild-type gp32. Protein oversaturation, concomitant with filament unwinding, facilitates rapid dissociation of full-length gp32 from across the entire ssDNA segment. In contrast, *I primarily unbinds slowly from only the ends of the cooperative clusters, regardless of the protein density and degree of DNA unwinding. Our results suggest that the CTD may constrain the relative twist angle of proteins within the ssDNA filament such that upon critical unwinding the cooperative interprotein interactions largely vanish, facilitating prompt removal of gp32. We propose a model of CTD-mediated gp32 displacement via internal restructuring of its filament, providing a mechanism for rapid ssDNA clearing during genomic processing.