skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Structure analysis suggests Ess1 isomerizes the carboxy-terminal domain of RNA polymerase II via a bivalent anchoring mechanism
Abstract Accurate gene transcription in eukaryotes depends on isomerization of serine-proline bonds within the carboxy-terminal domain (CTD) of RNA polymerase II. Isomerization is part of the “CTD code” that regulates recruitment of proteins required for transcription and co-transcriptional RNA processing. Saccharomyces cerevisiae Ess1 and its human ortholog, Pin1, are prolyl isomerases that engage the long heptad repeat (YSPTSPS) 26 of the CTD by an unknown mechanism. Here, we used an integrative structural approach to decipher Ess1 interactions with the CTD. Ess1 has a rigid linker between its WW and catalytic domains that enforces a distance constraint for bivalent interaction with the ends of long CTD substrates (≥4–5 heptad repeats). Our binding results suggest that the Ess1 WW domain anchors the proximal end of the CTD substrate during isomerization, and that linker divergence may underlie evolution of substrate specificity.  more » « less
Award ID(s):
1750462
PAR ID:
10249500
Author(s) / Creator(s):
; ; ; ; ; ;
Date Published:
Journal Name:
Communications Biology
Volume:
4
Issue:
1
ISSN:
2399-3642
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract The SARS-CoV-2 nucleocapsid (N) protein performs several functions including binding, compacting, and packaging the ∼30 kb viral genome into the viral particle. N protein consists of two ordered domains, with the N terminal domain (NTD) primarily associated with RNA binding and the C terminal domain (CTD) primarily associated with dimerization/oligomerization, and three intrinsically disordered regions, an N-arm, a C-tail, and a linker that connects the NTD and CTD. We utilize an optical tweezers system to isolate a long single-stranded nucleic acid substrate to measure directly the binding and packaging function of N protein at a single molecule level in real time. We find that N protein binds the nucleic acid substrate with high affinity before oligomerizing and forming a highly compact structure. By comparing the activities of truncated protein variants missing the NTD, CTD, and/or linker, we attribute specific steps in this process to the structural domains of N protein, with the NTD driving initial binding to the substrate and ensuring high localized protein density that triggers interprotein interactions mediated by the CTD, which forms a compact and stable protein-nucleic acid complex suitable for packaging into the virion. 
    more » « less
  2. Millet, Oscar (Ed.)
    Cyp33 is an essential human cyclophilin prolyl isomerase that plays myriad roles in splicing and chromatin remodeling. In addition to a canonical cyclophilin (Cyp) domain, Cyp33 contains an RNA-recognition motif (RRM) domain, and RNA-binding triggers proline isomerase activity. One prominent role for Cyp33 is through a direct interaction with the mixed lineage leukemia protein 1 (MLL1, also known as KMT2A) complex, which is a histone methyltransferase that serves as a global regulator of human transcription. MLL activity is regulated by Cyp33, which isomerizes a key proline in the linker between the PHD3 and Bromo domains of MLL1, acting as a switch between gene activation and repression. The direct interaction between MLL1 and Cyp33 is critical, as deletion of the MLL1-PHD3 domain responsible for this interaction results in oncogenesis. The Cyp33 RRM is central to these activities, as it binds both the PHD3 domain and RNA. To better understand how RNA binding drives the action of Cyp33, we performed RNA-SELEX against full-length Cyp33 accompanied by deep sequencing. We have identified an enriched Cyp33 binding motif ( AAUAAUAA ) broadly represented in the cellular RNA pool as well as tightly binding RNA aptamers with affinities comparable and competitive with the Cyp33 MLL1-PHD3 interaction. RNA binding extends beyond the canonical RRM domain, but not to the Cyp domain, suggesting an indirect mechanism of interaction. NMR chemical shift mapping confirms an overlapping, but not identical, interface on Cyp33 for RNA and PHD3 binding. This finding suggests RNA can disrupt the gene repressive Cyp33-MLL1 complex providing another layer of regulation for chromatin remodeling by MLL1. 
    more » « less
  3. Transcription factors are multidomain proteins with specific DNA binding and regulatory domains. In the human FoxP subfamily (FoxP1, FoxP2, FoxP3, and FoxP4) of transcription factors, a 90 residue-long disordered region links a Leucine Zipper (ZIP)—known to form coiled-coil dimers—and a Forkhead (FKH) domain—known to form domain swapping dimers. We used replica exchange discrete molecular dynamics simulations, single-molecule fluorescence experiments, and other biophysical tools to understand how domain tethering in FoxP1 impacts dimerization at ZIP and FKH domains and how DNA binding allosterically regulates their dimerization. We found that domain tethering promotes FoxP1 dimerization but inhibits a FKH domain-swapped structure. Furthermore, our findings indicate that the linker mediates the mutual organization and dynamics of ZIP and FKH domains, forming closed and open states with and without interdomain contacts, thus highlighting the role of the linkers in multidomain proteins. Finally, we found that DNA allosterically promotes structural changes that decrease the dimerization propensity of FoxP1. We postulate that, upon DNA binding, the interdomain linker plays a crucial role in the gene regulatory function of FoxP1. 
    more » « less
  4. Abstract The Pcf11 protein is an essential subunit of the large complex that cleaves and polyadenylates eukaryotic mRNA precursor. It has also been functionally linked to gene-looping, termination of RNA Polymerase II (Pol II) transcripts, and mRNA export. We have examined a poorly characterized but conserved domain (amino acids 142–225) of the Saccharomyces cerevisiae  Pcf11 and found that while it is not needed for mRNA 3′ end processing or termination downstream of the poly(A) sites of protein-coding genes, its presence improves the interaction with Pol II and the use of transcription terminators near gene promoters. Analysis of genome-wide Pol II occupancy in cells with Pcf11 missing this region, as well as Pcf11 mutated in the Pol II CTD Interacting Domain, indicates that systematic changes in mRNA expression are mediated primarily at the level of transcription. Global expression analysis also shows that a general stress response, involving both activation and suppression of specific gene sets known to be regulated in response to a wide variety of stresses, is induced in the two pcf11 mutants, even though cells are grown in optimal conditions. The mutants also cause an unbalanced expression of cell wall-related genes that does not activate the Cell Wall Integrity pathway but is associated with strong caffeine sensitivity. Based on these findings, we propose that Pcf11 can modulate the expression level of specific functional groups of genes in ways that do not involve its well-characterized role in mRNA 3′ end processing. 
    more » « less
  5. The ability to understand the function of a protein often relies on knowledge about its detailed structure. Sometimes, seemingly insignificant changes in the primary structure of a protein, like an amino acid substitution, can completely disrupt a protein's function. Long-lived proteins (LLPs), which can be found in critical areas of the human body, like the brain and eye, are especially susceptible to primary sequence alterations in the form of isomerization and epimerization. Because long-lived proteins do not have the corrective regeneration capabilities of most other proteins, points of isomerism and epimerization that accumulate within the proteins can severely hamper their functions and can lead to serious diseases like Alzheimer's disease, cancer and cataracts. Whereas tandem mass spectrometry (MS/MS) in the form of collision-induced dissociation (CID) generally excels at peptide characterization, MS/MS often struggles to pinpoint modifications within LLPs, especially when the differences are only isomeric or epimeric in nature. One of the most prevalent and difficult-to-identify modifications is that of aspartic acid between its four isomeric forms: l -Asp, l -isoAsp, d -Asp, and d -isoAsp. In this study, peptides containing isomers of Asp were analyzed by charge transfer dissociation (CTD) mass spectrometry to identify spectral features that could discriminate between the different isomers. For the four isomers of Asp in three model peptides, CTD produced diagnostic ions of the form c n +57 on the N-terminal side of iso-Asp residues, but not on the N-terminal side of Asp residues. Using CTD, the l - and d forms of Asp and isoAsp could also be differentiated based on the relative abundance of y - and z ions on the C-terminal side of Asp residues. Differentiation was accomplished through a chiral discrimination factor, R , which compares an ion ratio in a spectrum of one epimer or isomer to the same ion ratio in the spectrum of a different epimer or isomer. The R values obtained using CTD are as robust and statistically significant as other fragmentation techniques, like radical directed dissociation (RDD). In summary, the extent of backbone and side-chain fragments produced by CTD enabled the differentiation of isomers and epimers of Asp in a variety of peptides. 
    more » « less