skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: The N‐terminus of obscurin is flexible in solution
Abstract The N‐terminal half of the giant cytoskeletal protein obscurin is comprised of more than 50 Ig‐like domains, arranged in tandem. Domains 18–51 are connected to each other through short 5‐residue linkers, and this arrangement has been previously shown to form a semi‐flexible rod in solution. Domains 1–18 generally have slightly longer ~7 residue interdomain linkers, and the multidomain structure and motion conferred by this kind of linker is understudied. Here, we use NMR, SAXS, and MD to show that these longer linkers are associated with significantly more domain/domain flexibility, with the resulting multidomain structure being moderately compact. Further examination of the relationship between interdomain flexibility and linker length shows there is a 5 residue “sweet spot” linker length that results in dual‐domain systems being extended, and conversely that both longer or shorter linkers result in a less extended structure. This detailed knowledge of the obscurin N terminus structure and flexibility allowed for mathematical modeling of domains 1–18, which suggests that this region likely forms tangles if left alone in solution. Given how infrequently protein tangles occur in nature, and given the pathological outcomes that occur when tangles do arise, our data suggest that obscurin is likely either significantly scaffolded or else externally extended in the cell.  more » « less
Award ID(s):
2024182 1757874
PAR ID:
10442599
Author(s) / Creator(s):
 ;  ;  ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Proteins: Structure, Function, and Bioinformatics
Volume:
91
Issue:
4
ISSN:
0887-3585
Page Range / eLocation ID:
p. 485-496
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Transcription factors are multidomain proteins with specific DNA binding and regulatory domains. In the human FoxP subfamily (FoxP1, FoxP2, FoxP3, and FoxP4) of transcription factors, a 90 residue-long disordered region links a Leucine Zipper (ZIP)—known to form coiled-coil dimers—and a Forkhead (FKH) domain—known to form domain swapping dimers. We used replica exchange discrete molecular dynamics simulations, single-molecule fluorescence experiments, and other biophysical tools to understand how domain tethering in FoxP1 impacts dimerization at ZIP and FKH domains and how DNA binding allosterically regulates their dimerization. We found that domain tethering promotes FoxP1 dimerization but inhibits a FKH domain-swapped structure. Furthermore, our findings indicate that the linker mediates the mutual organization and dynamics of ZIP and FKH domains, forming closed and open states with and without interdomain contacts, thus highlighting the role of the linkers in multidomain proteins. Finally, we found that DNA allosterically promotes structural changes that decrease the dimerization propensity of FoxP1. We postulate that, upon DNA binding, the interdomain linker plays a crucial role in the gene regulatory function of FoxP1. 
    more » « less
  2. Chromatin, a dynamic protein-DNA complex that regulates eukaryotic genome accessibility and essential functions, is composed of nucleosomes connected by linker DNA with each nucleosome consisting of DNA wrapped around an octamer of histones H2A, H2B, H3 and H4. Magic angle spinning solid-state nuclear magnetic resonance (NMR) spectroscopy can yield unique insights into histone structure and dynamics in condensed nucleosomes and nucleosome arrays representative of chromatin at physiological concentrations. Recently we used J-coupling-based solid-state NMR methods to investigate with residue-specific resolution the conformational dynamics of histone H3 N-terminal tails in 16-mer nucleosome arrays containing 15, 30 or 60 bp DNA linkers. Here, we probe the H3 core domain in the 16-mer arrays as a function of DNA linker lengthviadipolar coupling-based1H-detected solid-state NMR techniques. Specifically, we established nearly complete assignments of backbone chemical shifts for H3 core residues in arrays with 15–60 bp DNA linkers reconstituted with2H,13C,15N-labeled H3. Overall, these chemical shifts were similar irrespective of the DNA linker length indicating no major changes in H3 core conformation. Notably, however, multiple residues at the H3-nucleosomal DNA interface in arrays with 15 bp DNA linkers exhibited relatively pronounced differences in chemical shifts and line broadening compared to arrays with 30 and 60 bp linkers. These findings are consistent with increased heterogeneity in nucleosome packing and structural strain within arrays containing short DNA linkers that likely leads to side-chains of these interfacial residues experiencing alternate conformations or shifts in their rotamer populations relative to arrays with the longer DNA linkers. 
    more » « less
  3. Abstract Many proteins are composed of several domains that pack together into a complex tertiary structure. Multidomain proteins can be challenging for protein structure modeling, particularly those for which templates can be found for individual domains but not for the entire sequence. In such cases, homology modeling can generate high quality models of the domains but not for the orientations between domains. Small‐angle X‐ray scattering (SAXS) reports the structural properties of entire proteins and has the potential for guiding homology modeling of multidomain proteins. In this article, we describe a novel multidomain protein assembly modeling method, SAXSDom that integrates experimental knowledge from SAXS with probabilistic Input‐Output Hidden Markov model to assemble the structures of individual domains together. Four SAXS‐based scoring functions were developed and tested, and the method was evaluated on multidomain proteins from two public datasets. Incorporation of SAXS information improved the accuracy of domain assembly for 40 out of 46 critical assessment of protein structure prediction multidomain protein targets and 45 out of 73 multidomain protein targets from the ab initio domain assembly dataset. The results demonstrate that SAXS data can provide useful information to improve the accuracy of domain‐domain assembly. The source code and tool packages are available athttps://github.com/jianlin-cheng/SAXSDom. 
    more » « less
  4. Recently developed protein language models have enabled a variety of applications with the protein contextual embeddings they produce. Per-protein representations (each protein is represented as a vector of fixed dimension) can be derived via averaging the embeddings of individual residues, or applying matrix transformation techniques such as the discrete cosine transformation (DCT) to matrices of residue embeddings. Such protein-level embeddings have been applied to enable fast searches of similar proteins; however, limitations have been found; for example, PROST is good at detecting global homologs but not local homologs, and knnProtT5 excels for proteins with single domains but not multidomain proteins. Here, we propose a novel approach that first segments proteins into domains (or subdomains) and then applies the DCT to the vectorized embeddings of residues in each domain to infer domain-level contextual vectors. Our approach, called DCTdomain, uses predicted contact maps from ESM-2 for domain segmentation, which is formulated as adomain segmentationproblem and can be solved using arecursive cutalgorithm (RecCut in short) in quadratic time to the protein length; for comparison, an existing approach for domain segmentation uses a cubic-time algorithm. We show such domain-level contextual vectors (termed asDCT fingerprints) enable fast and accurate detection of similarity between proteins that share global similarities but with undefined extended regions between shared domains, and those that only share local similarities. In addition, tests on a database search benchmark show that the DCTdomain is able to detect distant homologs by leveraging the structural information in the contextual embeddings. 
    more » « less
  5. The scaffold protein PSD-95 links postsynaptic receptors to sites of presynaptic neurotransmitter release. Flexible linkers between folded domains in PSD-95 enable a dynamic supertertiary structure. Interdomain interactions within the PSG supramodule, formed by P DZ3, S H3, and G uanylate Kinase domains, regulate PSD-95 activity. Here we combined discrete molecular dynamics and single molecule Förster resonance energy transfer (FRET) to characterize the PSG supramodule, with time resolution spanning picoseconds to seconds. We used a FRET network to measure distances in full-length PSD-95 and model the conformational ensemble. We found that PDZ3 samples two conformational basins, which we confirmed with disulfide mapping. To understand effects on activity, we measured binding of the synaptic adhesion protein neuroligin. We found that PSD-95 bound neuroligin well at physiological pH while truncated PDZ3 bound poorly. Our hybrid structural models reveal how the supertertiary context of PDZ3 enables recognition of this critical synaptic ligand. 
    more » « less