
Title: A Suggestion of Converting Protein Intrinsic Disorder to Structural Entropy Using Shannon’s Information Theory
We propose a framework to convert protein intrinsic disorder content to structural entropy (H) using Shannon’s information theory (IT). The structural capacity (C), which is the sum of H and the structural information (I), is equal to the amino acid sequence length of the protein. The structural entropy of individual residues spans a continuous spectrum ranging from 0 (fully ordered) to 1 (fully disordered), consistent with Shannon’s IT, which scores the fully determined state 0 and the fully uncertain state 1. The intrinsically disordered proteins (IDPs) in a living cell may participate in maintaining the high-energy, low-entropy state. In addition, under this framework, the biological functions that proteins perform and that are associated with the order or disorder of their 3D structures could be explained in terms of information gains or entropy losses, or the reverse processes.
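The bookkeeping described in the abstract (C = H + I, with C equal to the sequence length and each residue contributing an entropy between 0 and 1) can be illustrated with a minimal sketch. It assumes, for illustration only, that a per-residue disorder score in [0, 1] is used directly as that residue's structural entropy; the abstract does not specify the conversion, and the function name `structural_entropy` and the example scores are hypothetical.

```python
from typing import Dict, Sequence


def structural_entropy(disorder_scores: Sequence[float]) -> Dict[str, float]:
    """Return structural capacity C, entropy H, and information I.

    Assumption (not taken from the abstract): each per-residue disorder
    score in [0, 1] is used as-is as that residue's structural entropy.
    """
    for score in disorder_scores:
        if not 0.0 <= score <= 1.0:
            raise ValueError(f"disorder score out of range [0, 1]: {score}")

    capacity = float(len(disorder_scores))   # C: sequence length
    entropy = float(sum(disorder_scores))    # H: summed per-residue entropy
    information = capacity - entropy         # I: follows from C = H + I
    return {"C": capacity, "H": entropy, "I": information}


# Hypothetical five-residue example: two ordered residues, two disordered,
# and one intermediate residue.
result = structural_entropy([0.0, 0.0, 1.0, 1.0, 0.5])
print(result)
```

Under this assumption a fully ordered protein has H = 0 and I = C, and a fully disordered one has H = C and I = 0, which matches the two endpoints of the spectrum described above.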
Award ID(s):
1720215 1761839 1453078
Sponsoring Org:
National Science Foundation
More Like this
  1. Entropy should directly reflect the extent of disorder in proteins. By clustering structurally related proteins and studying the multiple sequence alignments (MSAs) of the sequences in these clusters, we were able to link sequence, structure, and disorder information. We introduced several parameters as measures of fluctuations at a given MSA site and used these as representative of the sequence and structure entropy at that site. In general, we found a tendency for negative correlations between disorder and structure, and significant positive correlations between disorder and the fluctuations in the system. We also found evidence for residue-type conservation for those residues proximate to potentially disordered sites. Mutations at the disordered site itself appear to be allowed. In addition, we found a positive correlation between disorder and accessible surface area, validating that disordered residues occur in exposed regions of proteins. Finally, we also found that fluctuations in the dihedral angles at the original mutated residue and disorder are positively correlated, while dihedral angle fluctuations in spatially proximal residues are negatively correlated with disorder. Our results seem to indicate permissible variability in the disordered site, but greater rigidity in the parts of the protein with which the disordered site interacts. This is another indication that disordered residues are involved in protein function.
  2. Intrinsically disordered regions (IDRs) carry out many cellular functions and vary in length and placement in protein sequences. This diversity leads to variations in the underlying compositional biases, which were demonstrated for the short vs. long IDRs. We analyze compositional biases across four classes of disorder: fully disordered proteins; short IDRs; long IDRs; and binding IDRs. We identify three distinct biases: for the fully disordered proteins, the short IDRs and the long and binding IDRs combined. We also investigate compositional bias for putative disorder produced by leading disorder predictors and find that it is similar to the bias of the native disorder. Interestingly, the accuracy of disorder predictions across different methods is correlated with the correctness of the compositional bias of their predictions, highlighting the importance of the compositional bias. The predictive quality is relatively low for the disorder classes with compositional bias that is the most different from the “generic” disorder bias, while being much higher for the classes with the most similar bias. We discover that different predictors perform best across different classes of disorder. This suggests that no single predictor is universally best and motivates the development of new architectures that combine models that target specific disorder classes.
  3. Identification of intrinsic disorder in proteins relies in large part on computational predictors, which demands that their accuracy should be high. Since intrinsic disorder carries out a broad range of cellular functions, it is desirable to couple the disorder and disorder function predictions. We report a computational tool, flDPnn, that provides accurate, fast and comprehensive disorder and disorder function predictions from protein sequences. The recent Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiment and results on other test datasets demonstrate that flDPnn offers accurate predictions of disorder, fully disordered proteins and four common disorder functions. These predictions are substantially better than the results of the existing disorder predictors and methods that predict functions of disorder. Ablation tests reveal that the high predictive performance stems from innovative ways used in flDPnn to derive sequence profiles and encode inputs. flDPnn’s webserver is available at
  4. Membrane bending is a ubiquitous cellular process that is required for membrane traffic, cell motility, organelle biogenesis, and cell division. Proteins that bind to membranes using specific structural features, such as wedge-like amphipathic helices and crescent-shaped scaffolds, are thought to be the primary drivers of membrane bending. However, many membrane-binding proteins have substantial regions of intrinsic disorder which lack a stable three-dimensional structure. Interestingly, many of these disordered domains have recently been found to form networks stabilized by weak, multivalent contacts, leading to assembly of protein liquid phases on membrane surfaces. Here we ask how membrane-associated protein liquids impact membrane curvature. We find that protein phase separation on the surfaces of synthetic and cell-derived membrane vesicles creates a substantial compressive stress in the plane of the membrane. This stress drives the membrane to bend inward, creating protein-lined membrane tubules. A simple mechanical model of this process accurately predicts the experimentally measured relationship between the rigidity of the membrane and the diameter of the membrane tubules. Discovery of this mechanism, which may be relevant to a broad range of cellular protrusions, illustrates that membrane remodeling is not exclusive to structured scaffolds but can also be driven by the rapidly emerging class of liquid-like protein networks that assemble at membranes.
  5. Resting-state functional magnetic resonance imaging (rsfMRI) has become a widely used approach for detecting subtle differences in functional brain fluctuations in various studies of the healthy and disordered brain. Such studies are often based on temporal functional connectivity (i.e., the correlation between time courses derived from regions or networks within the fMRI data). While successful for a number of tasks, temporal connectivity does not fully leverage the available spatial information. In this study, we present a new perspective on spatial functional connectivity, which involves learning patterns of spatial coupling among brain networks by utilizing recent advances in deep learning as well as the contrastive learning framework. We show that we can learn domain-specific mappings of brain networks that can, in turn, be used to characterize differences between schizophrenia patients and controls. Furthermore, we show that the coupling of intradomain networks in the controls is stronger than in patients suffering from the disorder. We also evaluate the coupling among networks of different domains and find various patterns of stronger or weaker coupling among certain domains, which provide additional insights about the brain.