skip to main content

Title: PolyFold: An interactive visual simulator for distance-based protein folding
Recent advances in distance-based protein folding have led to a paradigm shift in protein structure prediction. Through sufficiently precise estimation of the inter-residue distance matrix for a protein sequence, it is now feasible to predict the correct folds for new proteins much more accurately than ever before. Despite the exciting progress, a dedicated visualization system that can dynamically capture the distance-based folding process is still lacking. Most molecular visualizers typically provide only a static view of a folded protein conformation, but do not capture the folding process. Even among the selected few graphical interfaces that do adopt a dynamic perspective, none of them are distance-based. Here we present PolyFold, an interactive visual simulator for dynamically capturing the distance-based protein folding process through real-time rendering of a distance matrix and its compatible spatial conformation as it folds in an intuitive and easy-to-use interface. PolyFold integrates highly convergent stochastic optimization algorithms with on-demand customizations and interactive manipulations to maximally satisfy the geometric constraints imposed by a distance matrix. PolyFold is capable of simulating the complex process of protein folding even on modest personal computers, thus making it accessible to the general public for fostering citizen science. Open source code of PolyFold is more » freely available for download at . It is implemented in cross-platform Java and binary executables are available for macOS, Linux, and Windows. « less
; ; ;
Zhang, Yang
Award ID(s):
2030722 1942692 2208679
Publication Date:
Journal Name:
Page Range or eLocation-ID:
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    In this work, we demonstrate the development of a rapid and label-free electrochemical biosensor to detectListeria monocytogenesusing a novel stimulus–response thiomer nanobrush material. Nanobrushes were developed via one-step simultaneous co-deposition of nanoplatinum (Pt) and alginate thiomers (ALG-thiomer). ALG-thiomer/Pt nanobrush platform significantly increased the average electroactive surface area of electrodes by 7 folds and maintained the actuation properties (pH-stimulated osmotic swelling) of the alginate. Dielectric behavior during brush actuation was characterized with positively, neutral, and negatively charged redox probes above and below the isoelectric point of alginate, indicating ALG-thiomer surface charge plays an important role in signal acquisition. The ALG-thiomer platform was biofunctionalized with an aptamer selective for the internalin A protein onListeriafor biosensing applications. Aptamer loading was optimized and various cell capture strategies were investigated (brush extended versus collapsed). Maximum cell capture occurs when the ALG-thiomer/aptamer is in the extended conformation (pH > 3.5), followed by impedance measurement in the collapsed conformation (pH < 3.5). Low concentrations of bacteria (5 CFU mL−1) were sensed from a complex food matrix (chicken broth) and selectivity testing against other Gram-positive bacteria (Staphylococcus aureus) indicate the aptamer affinity is maintained, even at these pH values. The new hybrid soft material is among the most efficient and fastest (17 min) forL.more »monocytogenesbiosensing to date, and does not require sample pretreatment, constituting a promising new material platform for sensing small molecules or cells.

    « less
  2. Abstract. Recent research on fold-switching metamorphic proteins has revealed some notable exceptions to Anfinsen's hypothesis of protein folding. We have previously described how a single point mutation can enable a well-folded protein domain, one of the two PAS (Per-ARNT-Sim) domains of the human ARNT (aryl hydrocarbon receptor nuclear translocator) protein, to interconvert between two conformers related by a slip of an internal β strand. Using this protein as a test case, we advance the concept of a “fragile fold”, a protein fold that can reversibly rearrange into another fold that differs by a substantial number of hydrogen bonds, entailing reorganization of single secondary structure elements to more drastic changes seen in metamorphic proteins. Here we use a battery of biophysical tests to examine several factors affecting the equilibrium between the two conformations of the switching ARNT PAS-B Y456T protein. Of note is that we find that factors which impact the HI loop preceding the shifted Iβ strand affect both the equilibrium levels of the two conformers and the denatured state which links them in the interconversion process. Finally, we describe small molecules that selectively bind to and stabilize the wild-type conformation of ARNT PAS-B. These studies form a toolkit for studying fragile proteinmore »folds and could enable ways to modulate the biological functions of such fragile folds, both in natural and engineered proteins.« less
  3. Abstract

    Human telomere overhang composed of tandem repeats of TTAGGG folds into G-quadruplex (G4). Unlike in an experimental setting in the test tube in which the entire length is allowed to fold at once, inside the cell, the overhang is expected to fold as it is synthesized directionally (5′ to 3′) and released segmentally by a specialized enzyme, the telomerase. To mimic such vectorial G4 folding process, we employed a superhelicase, Rep-X which can unwind DNA to release the TTAGGG repeats in 5′ to 3′ direction. We demonstrate that the folded conformation achieved by the refolding of full sequence is significantly different from that of the vectorial folding for two to eight TTAGGG repeats. Strikingly, the vectorially folded state leads to a remarkably higher accessibility to complementary C-rich strand and the telomere binding protein POT1, reflecting a less stably folded state resulting from the vectorial folding. Importantly, our study points to an inherent difference between the co-polymerizing and post-polymerized folding of telomere overhang that can impact telomere architecture and downstream processes.

  4. Abstract
    <p>This data set for the manuscript entitled &#34;Design of Peptides that Fold and Self-Assemble on Graphite&#34; includes all files needed to run and analyze the simulations described in the this manuscript in the molecular dynamics software NAMD, as well as the output of the simulations. The files are organized into directories corresponding to the figures of the main text and supporting information. They include molecular model structure files (NAMD psf or Amber prmtop format), force field parameter files (in CHARMM format), initial atomic coordinates (pdb format), NAMD configuration files, Colvars configuration files, NAMD log files, and NAMD output including restart files (in binary NAMD format) and trajectories in dcd format (downsampled to 10 ns per frame). Analysis is controlled by shell scripts (Bash-compatible) that call VMD Tcl scripts or python scripts. These scripts and their output are also included.</p> <p>Version: 2.0</p> <p>Changes versus version 1.0 are the addition of the free energy of folding, adsorption, and pairing calculations (Sim_Figure-7) and shifting of the figure numbers to accommodate this addition.</p> <p><br /> Conventions Used in These Files<br /> &#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;&#61;</p> <p>Structure Files<br /> ----------------<br /> - graph_*.psf or sol_*.psf (original NAMD (XPLOR?) format psf file including atom details (type, charge, mass),More>>
  5. Abstract

    Sequence-based contact prediction has shown considerable promise in assisting non-homologous structure modeling, but it often requires many homologous sequences and a sufficient number of correct contacts to achieve correct folds. Here, we developed a method, C-QUARK, that integrates multiple deep-learning and coevolution-based contact-maps to guide the replica-exchange Monte Carlo fragment assembly simulations. The method was tested on 247 non-redundant proteins, where C-QUARK could fold 75% of the cases with TM-scores (template-modeling scores) ≥0.5, which was 2.6 times more than that achieved by QUARK. For the 59 cases that had either low contact accuracy or few homologous sequences, C-QUARK correctly folded 6 times more proteins than other contact-based folding methods. C-QUARK was also tested on 64 free-modeling targets from the 13th CASP (critical assessment of protein structure prediction) experiment and had an average GDT_TS (global distance test) score that was 5% higher than the best CASP predictors. These data demonstrate, in a robust manner, the progress in modeling non-homologous protein structures using low-accuracy and sparse contact-map predictions.