skip to main content

Attention:

The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 11:00 PM ET on Thursday, October 10 until 2:00 AM ET on Friday, October 11 due to maintenance. We apologize for the inconvenience.


Title: Elucidation of protein function using computational docking and hotspot analysis by ClusPro and FTMap
Starting with a crystal structure of a macromolecule, computational structural modeling can help to understand the associated biological processes, structure and function, as well as to reduce the number of further experiments required to characterize a given molecular entity. In the past decade, two classes of powerful automated tools for investigating the binding properties of proteins have been developed: the protein–protein docking program ClusPro and the FTMap and FTSite programs for protein hotspot identification. These methods have been widely used by the research community by means of publicly available online servers, and models built using these automated tools have been reported in a large number of publications. Importantly, additional experimental information can be leveraged to further improve the predictive power of these approaches. Here, an overview of the methods and their biological applications is provided together with a brief interpretation of the results.  more » « less
Award ID(s):
2054251 1759472 1816314 1759277
NSF-PAR ID:
10343415
Author(s) / Creator(s):
; ; ; ; ; ; ; ;
Date Published:
Journal Name:
Acta Crystallographica Section D Structural Biology
Volume:
78
Issue:
6
ISSN:
2059-7983
Page Range / eLocation ID:
690 to 697
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Motivation

    Kinase-regulated phosphorylation is a ubiquitous type of post-translational modification (PTM) in both eukaryotic and prokaryotic cells. Phosphorylation plays fundamental roles in many signalling pathways and biological processes, such as protein degradation and protein-protein interactions. Experimental studies have revealed that signalling defects caused by aberrant phosphorylation are highly associated with a variety of human diseases, especially cancers. In light of this, a number of computational methods aiming to accurately predict protein kinase family-specific or kinase-specific phosphorylation sites have been established, thereby facilitating phosphoproteomic data analysis.

    Results

    In this work, we present Quokka, a novel bioinformatics tool that allows users to rapidly and accurately identify human kinase family-regulated phosphorylation sites. Quokka was developed by using a variety of sequence scoring functions combined with an optimized logistic regression algorithm. We evaluated Quokka based on well-prepared up-to-date benchmark and independent test datasets, curated from the Phospho.ELM and UniProt databases, respectively. The independent test demonstrates that Quokka improves the prediction performance compared with state-of-the-art computational tools for phosphorylation prediction. In summary, our tool provides users with high-quality predicted human phosphorylation sites for hypothesis generation and biological validation.

    Availability and implementation

    The Quokka webserver and datasets are freely available at http://quokka.erc.monash.edu/.

    Supplementary information

    Supplementary data are available at Bioinformatics online.

     
    more » « less
  2. Many sensors and catalysts composed of proteins immobilized on inorganic materials have been reported over the past few decades. Despite some examples of functional protein–surface and protein–nanoparticle conjugates, thorough characterization of the biological–abiological interface at the heart of these materials and devices is often overlooked in lieu of demonstrating acceptable system performance. This has resulted in a focus on generating functioning protein-based devices without a concerted effort to develop reliable tools necessary to measure the fundamental properties of the bio–abio interface, such as surface concentration, biomolecular structure, and activity. In this Perspective, we discuss current methods used to characterize these critical properties of devices that operate by integrating a protein into both flat surfaces and nanoparticle materials. We highlight the advantages and drawbacks of each method as they relate to understanding the function of the protein–surface interface and explore the manner in which an informed understanding of this complex interaction leads directly to the advancement of protein-based materials and technology. 
    more » « less
  3. Abstract Motivation

    Protein structure prediction has been greatly improved by deep learning, but the contribution of different information is yet to be fully understood. This article studies the impacts of two kinds of information for structure prediction: template and multiple sequence alignment (MSA) embedding. Templates have been used by some methods before, such as AlphaFold2, RoseTTAFold and RaptorX. AlphaFold2 and RosetTTAFold only used templates detected by HHsearch, which may not perform very well on some targets. In addition, sequence embedding generated by pre-trained protein language models has not been fully explored for structure prediction. In this article, we study the impact of templates (including the number of templates, the template quality and how the templates are generated) on protein structure prediction accuracy, especially when the templates are detected by methods other than HHsearch. We also study the impact of sequence embedding (generated by MSATransformer and ESM-1b) on structure prediction.

    Results

    We have implemented a deep learning method for protein structure prediction that may take templates and MSA embedding as extra inputs. We study the contribution of templates and MSA embedding to structure prediction accuracy. Our experimental results show that templates can improve structure prediction on 71 of 110 CASP13 (13th Critical Assessment of Structure Prediction) targets and 47 of 91 CASP14 targets, and templates are particularly useful for targets with similar templates. MSA embedding can improve structure prediction on 63 of 91 CASP14 (14th Critical Assessment of Structure Prediction) targets and 87 of 183 CAMEO targets and is particularly useful for proteins with shallow MSAs. When both templates and MSA embedding are used, our method can predict correct folds (TMscore > 0.5) for 16 of 23 CASP14 FM targets and 14 of 18 Continuous Automated Model Evaluation (CAMEO) targets, outperforming RoseTTAFold by 5% and 7%, respectively.

    Availability and implementation

    Available at https://github.com/xluo233/RaptorXFold.

    Supplementary information

    Supplementary data are available at Bioinformatics online.

     
    more » « less
  4. Abstract

    Protein labeling strategies have been explored for decades to study protein structure, function, and regulation. Fluorescent labeling of a protein enables the study of protein‐protein interactions through biophysical methods such as microscale thermophoresis (MST). MST measures the directed motion of a fluorescently labeled protein in response to microscopic temperature gradients, and the protein's thermal mobility can be used to determine binding affinity. However, the stoichiometry and site specificity of fluorescent labeling are hard to control, and heterogeneous labeling can generate inaccuracies in binding measurements. Here, we describe an easy‐to‐apply protocol for high‐stoichiometric, site‐specific labeling of a protein at its N‐terminus withN‐hydroxysuccinimide (NHS) esters as a means to measure protein‐protein interaction affinity by MST. This protocol includes guidelines for NHS ester labeling, fluorescent‐labeled protein purification, and MST measurement using a labeled protein. As an example of the entire workflow, we additionally provide a protocol for labeling a ubiquitin E3 enzyme and testing ubiquitin E2‐E3 enzyme binding affinity. These methods are highly adaptable and can be extended for protein interaction studies in various biological and biochemical circumstances. © 2021 Wiley Periodicals LLC.

    This article was corrected on 18 July 2022. See the end of the full text for details.

    Basic Protocol 1: Labeling a protein of interest at its N‐terminus with NHS esters through stepwise reaction

    Alternate Protocol: Labeling a protein of interest at its N‐terminus with NHS esters through a one‐pot reaction

    Basic Protocol 2: Purifying the N‐terminal fluorescent‐labeled protein and determining its concentration and labeling efficiency

    Basic Protocol 3: Using MST to determine the binding affinity of an N‐terminal fluorescent‐labeled protein to a binding partner.

    Basic Protocol 4: NHS ester labeling of ubiquitin E3 ligase WWP2 and measurement of the binding affinity between WWP2 and an E2 conjugating enzyme by the MST binding assay

     
    more » « less
  5. Abstract

    An increasing number of protein structures are determined by cryo‐electron microscopy (cryo‐EM) and stored in the Electron Microscopy Data Bank (EMDB). To interpret determined cryo‐EM maps, several methods have been developed that model the tertiary structure of biomolecules, particularly proteins. Here we show how to use two such methods, VESPER and MAINMAST, which were developed in our group. VESPER is a method mainly for two purposes: fitting protein structure models into an EM map and aligning two EM maps locally or globally to capture their similarity. VESPER represents each EM map as a set of vectors pointing toward denser points. By considering matching the directions of vectors, in general, VESPER aligns maps better than conventional methods that only consider local densities of maps. MAINMAST is ade novoprotein modeling tool designed for EM maps with resolution of 3–5 Å or better. MAINMAST builds a protein main chain directly from a density map by tracing dense points in an EM map and connecting them using a tree‐graph structure. This article describes how to use these two tools using three illustrative modeling examples. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC.

    Basic Protocol 1: Protein structure model fitting using VESPER

    Alternate Protocol: Atomic model fitting using VESPER web server

    Basic Protocol 2: Proteinde novomodeling using MAINMAST

     
    more » « less