skip to main content


Title: Stabilization of the SARS-CoV-2 receptor binding domain by protein core redesign and deep mutational scanning
Abstract

Stabilizing antigenic proteins as vaccine immunogens or diagnostic reagents is a stringent case of protein engineering and design as the exterior surface must maintain recognition by receptor(s) and antigen—specific antibodies at multiple distinct epitopes. This is a challenge, as stability enhancing mutations must be focused on the protein core, whereas successful computational stabilization algorithms typically select mutations at solvent-facing positions. In this study, we report the stabilization of SARS-CoV-2 Wuhan Hu-1 Spike receptor binding domain using a combination of deep mutational scanning and computational design, including the FuncLib algorithm. Our most successful design encodes I358F, Y365W, T430I, and I513L receptor binding domain mutations, maintains recognition by the receptor ACE2 and a panel of different anti-receptor binding domain monoclonal antibodies, is between 1 and 2°C more thermally stable than the original receptor binding domain using a thermal shift assay, and is less proteolytically sensitive to chymotrypsin and thermolysin than the original receptor binding domain. Our approach could be applied to the computational stabilization of a wide range of proteins without requiring detailed knowledge of active sites or binding epitopes. We envision that this strategy may be particularly powerful for cases when there are multiple or unknown binding sites.

 
more » « less
NSF-PAR ID:
10393273
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Protein Engineering, Design and Selection
Volume:
35
ISSN:
1741-0126
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Infection by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) involves the attachment of the receptor-binding domain (RBD) of its spike proteins to the ACE2 receptors on the peripheral membrane of host cells. Binding is initiated by a down-to-up conformational change in the spike protein, the change that presents the RBD to the receptor. To date, computational and experimental studies that search for therapeutics have concentrated, for good reason, on the RBD. However, the RBD region is highly prone to mutations, and is therefore a hotspot for drug resistance. In contrast, we here focus on the correlations between the RBD and residues distant to it in the spike protein. This allows for a deeper understanding of the underlying molecular recognition events and prediction of the highest-effect key mutations in distant, allosteric sites, with implications for therapeutics. Also, these sites can appear in emerging mutants with possibly higher transmissibility and virulence, and preidentifying them can give clues for designing pan-coronavirus vaccines against future outbreaks. Our model, based on time-lagged independent component analysis (tICA) and protein graph connectivity network, is able to identify multiple residues that exhibit long-distance coupling with the RBD opening. Residues involved in the most ubiquitous D614G mutation and the A570D mutation of the highly contagious UK SARS-CoV-2 variant are predicted ab initio from our model. Conversely, broad-spectrum therapeutics like drugs and monoclonal antibodies can target these key distant-but-conserved regions of the spike protein. 
    more » « less
  2. Abstract

    The glycosylation on the spike (S) protein of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus that causes COVID-19, modulates the viral infection by altering conformational dynamics, receptor interaction and host immune responses. Several variants of concern (VOCs) of SARS-CoV-2 have evolved during the pandemic, and crucial mutations on the S protein of the virus have led to increased transmissibility and immune escape. In this study, we compare the site-specific glycosylation and overall glycomic profiles of the wild type Wuhan-Hu-1 strain (WT) S protein and five VOCs of SARS-CoV-2: Alpha, Beta, Gamma, Delta and Omicron. Interestingly, both N- and O-glycosylation sites on the S protein are highly conserved among the spike mutant variants, particularly at the sites on the receptor-binding domain (RBD). The conservation of glycosylation sites is noteworthy, as over 2 million SARS-CoV-2 S protein sequences have been reported with various amino acid mutations. Our detailed profiling of the glycosylation at each of the individual sites of the S protein across the variants revealed intriguing possible association of glycosylation pattern on the variants and their previously reported infectivity. While the sites are conserved, we observed changes in the N- and O-glycosylation profile across the variants. The newly emerged variants, which showed higher resistance to neutralizing antibodies and vaccines, displayed a decrease in the overall abundance of complex-type glycans with both fucosylation and sialylation and an increase in the oligomannose-type glycans across the sites. Among the variants, the glycosylation sites with significant changes in glycan profile were observed at both theN-terminal domain and RBD of S protein, with Omicron showing the highest deviation. The increase in oligomannose-type happens sequentially from Alpha through Delta. Interestingly, Omicron does not contain more oligomannose-type glycans compared to Delta but does contain more compared to the WT and other VOCs. O-glycosylation at the RBD showed lower occupancy in the VOCs in comparison to the WT. Our study on the sites and pattern of glycosylation on the SARS-CoV-2 S proteins across the VOCs may help to understand how the virus evolved to trick the host immune system. Our study also highlights how the SARS-CoV-2 virus has conserved bothN- andO- glycosylation sites on the S protein of the most successful variants even after undergoing extensive mutations, suggesting a correlation between infectivity/ transmissibility and glycosylation.

     
    more » « less
  3. Abstract

    The continued emergence of new SARS‐CoV‐2 variants has accentuated the growing need for fast and reliable methods for the design of potentially neutralizing antibodies (Abs) to counter immune evasion by the virus. Here, we report on the de novo computational design of high‐affinity Ab variable regions (Fv) through the recombination of VDJ genes targeting the most solvent‐exposed hACE2‐binding residues of the SARS‐CoV‐2 spike receptor binding domain (RBD) protein using the software toolOptMAVEn‐2.0. Subsequently, we carried out computational affinity maturation of the designed variable regions through amino acid substitutions for improved binding with the target epitope. Immunogenicity of designs was restricted by preferring designs that match sequences from a 9‐mer library of “human Abs” based on a human string content score. We generated 106 different antibody designs and reported in detail on the top five that trade‐off the greatest computational binding affinity for the RBD with human string content scores. We further describe computational evaluation of the top five designs produced byOptMAVEn‐2.0using a Rosetta‐based approach. We used RosettaSnugDockfor local docking of the designs to evaluate their potential to bind the spike RBD and performed “forward folding” withDeepAbto assess their potential to fold into the designed structures. Ultimately, our results identified one designed Ab variable region, P1.D1, as a particularly promising candidate for experimental testing. This effort puts forth a computational workflow for the de novo design and evaluation of Abs that can quickly be adapted to target spike epitopes of emerging SARS‐CoV‐2 variants or other antigenic targets.

     
    more » « less
  4. Longnecker, Richard M. (Ed.)
    ABSTRACT A cascade of protein-protein interactions between four herpes simplex virus (HSV) glycoproteins (gD, gH/gL, and gB) drive fusion between the HSV envelope and host membrane, thereby allowing for virus entry and infection. Specifically, binding of gD to one of its receptors induces a conformational change that allows gD to bind to the regulatory complex gH/gL, which then activates the fusogen gB, resulting in membrane fusion. Using surface plasmon resonance and a panel of anti-gD monoclonal antibodies (MAbs) that sterically blocked the interaction, we previously showed that gH/gL binds directly to gD at sites distinct from the gD receptor binding site. Here, using an analogous strategy, we first evaluated the ability of a panel of uncharacterized anti-gH/gL MAbs to block binding to gD and/or inhibit fusion. We found that the epitopes of four gD-gH/gL-blocking MAbs were located within flexible regions of the gH N terminus and the gL C terminus, while the fifth was placed around gL residue 77. Taken together, our data localized the gD binding region on gH/gL to a group of gH and gL residues at the membrane distal region of the heterodimer. Surprisingly, a second set of MAbs did not block gD-gH/gL binding but instead stabilized the complex by altering the kinetic binding. However, despite this prolonged gD-gH/gL interaction, “stabilizing” MAbs also inhibited cell-cell fusion, suggesting a unique mechanism by which the fusion process is halted. Our findings support targeting the gD-gH/gL interaction to prevent fusion in both therapeutic and vaccine strategies against HSV. IMPORTANCE Key to developing a human HSV vaccine is an understanding of the virion glycoproteins involved in entry. HSV employs multiple glycoproteins for attachment, receptor interaction, and membrane fusion. Determining how these proteins function was resolved, in part, by structural biology coupled with immunological and biologic evidence. After binding, virion gD interacts with a receptor to activate the regulator gH/gL complex, triggering gB to drive fusion. Multiple questions remain, one being the physical location of each glycoprotein interaction site. Using protective antibodies with known epitopes, we documented the long-sought interaction between gD and gH/gL, detailing the region on gD important to create the gD-gH/gL triplex. Now, we have identified the corresponding gD contact sites on gH/gL. Concurrently we discovered a novel mechanism whereby gH/gL antibodies stabilize the complex and inhibit fusion progression. Our model for the gD-gH/gL triplex provides a new framework for studying fusion, which identifies targets for vaccine development. 
    more » « less
  5. Abstract Motivation

    Disruption of protein–protein interactions can mitigate antibody recognition of therapeutic proteins, yield monomeric forms of oligomeric proteins, and elucidate signaling mechanisms, among other applications. While designing affinity-enhancing mutations remains generally quite challenging, both statistically and physically based computational methods can precisely identify affinity-reducing mutations. In order to leverage this ability to design variants of a target protein with disrupted interactions, we developed the DisruPPI protein design method (DISRUpting Protein–Protein Interactions) to optimize combinations of mutations simultaneously for both disruption and stability, so that incorporated disruptive mutations do not inadvertently affect the target protein adversely.

    Results

    Two existing methods for predicting mutational effects on binding, FoldX and INT5, were demonstrated to be quite precise in selecting disruptive mutations from the SKEMPI and AB-Bind databases of experimentally determined changes in binding free energy. DisruPPI was implemented to use an INT5-based disruption score integrated with an AMBER-based stability assessment and was applied to disrupt protein interactions in a set of different targets representing diverse applications. In retrospective evaluation with three different case studies, comparison of DisruPPI-designed variants to published experimental data showed that DisruPPI was able to identify more diverse interaction-disrupting and stability-preserving variants more efficiently and effectively than previous approaches. In prospective application to an interaction between enhanced green fluorescent protein (EGFP) and a nanobody, DisruPPI was used to design five EGFP variants, all of which were shown to have significantly reduced nanobody binding while maintaining function and thermostability. This demonstrates that DisruPPI may be readily utilized for effective removal of known epitopes of therapeutically relevant proteins.

    Availability and implementation

    DisruPPI is implemented in the EpiSweep package, freely available under an academic use license.

    Supplementary information

    Supplementary data are available at Bioinformatics online.

     
    more » « less