skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on January 1, 2026

Title: Rapid response to fast viral evolution using AlphaFold 3-assisted topological deep learning
Abstract The fast evolution of SARS-CoV-2 and other infectious viruses poses a grand challenge to the rapid response in terms of viral tracking, diagnostics, and design and manufacture of monoclonal antibodies (mAbs) and vaccines, which are both time-consuming and costly. This underscores the need for efficient computational approaches. Recent advancements, like topological deep learning (TDL), have introduced powerful tools for forecasting emerging dominant variants, yet they require deep mutational scanning (DMS) of viral surface proteins and associated three-dimensional (3D) protein–protein interaction (PPI) complex structures. We propose an AlphaFold 3 (AF3)-assisted multi-task topological Laplacian (MT-TopLap) strategy to address this need. MT-TopLap combines deep learning with TDA models, such as persistent Laplacians (PL) to extract detailed topological and geometric characteristics of PPIs, thereby enhancing the prediction of DMS and binding free energy (BFE) changes upon virus mutations. Validation with four experimental DMS datasets of SARS-CoV-2 spike receptor-binding domain (RBD) and the human angiotensin-converting enzyme-2 (ACE2) complexes indicates that our AF3-assisted MT-TopLap strategy maintains robust performance, with only an average 1.1% decrease in Pearson correlation coefficients (PCC) and an average 9.3% increase in root mean square errors (RMSE), compared with the use of experimental structures. Additionally, AF3-assisted MT-TopLap achieved a PCC of 0.81 when tested with a SARS-CoV-2 HK.3 variant DMS dataset, confirming its capability to accurately predict BFE changes and adapt to new experimental data, thereby showcasing its potential for rapid and effective response to fast viral evolution.  more » « less
Award ID(s):
2052983
PAR ID:
10616131
Author(s) / Creator(s):
;
Publisher / Repository:
Virus Evolution
Date Published:
Journal Name:
Virus Evolution
Volume:
11
Issue:
1
ISSN:
2057-1577
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    In the global health emergency caused by coronavirus disease 2019 (COVID-19), efficient and specific therapies are urgently needed. Compared with traditional small-molecular drugs, antibody therapies are relatively easy to develop; they are as specific as vaccines in targeting severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2); and they have thus attracted much attention in the past few months. This article reviews seven existing antibodies for neutralizing SARS-CoV-2 with 3D structures deposited in the Protein Data Bank (PDB). Five 3D antibody structures associated with the SARS-CoV spike (S) protein are also evaluated for their potential in neutralizing SARS-CoV-2. The interactions of these antibodies with the S protein receptor-binding domain (RBD) are compared with those between angiotensin-converting enzyme 2 and RBD complexes. Due to the orders of magnitude in the discrepancies of experimental binding affinities, we introduce topological data analysis, a variety of network models, and deep learning to analyze the binding strength and therapeutic potential of the 14 antibody–antigen complexes. The current COVID-19 antibody clinical trials, which are not limited to the S protein target, are also reviewed. 
    more » « less
  2. Abstract Understanding the molecular evolution of the SARS‐CoV‐2 virus as it continues to spread in communities around the globe is important for mitigation and future pandemic preparedness. Three‐dimensional structures of SARS‐CoV‐2 proteins and those of other coronavirusess archived in the Protein Data Bank were used to analyze viral proteome evolution during the first 6 months of the COVID‐19 pandemic. Analyses of spatial locations, chemical properties, and structural and energetic impacts of the observed amino acid changes in >48 000 viral isolates revealed how each one of 29 viral proteins have undergone amino acid changes. Catalytic residues in active sites and binding residues in protein–protein interfaces showed modest, but significant, numbers of substitutions, highlighting the mutational robustness of the viral proteome. Energetics calculations showed that the impact of substitutions on the thermodynamic stability of the proteome follows a universal bi‐Gaussian distribution. Detailed results are presented for potential drug discovery targets and the four structural proteins that comprise the virion, highlighting substitutions with the potential to impact protein structure, enzyme activity, and protein–protein and protein–nucleic acid interfaces. Characterizing the evolution of the virus in three dimensions provides testable insights into viral protein function and should aid in structure‐based drug discovery efforts as well as the prospective identification of amino acid substitutions with potential for drug resistance. 
    more » « less
  3. Although COVID-19 transmission has been reduced by the advent of vaccinations and a variety of rapid monitoring techniques, the SARS-CoV-2 virus itself has shown a remarkable ability to mutate and persist. With this long track record of immune escape, researchers are still exploring prophylactic treatments to curtail future SARS-CoV-2 variants. Specifically, much focus has been placed on the antiviral lectin Griffithsin in preventing spike protein-mediated infection via the hACE2 receptor (direct infection). However, an oft-overlooked aspect of SARS-CoV-2 infection is viral capture by attachment receptors such as DC-SIGN, which is thought to facilitate the initial stages of COVID-19 infection in the lung tissue (called trans-infection). In addition, while immune escape is dictated by mutations in the spike protein, coronaviral virions also incorporate M, N, and E structural proteins within the particle. In this paper, we explored how several structural facets of both the SARS-CoV-2 virion and the antiviral lectin Griffithsin can affect and attenuate the infectivity of SARS-CoV-2 pseudovirus. We found that Griffithsin was a better inhibitor of hACE2-mediated direct infection when the coronaviral M protein is present compared to when it is absent (possibly providing an explanation regarding why Griffithsin shows better inhibition against authentic SARS-CoV-2 as opposed to pseudotyped viruses, which generally do not contain M) and that Griffithsin was not an effective inhibitor of DC-SIGN-mediated trans-infection. Furthermore, we found that DC-SIGN appeared to mediate trans-infection exclusively via binding to the SARS-CoV-2 spike protein, with no significant effect observed when other viral proteins (M, N, and/or E) were present. These results provide etiological data that may help to direct the development of novel antiviral treatments, either by leveraging Griffithsin binding to the M protein as a novel strategy to prevent SARS-CoV-2 infection or by narrowing efforts to inhibit trans-infection to focus on DC-SIGN binding to SARS-CoV-2 spike protein. 
    more » « less
  4. The emergence of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has triggered a global COVID-19 pandemic, challenging healthcare systems worldwide. Effective therapeutic strategies against this novel coronavirus remain limited, underscoring the urgent need for innovative approaches. The present research investigates the potential of cannabis compounds as therapeutic agents against SARS-CoV-2 through their interaction with the virus’s papain-like protease (PLpro) protein, a crucial element in viral replication and immune evasion. Computational methods, including molecular docking and molecular dynamics (MD) simulations, were employed to screen cannabis compounds against PLpro and analyze their binding mechanisms and interaction patterns. The results showed cannabinoids with binding affinities ranging from −6.1 kcal/mol to −4.6 kcal/mol, forming interactions with PLpro. Notably, Cannabigerolic and Cannabidiolic acids exhibited strong binding contacts with critical residues in PLpro’s active region, indicating their potential as viral replication inhibitors. MD simulations revealed the dynamic behavior of cannabinoid–PLpro complexes, highlighting stable binding conformations and conformational changes over time. These findings shed light on the mechanisms underlying cannabis interaction with SARS-CoV-2 PLpro, aiding in the rational design of antiviral therapies. Future research will focus on experimental validation, optimizing binding affinity and selectivity, and preclinical assessments to develop effective treatments against COVID-19. 
    more » « less
  5. Abstract Predicting protein properties from amino acid sequences is an important problem in biology and pharmacology. Protein–protein interactions among SARS-CoV-2 spike protein, human receptors and antibodies are key determinants of the potency of this virus and its ability to evade the human immune response. As a rapidly evolving virus, SARS-CoV-2 has already developed into many variants with considerable variation in virulence among these variants. Utilizing the proteomic data of SARS-CoV-2 to predict its viral characteristics will, therefore, greatly aid in disease control and prevention. In this paper, we review and compare recent successful prediction methods based on long short-term memory (LSTM), transformer, convolutional neural network (CNN) and a similarity-based topological regression (TR) model and offer recommendations about appropriate predictive methodology depending on the similarity between training and test datasets. We compare the effectiveness of these models in predicting the binding affinity and expression of SARS-CoV-2 spike protein sequences. We also explore how effective these predictive methods are when trained on laboratory-created data and are tasked with predicting the binding affinity of the in-the-wild SARS-CoV-2 spike protein sequences obtained from the GISAID datasets. We observe that TR is a better method when the sample size is small and test protein sequences are sufficiently similar to the training sequence. However, when the training sample size is sufficiently large and prediction requires extrapolation, LSTM embedding and CNN-based predictive model show superior performance. 
    more » « less