skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Identification and Ranking of Binding Sites from Structural Ensembles: Application to SARS-CoV-2
Target identification and evaluation is a critical step in the drug discovery process. Although time-intensive and complex, the challenge becomes even more acute in the realm of infectious disease, where the rapid emergence of new viruses, the swift mutation of existing targets, and partial effectiveness of approved antivirals can lead to outbreaks of significant public health concern. The COVID-19 pandemic, caused by the SARS-CoV-2 virus, serves as a prime example of this, where despite the allocation of substantial resources, Paxlovid is currently the only effective treatment. In that case, significant effort pre-pandemic had been expended to evaluate the biological target for the closely related SARS-CoV. In this work, we utilize the computational hot spot mapping method, FTMove, to rapidly identify and rank binding sites for a set of nine SARS-CoV-2 drug/potential drug targets. FTMove takes into account protein flexibility by mapping binding site hot spots across an ensemble of structures for a given target. To assess the applicability of the FTMove approach to a wide range of drug targets for viral pathogens, we also carry out a comprehensive review of the known SARS-CoV-2 ligandable sites. The approach is able to identify the vast majority of all known sites and a few additional sites, which may in fact be yet to be discovered as ligandable. Furthermore, a UMAP analysis of the FTMove features for each identified binding site is largely able to separate predicted sites with experimentally known binders from those without known binders. These results demonstrate the utility of FTMove to rapidly identify actionable sites across a range of targets for a given indication. As such, the approach is expected to be particularly useful for assessing target binding sites for any emerging pathogen, as well as for indications in other disease areas, and providing actionable starting points for structure-based drug design efforts.  more » « less
Award ID(s):
2200052
PAR ID:
10620801
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
MDPI
Date Published:
Journal Name:
Viruses
Volume:
16
Issue:
11
ISSN:
1999-4915
Page Range / eLocation ID:
1647
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Currently, there is neither effective antiviral drugs nor vaccine for coronavirus disease 2019 (COVID-19) caused by acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Due to its high conservativeness and low similarity with human genes, SARS-CoV-2 main protease (M pro ) is one of the most favorable drug targets. However, the current understanding of the molecular mechanism of M pro inhibition is limited by the lack of reliable binding affinity ranking and prediction of existing structures of M pro –inhibitor complexes. This work integrates mathematics ( i.e. , algebraic topology) and deep learning (MathDL) to provide a reliable ranking of the binding affinities of 137 SARS-CoV-2 M pro inhibitor structures. We reveal that Gly143 residue in M pro is the most attractive site to form hydrogen bonds, followed by Glu166, Cys145, and His163. We also identify 71 targeted covalent bonding inhibitors. MathDL was validated on the PDBbind v2016 core set benchmark and a carefully curated SARS-CoV-2 inhibitor dataset to ensure the reliability of the present binding affinity prediction. The present binding affinity ranking, interaction analysis, and fragment decomposition offer a foundation for future drug discovery efforts. 
    more » « less
  2. Abstract Given the severity of the SARS-CoV-2 pandemic, a major challenge is to rapidly repurpose existing approved drugs for clinical interventions. While a number of data-driven and experimental approaches have been suggested in the context of drug repurposing, a platform that systematically integrates available transcriptomic, proteomic and structural data is missing. More importantly, given that SARS-CoV-2 pathogenicity is highly age-dependent, it is critical to integrate aging signatures into drug discovery platforms. We here take advantage of large-scale transcriptional drug screens combined with RNA-seq data of the lung epithelium with SARS-CoV-2 infection as well as the aging lung. To identify robust druggable protein targets, we propose a principled causal framework that makes use of multiple data modalities. Our analysis highlights the importance of serine/threonine and tyrosine kinases as potential targets that intersect the SARS-CoV-2 and aging pathways. By integrating transcriptomic, proteomic and structural data that is available for many diseases, our drug discovery platform is broadly applicable. Rigorous in vitro experiments as well as clinical trials are needed to validate the identified candidate drugs. 
    more » « less
  3. Abstract SARS‐CoV‐2 (SCoV2) and its variants of concern pose serious challenges to the public health. The variants increased challenges to vaccines, thus necessitating for development of new intervention strategies including anti‐virals. Within the international Covid19‐NMR consortium, we have identified binders targeting the RNA genome of SCoV2. We established protocols for the production and NMR characterization of more than 80 % of all SCoV2 proteins. Here, we performed an NMR screening using a fragment library for binding to 25 SCoV2 proteins and identified hits also against previously unexplored SCoV2 proteins. Computational mapping was used to predict binding sites and identify functional moieties (chemotypes) of the ligands occupying these pockets. Striking consensus was observed between NMR‐detected binding sites of the main protease and the computational procedure. Our investigation provides novel structural and chemical space for structure‐based drug design against the SCoV2 proteome. 
    more » « less
  4. The pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in 2020 has impacted daily life globally for over a year. While multiple vaccines have been authorized for emergency use and one oral medication has entered clinical trials, we are still seeking antiviral drugs for a long-term treatment for SARS-CoV-2 as well as other coronaviruses. Computational drug screenings of two SARS-CoV-2 protein target candidates are presented in this thesis: the nidoviral RNA uridylate-specific endoribonuclease (Nsp15) and the main protease (Mpro) of SARS-CoV-2. Nonstructural proteins of coronaviruses were selected as targets as they are more conserved across coronavirus strains than structural proteins. High throughput virtual screening of small molecule libraries including DrugBank and ZINC 15 resulted in several promising compounds for each of these targets. Molecular dynamics simulation allowed us to predict the binding energies for these compounds using molecular mechanics with generalized born surface area solvation calculations (MM-GBSA). Four top compounds were discovered for Nsp15 and eight compounds for Mpro. 
    more » « less
  5. The global COVID-19 pandemic has highlighted the need for rapid, reliable, and efficient detection of biological agents and the necessity of tracking changes in genetic material as new SARS-CoV-2 variants emerge. Here we demonstrate that RNA-based, single-molecule conductance experiments can be used to identify specific variants of SARS-CoV-2. To this end, we i) select target sequences of interest for specific variants, ii) utilize single-molecule break junction measurements to obtain conductance histograms for each sequence and its potential mutations, and iii) employ the XGBoost machine learning classifier to rapidly identify the presence of target molecules in solution with a limited number of conductance traces. This approach allows high-specificity and high-sensitivity detection of RNA target sequences less than 20 base pairs in length by utilizing a complementary DNA probe capable of binding to the specific target. We use this approach to directly detect SARS-CoV-2 variants of concerns B.1.1.7 (Alpha), B.1.351 (Beta), B.1.617.2 (Delta), and B.1.1.529 (Omicron) and further demonstrate that the specific sequence conductance is sensitive to nucleotide mismatches, thus broadening the identification capabilities of the system. Thus, our experimental methodology detects specific SARS-CoV-2 variants, as well as recognizes the emergence of new variants as they arise. 
    more » « less