skip to main content

Title: Benchmarking ensemble docking methods in D3R Grand Challenge 4

The discovery of new drugs is a time consuming and expensive process. Methods such as virtual screening, which can filter out ineffective compounds from drug libraries prior to expensive experimental study, have become popular research topics. As the computational drug discovery community has grown, in order to benchmark the various advances in methodology, organizations such as the Drug Design Data Resource have begun hosting blinded grand challenges seeking to identify the best methods for ligand pose-prediction, ligand affinity ranking, and free energy calculations. Such open challenges offer a unique opportunity for researchers to partner with junior students (e.g., high school and undergraduate) to validate basic yet fundamental hypotheses considered to be uninteresting to domain experts. Here, we, a group of high school-aged students and their mentors, present the results of our participation in Grand Challenge 4 where we predicted ligand affinity rankings for the Cathepsin S protease, an important protein target for autoimmune diseases. To investigate the effect of incorporating receptor dynamics on ligand affinity rankings, we employed the Relaxed Complex Scheme, a molecular docking method paired with molecular dynamics-generated receptor conformations. We found that Cathepsin S is a difficult target for molecular docking and we explore some advanced more » methods such as distance-restrained docking to try to improve the correlation with experiments. This project has exemplified the capabilities of high school students when supported with a rigorous curriculum, and demonstrates the value of community-driven competitions for beginners in computational drug discovery.

« less
; ; ; ; ; ;
Publication Date:
Journal Name:
Journal of Computer-Aided Molecular Design
Page Range or eLocation-ID:
p. 87-99
Springer Science + Business Media
Sponsoring Org:
National Science Foundation
More Like this
  1. While the COVID-19 pandemic continues to worsen, effective medicines that target the life cycle of SARS-CoV-2 are still under development. As more highly infective and dangerous variants of the coronavirus emerge, the protective power of vaccines will decrease or vanish. Thus, the development of drugs, which are free of drug resistance is direly needed. The aim of this study is to identify allosteric binding modulators from a large compound library to inhibit the binding between the Spike protein of the SARS-CoV-2 virus and human angiotensin-converting enzyme 2 (hACE2). The binding of the Spike protein to hACE2 is the first step of the infection of host cells by the coronavirus. We first built a compound library containing 77 448 antiviral compounds. Molecular docking was then conducted to preliminarily screen compounds which can potently bind to the Spike protein at two allosteric binding sites. Next, molecular dynamics simulations were performed to accurately calculate the binding affinity between the spike protein and an identified compound from docking screening and to investigate whether the compound can interfere with the binding between the Spike protein and hACE2. We successfully identified two possible drug binding sites on the Spike protein and discovered a series of antiviral compoundsmore »which can weaken the interaction between the Spike protein and hACE2 receptor through conformational changes of the key Spike residues at the Spike–hACE2 binding interface induced by the binding of the ligand at the allosteric binding site. We also applied our screening protocol to another compound library which consists of 3407 compounds for which the inhibitory activities of Spike/hACE2 binding were measured. Encouragingly, in vitro data supports that the identified compounds can inhibit the Spike–ACE2 binding. Thus, we developed a promising computational protocol to discover allosteric inhibitors of the binding of the Spike protein of SARS-CoV-2 to the hACE2 receptor, and several promising allosteric modulators were discovered.« less
  2. Virtual screening is a cost- and time-effective alternative to traditional high-throughput screening in the drug discovery process. Both virtual screening approaches, structure-based molecular docking and ligand-based cheminformatics, suffer from computational cost, low accuracy, and/or reliance on prior knowledge of a ligand that binds to a given target. Here, we propose a neural network framework, NeuralDock, which accelerates the process of high-quality computational docking by a factor of 10 6 , and does not require prior knowledge of a ligand that binds to a given target. By approximating both protein-small molecule conformational sampling and energy-based scoring, NeuralDock accurately predicts the binding energy, and affinity of a protein-small molecule pair, based on protein pocket 3D structure and small molecule topology. We use NeuralDock and 25 GPUs to dock 937 million molecules from the ZINC database against superoxide dismutase-1 in 21 h, which we validate with physical docking using MedusaDock. Due to its speed and accuracy, NeuralDock may be useful in brute-force virtual screening of massive chemical libraries and training of generative drug models.
  3. Science still does not have the ability to accurately predict the affinity that ligands have for proteins. In an attempt to address this, the Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) series of blind predictive challenges is a community-wide exercise aimed at advancing computational techniques as standard predictive tools in rational drug design. In each cycle, a range of biologically relevant systems of different levels of complexity are selected to test the latest modeling methods. As part of this on-going exercise, and as a step towards understanding the important factors in context dependent guest binding, we challenged the computational community to determine the affinity of a series of negatively and positively charged guests to two constitutionally isomeric cavitand hosts: octa-acid 1 , and exo -octa acid 2 . Our affinity determinations, combined with molecular dynamics simulations, reveal asymmetries in affinities between host–guest pairs that cannot alone be explained by simple coulombic interactions, but also point to the importance of host–water interactions. Our work reveals the key facets of molecular recognition in water, emphasizes where improvements need to be made in modelling, and shed light on the complex problem of ligand-protein binding in the aqueous realm.
  4. Background: Estrogen Receptors (ER) are members of the nuclear intracellular receptorsfamily. ER once activated by estrogen, it binds to DNA via translocating into the nucleus and regulatesthe activity of various genes. Withaferin A (WA) - an active compound of a medicinal plant Withaniasomnifera was reported to be a very effective anti-cancer agent and some of the recent studies hasdemonstrated that WA is capable of arresting the development of breast cancer via targeting estrogenreceptor. Objective: The present study is aimed at understanding the molecular level interactions of ER and Tamoxifenin comparison to Withaferin A using In-silico approaches with emphasis on Withaferin Abinding capability with ER in presence of point mutations which are causing de novo drug resistance toexisting drugs like Tamoxifen. Methods: Molecular modeling and docking studies were performed for the Tamoxifen and WithaferinA with the Estrogen receptor. Molecular docking simulations of estrogen receptor in complex withTamoxifen and Withaferin A were also performed. Results: Amino acid residues, Glu353, Arg394 and Leu387 was observed as crucial for binding andstabilizing the protein-ligand complex in case of Tamoxifen and Withaferin-A. The potential ofWithaferin A to overcome the drug resistance caused by the mutations in estrogen receptor to the existingdrugs such as Tamoxifen was demonstrated.more »Conclusion: In-silico analysis has elucidated the binding mode and molecular level interactions whichare expected to be of great help in further optimizing Withaferin A or design / discovery of futurebreast cancer inhibitors targeting estrogen receptor.« less
  5. Abstract New drug production, from target identification to marketing approval, takes over 12 years and can cost around $2.6 billion. Furthermore, the COVID-19 pandemic has unveiled the urgent need for more powerful computational methods for drug discovery. Here, we review the computational approaches to predicting protein–ligand interactions in the context of drug discovery, focusing on methods using artificial intelligence (AI). We begin with a brief introduction to proteins (targets), ligands (e.g. drugs) and their interactions for nonexperts. Next, we review databases that are commonly used in the domain of protein–ligand interactions. Finally, we survey and analyze the machine learning (ML) approaches implemented to predict protein–ligand binding sites, ligand-binding affinity and binding pose (conformation) including both classical ML algorithms and recent deep learning methods. After exploring the correlation between these three aspects of protein–ligand interaction, it has been proposed that they should be studied in unison. We anticipate that our review will aid exploration and development of more accurate ML-based prediction strategies for studying protein–ligand interactions.