In proteins, proline-aromatic sequences exhibit increased frequencies of cis-proline amide bonds, via proposed C–H/π interactions between the aromatic ring and either the proline ring or the backbone C–Hα of the residue prior to proline. These interactions would be expected to result in tryptophan, as the most electron-rich aromatic residue, exhibiting the highest frequency of cis-proline. However, prior results from bioinformatics studies on proteins and experiments on proline-aromatic sequences in peptides have not revealed a clear correlation between the properties of the aromatic ring and the population of cis-proline. An investigation of the effects of aromatic residue (aromatic ring properties) on the conformation of proline-aromatic sequences was conducted using three distinct approaches: (1) NMR spectroscopy in model peptides of the sequence Ac-TGPAr-NH2 (Ar = encoded and unnatural aromatic amino acids); (2) bioinformatics analysis of structures in proline-aromatic sequences in the PDB; and (3) computational investigation using DFT and MP2 methods on models of proline-aromatic sequences and interactions. C–H/π and hydrophobic interactions were observed to stabilize local structures in both the trans-proline and cis-proline conformations, with both proline amide conformations exhibiting C–H/π interactions between the aromatic ring and Hα of the residue prior to proline (Hα-trans-Pro-aromatic and Hα-cis-Pro-aromatic interactions) and/or with the proline ring (trans-ProH-aromatic and cis-ProH-aromatic interactions). These C–H/π interactions were strongest with tryptophan (Trp) and weakest with cationic histidine (HisH+). Aromatic interactions with histidine were modulated in strength by His ionization state. Proline-aromatic sequences were associated with specific conformational poses, including type I and type VI β-turns. C–H/π interactions at the pre-proline Hα, which were stronger than interactions at Pro, stabilize normally less favorable conformations, including the ζ or αL conformations at the pre-proline residue, cis-proline, and/or the g+ χ1 rotamer or αL conformation at the aromatic residue. These results indicate that proline-aromatic sequences, especially Pro-Trp sequences, are loci to nucleate turns, helices, loops, and other local structures in proteins. These results also suggest that mutations that introduce proline-aromatic sequences, such as the R406W mutation that is associated with protein misfolding and aggregation in the microtubule-binding protein tau, might result in substantial induced structure, particularly in intrinsically disordered regions of proteins.
more »
« less
Evaluation of acyllysine isostere interactions with the aromatic pocket of the AF9 YEATS domain
Abstract Amide−π interactions, in which an amide interacts with an aromatic group, are ubiquitous in biology, yet remain understudied relative to other noncovalent interactions. Recently, we demonstrated that an electrostatically tunable amide−π interaction is key to recognition of histone acyllysine by the AF9 YEATS domain, a reader protein which has emerged as a therapeutic target due to its dysregulation in cancer. Amide isosteres are commonly employed in drug discovery, often to prevent degradation by proteases, and have proven valuable in achieving selectivity when targeting epigenetic proteins. However, like amide−π interactions, interactions of amide isosteres with aromatic rings have not been thoroughly studied despite widespread use. Herein, we evaluate the recognition of a series of amide isosteres by the AF9 YEATS domain using genetic code expansion to evaluate the amide isostere−π interaction. We show that compared to the amide−π interaction with the native ligand, each isostere exhibits similar electrostatic tunability with an aromatic residue in the binding pocket, demonstrating that the isosteres maintain similar interactions with the aromatic residue. We identify a urea‐containing ligand that binds with enhanced affinity for the AF9 YEATS domain, offering a promising starting point for inhibitor development. Furthermore, we demonstrate that carbamate and urea isosteres of crotonyllysine are resistant to enzymatic removal by SIRT1, a protein that cleaves acyl post‐translational modifications, further indicating the potential of amide isosteres in YEATS domain inhibitor development. These results also provide experimental precedent for interactions of these common drug discovery moieties with aromatic rings that can inform computational methods.
more »
« less
- Award ID(s):
- 1757413
- PAR ID:
- 10392272
- Publisher / Repository:
- Wiley Blackwell (John Wiley & Sons)
- Date Published:
- Journal Name:
- Protein Science
- Volume:
- 32
- Issue:
- 1
- ISSN:
- 0961-8368
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Aromatic ring isosteres and rigidified saturated hydrocarbons are important motifs to enable drug discovery. Herein we disclose [2]-ladderanes as a class ofmeta-substituted aromatic ring isosteres and rigidified cyclohexanes. A straightforward synthesis of the building blocks is presented along with representative derivatization. Preliminary studies reveal that the [2]-ladderanes offer similar metabolic and physicochemical properties thus establishing this class of molecules as interesting motifs.more » « less
-
Abstract New drug production, from target identification to marketing approval, takes over 12 years and can cost around $2.6 billion. Furthermore, the COVID-19 pandemic has unveiled the urgent need for more powerful computational methods for drug discovery. Here, we review the computational approaches to predicting protein–ligand interactions in the context of drug discovery, focusing on methods using artificial intelligence (AI). We begin with a brief introduction to proteins (targets), ligands (e.g. drugs) and their interactions for nonexperts. Next, we review databases that are commonly used in the domain of protein–ligand interactions. Finally, we survey and analyze the machine learning (ML) approaches implemented to predict protein–ligand binding sites, ligand-binding affinity and binding pose (conformation) including both classical ML algorithms and recent deep learning methods. After exploring the correlation between these three aspects of protein–ligand interaction, it has been proposed that they should be studied in unison. We anticipate that our review will aid exploration and development of more accurate ML-based prediction strategies for studying protein–ligand interactions.more » « less
-
Molecular basis for differential recognition of an allosteric inhibitor by receptor tyrosine kinasesAbstract Understanding kinase‐inhibitor selectivity continues to be a major objective in kinase drug discovery. We probe the molecular basis of selectivity of an allosteric inhibitor (MSC1609119A‐1) of the insulin‐like growth factor‐I receptor kinase (IGF1RK), which has been shown to be ineffective for the homologous insulin receptor kinase (IRK). Specifically, we investigated the structural and energetic basis of the allosteric binding of this inhibitor to each kinase by combining molecular modeling, molecular dynamics (MD) simulations, and thermodynamic calculations. We predict the inhibitor conformation in the binding pocket of IRK and highlight that the charged residues in the histidine‐arginine‐aspartic acid (HRD) and aspartic acid‐phenylalanine‐glycine (DFG) motifs and the nonpolar residues in the binding pocket govern inhibitor interactions in the allosteric pocket of each kinase. We suggest that the conformational changes in the IGF1RK residues M1054 and M1079, movement of the ⍺C‐helix, and the conformational stabilization of the DFG motif favor the selectivity of the inhibitor toward IGF1RK. Our thermodynamic calculations reveal that the observed selectivity can be rationalized through differences observed in the electrostatic interaction energy of the inhibitor in each inhibitor/kinase complex and the hydrogen bonding interactions of the inhibitor with the residue V1063 in IGF1RK that are not attained with the corresponding residue V1060 in IRK. Overall, our study provides a rationale for the molecular basis of recognition of this allosteric inhibitor by IGF1RK and IRK, which is potentially useful in developing novel inhibitors with improved affinity and selectivity.more » « less
-
Abstract Designing protein-binding proteins is critical for drug discovery. However, artificial-intelligence-based design of such proteins is challenging due to the complexity of protein–ligand interactions, the flexibility of ligand molecules and amino acid side chains, and sequence–structure dependencies. We introduce PocketGen, a deep generative model that produces residue sequence and atomic structure of the protein regions in which ligand interactions occur. PocketGen promotes consistency between protein sequence and structure by using a graph transformer for structural encoding and a sequence refinement module based on a protein language model. The graph transformer captures interactions at multiple scales, including atom, residue and ligand levels. For sequence refinement, PocketGen integrates a structural adapter into the protein language model, ensuring that structure-based predictions align with sequence-based predictions. PocketGen can generate high-fidelity protein pockets with enhanced binding affinity and structural validity. It operates ten times faster than physics-based methods and achieves a 97% success rate, defined as the percentage of generated pockets with higher binding affinity than reference pockets. Additionally, it attains an amino acid recovery rate exceeding 63%.more » « less
An official website of the United States government
