skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: When are two hydrogen bonds better than one? Accurate first-principles models explain the balance of hydrogen bond donors and acceptors found in proteins
Hydrogen bonds (HBs) play an essential role in the structure and catalytic action of enzymes, but a complete understanding of HBs in proteins challenges the resolution of modern structural ( i.e. , X-ray diffraction) techniques and mandates computationally demanding electronic structure methods from correlated wavefunction theory for predictive accuracy. Numerous amino acid sidechains contain functional groups ( e.g. , hydroxyls in Ser/Thr or Tyr and amides in Asn/Gln) that can act as either HB acceptors or donors (HBA/HBD) and even form simultaneous, ambifunctional HB interactions. To understand the relative energetic benefit of each interaction, we characterize the potential energy surfaces of representative model systems with accurate coupled cluster theory calculations. To reveal the relationship of these energetics to the balance of these interactions in proteins, we curate a set of 4000 HBs, of which >500 are ambifunctional HBs, in high-resolution protein structures. We show that our model systems accurately predict the favored HB structural properties. Differences are apparent in HBA/HBD preference for aromatic Tyr versus aliphatic Ser/Thr hydroxyls because Tyr forms significantly stronger O–H⋯O HBs than N–H⋯O HBs in contrast to comparable strengths of the two for Ser/Thr. Despite this residue-specific distinction, all models of residue pairs indicate an energetic benefit for simultaneous HBA and HBD interactions in an ambifunctional HB. Although the stabilization is less than the additive maximum due both to geometric constraints and many-body electronic effects, a wide range of ambifunctional HB geometries are more favorable than any single HB interaction.  more » « less
Award ID(s):
1704266
PAR ID:
10287724
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
Chemical Science
Volume:
12
Issue:
3
ISSN:
2041-6520
Page Range / eLocation ID:
1147 to 1162
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Hydrogen bonds (HB)s are the most abundant motifs in biological systems. They play a key role in determining protein–ligand binding affinity and selectivity. We designed two pharmaceutically beneficial HB databases, database A including ca. 12,000 protein–ligand complexes with ca. 22,000 HBs and their geometries, and database B including ca. 400 protein–ligand complexes with ca. 2200 HBs, their geometries, and bond strengths determined via our local vibrational mode analysis. We identified seven major HB patterns, which can be utilized as a de novo QSAR model to predict the binding affinity for a specific protein–ligand complex. Glycine was reported as the most abundant amino acid residue in both donor and acceptor profiles, and N–H⋯O was the most frequent HB type found in database A. HBs were preferred to be in the linear range, and linear HBs were identified as the strongest. HBs with HB angles in the range of 100–110°, typically forming intramolecular five-membered ring structures, showed good hydrophobic properties and membrane permeability. Utilizing database B, we found a generalized Badger’s relationship for more than 2200 protein–ligand HBs. In addition, the strength and occurrence maps between each amino acid residue and ligand functional groups open an attractive possibility for a novel drug-design approach and for determining drug selectivity and affinity, and they can also serve as an important tool for the hit-to-lead process. 
    more » « less
  2. Abstract Structures at serine‐proline sites in proteins were analyzed using a combination of peptide synthesis with structural methods and bioinformatics analysis of the PDB. Dipeptides were synthesized with the proline derivative (2S,4S)‐(4‐iodophenyl)hydroxyproline [hyp(4‐I‐Ph)]. The crystal structure of Boc‐Ser‐hyp(4‐I‐Ph)‐OMe had two molecules in the unit cell. One molecule exhibitedcis‐proline and a type VIa2 β‐turn (BcisD). Thecis‐proline conformation was stabilized by a C–H/O interaction between Pro C–Hαand the Ser side‐chain oxygen. NMR data were consistent with stabilization ofcis‐proline by a C–H/O interaction in solution. The other crystallographically observed molecule hadtrans‐Pro and both residues in the PPII conformation. Two conformations were observed in the crystal structure of Ac‐Ser‐hyp(4‐I‐Ph)‐OMe, with Ser adopting PPII in one and the β conformation in the other, each with Pro in the δ conformation andtrans‐Pro. Structures at Ser‐Pro sequences were further examined via bioinformatics analysis of the PDB and via DFT calculations. Ser‐Pro versus Ala–Pro sequences were compared to identify bases for Ser stabilization of local structures. C–H/O interactions between the Ser side‐chain Oγand Pro C–Hαwere observed in 45% of structures with Ser‐cis‐Pro in the PDB, with nearly all Ser‐cis‐Pro structures adopting a type VI β‐turn. 53% of Ser‐trans‐Pro sequences exhibited main‐chain COi•••HNi+3or COi•••HNi+4hydrogen bonds, with Ser as theiresidue and Pro as thei + 1 residue. These structures were overwhelmingly either type I β‐turns or N‐terminal capping motifs on α‐helices or 310‐helices. These results indicate that Ser‐Pro sequences are particularly potent in favoring these structures. In each, Ser is in either the PPII or β conformation, with the Ser Oγcapable of engaging in a hydrogen bond with the amide N–H of thei + 2 (type I β‐turn or 310‐helix; Serχ1t) ori + 3 (α‐helix; Serχ1g+) residue. Non‐prolinecisamide bonds can also be stabilized by C–H/O interactions. 
    more » « less
  3. The ability of the CH group to act as proton donor is now widely accepted, even if the H bonds (HBs), which it forms are typically much weaker than those of the hydroxyl group, particularly for a sp3‐hybridized C. An NH3nucleophile is allowed to approach both the terminal methyl group and the hydroxyl of n‐butanol, so as to form either a CH··N or OH··N HB. Density functional theory calculations show that the latter is much stronger than the former. However, the strength of the CH··N HB can be amplified and approach much closer to that of OH··N by appropriate placement of suitable electron‐withdrawing and donating substituents on the butanol. The interaction energy of the CH··N HB reaches above 6–8 kcal mol−1in several cases, considerably larger than the prototype HB within the water dimer. 
    more » « less
  4. Hemoglobins (Hbs) of crocodilians are reportedly characterized by unique mechanisms of allosteric regulatory control, but there are conflicting reports regarding the importance of different effectors, such as chloride ions, organic phosphates, and CO 2 . Progress in understanding the unusual properties of crocodilian Hbs has also been hindered by a dearth of structural information. Here, we present the first comparative analysis of blood properties and Hb structure and function in a phylogenetically diverse set of crocodilian species. We examine mechanisms of allosteric regulation in the Hbs of 13 crocodilian species belonging to the families Crocodylidae and Alligatoridae. We also report new amino acid sequences for the α- and β-globins of these taxa, which, in combination with structural analyses, provide insights into molecular mechanisms of allosteric regulation. All crocodilian Hbs exhibited a remarkably strong sensitivity to CO 2 , which would permit effective O 2 unloading to tissues in response to an increase in metabolism during intense activity and diving. Although the Hbs of all crocodilians exhibit similar intrinsic O 2 -affinities, there is considerable variation in sensitivity to Cl − ions and ATP, which appears to be at least partly attributable to variation in the extent of NH 2 -terminal acetylation. Whereas chloride appears to be a potent allosteric effector of all crocodile Hbs, ATP has a strong, chloride-independent effect on Hb-O 2 affinity only in caimans. Modeling suggests that allosteric ATP binding has a somewhat different structural basis in crocodilian and mammalian Hbs. 
    more » « less
  5. Abstract Short hydrogen bonds (SHBs), whose donor and acceptor heteroatoms lie within 2.7 Å, exhibit prominent quantum mechanical characters and are connected to a wide range of essential biomolecular processes. However, exact determination of the geometry and functional roles of SHBs requires a protein to be at atomic resolution. In this work, we analyze 1260 high-resolution peptide and protein structures from the Protein Data Bank and develop a boosting based machine learning model to predict the formation of SHBs between amino acids. This model, which we name as machine learning assisted prediction of short hydrogen bonds (MAPSHB), takes into account 21 structural, chemical and sequence features and their interaction effects and effectively categorizes each hydrogen bond in a protein to a short or normal hydrogen bond. The MAPSHB model reveals that the type of the donor amino acid plays a major role in determining the class of a hydrogen bond and that the side chain Tyr-Asp pair demonstrates a significant probability of forming a SHB. Combining electronic structure calculations and energy decomposition analysis, we elucidate how the interplay of competing intermolecular interactions stabilizes the Tyr-Asp SHBs more than other commonly observed combinations of amino acid side chains. The MAPSHB model, which is freely available on our web server, allows one to accurately and efficiently predict the presence of SHBs given a protein structure with moderate or low resolution and will facilitate the experimental and computational refinement of protein structures. 
    more » « less