Symmetry-adapted perturbation theory (SAPT) is an ab initio approach that directly computes noncovalent interaction energies in terms of electrostatics, exchange repulsion, induction/polarization, and London dispersion components. Due to its high computational scaling, routine applications of even the lowest order of SAPT are typically limited to a few hundred atoms. To address this limitation, we report here the addition of electrostatic embedding to the SAPT (EE-SAPT) and ISAPT (EE-ISAPT) methods. We illustrate the embedding scheme using water trimer as a prototype example. Then, we show that EE-SAPT/EE-ISAPT can be applied for efficiently and accurately computing noncovalent interactions in large systems, including solvated dimers and protein–ligand systems. In the latter application, particular care must be taken to properly handle the quantum mechanics/molecular mechanics boundary when it cuts covalent bonds. We investigate various schemes for handling charges near this boundary and demonstrate which are most effective in the context of charge-embedded SAPT.
more »
« less
Electrostatically embedded molecules‐in‐molecules approach and its application to molecular clusters
Abstract We report the application of our fragment‐based quantum chemistry model MIM (Molecules‐In‐Molecules) with electrostatic embedding. The method is termed “EE‐MIM (ElectrostaticallyEmbeddedMolecules‐In‐Molecules)” and accounts for the missing electrostatic interactions in the subsystems resulting from fragmentation. Point charges placed at the atomic positions are used to represent the interaction of each subsystem with the rest of the molecule with minimal increase in the computational cost. We have carefully calibrated this model on a range of different sizes of clusters containing up to 57 water molecules. The fragmentation methods have been applied with the goal of reproducing the unfragmented total energy at the MP2/6‐311G(d,p) level. Comparative analysis has been carried out between MIM and EE‐MIM to gauge the impact of electrostatic embedding. Performance of several different parameters such as the type of charge and levels of fragmentation are analyzed for the prediction of absolute energies. The use of background charges in subsystem calculations improves the performance of both one‐ and two‐layer MIM while it is noticeably important in the case of one‐layer MIM. Embedded charges for two‐layer MIM are obtained from a full system calculation at the low‐level. For one‐layer MIM, in the absence of a full system calculation, two different types of embedded charges, namely, Geometry dependent (GD) and geometry independent (GI) charges, are used. A self‐consistent procedure is employed to obtain GD charges. We have further tested our method on challenging charged systems with stronger intermolecular interactions, namely, protonated ammonia clusters (containing up to 30 ammonia molecules). The observations are similar to water clusters with improved performance using embedded charges. Overall, the performance of NPA charges as embedded charges is found to be the best.
more »
« less
- Award ID(s):
- 1665427
- PAR ID:
- 10360254
- Publisher / Repository:
- Wiley Blackwell (John Wiley & Sons)
- Date Published:
- Journal Name:
- Journal of Computational Chemistry
- Volume:
- 42
- Issue:
- 10
- ISSN:
- 0192-8651
- Page Range / eLocation ID:
- p. 719-734
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Embedding is widely used in recommendation models to learn feature representations. However, the traditional embedding technique that assigns a fixed size to all categorical features may be suboptimal due to the following reasons. In recommendation domain, the majority of categorical features' embeddings can be trained with less capacity without impacting model performance, thereby storing embeddings with equal length may incur unnecessary memory usage. Existing work that tries to allocate customized sizes for each feature usually either simply scales the embedding size with feature's popularity or formulates this size allocation problem as an architecture selection problem. Unfortunately, most of these methods either have large performance drop or incur significant extra time cost for searching proper embedding sizes. In this article, instead of formulating the size allocation problem as an architecture selection problem, we approach the problem from a pruning perspective and proposePruning-basedMulti-sizeEmbedding (PME) framework. During the search phase, we prune the dimensions that have the least impact on model performance in the embedding to reduce its capacity. Then, we show that the customized size of each token can be obtained by transferring the capacity of its pruned embedding with significant less search cost. Experimental results validate that PME can efficiently find proper sizes and hence achieve strong performance while significantly reducing the number of parameters in the embedding layer.more » « less
-
Abstract Accurate estimation of solvation free energy (SFE) lays the foundation for accurate prediction of binding free energy. The Poisson‐Boltzmann (PB) or generalized Born (GB) combined with surface area (SA) continuum solvation method (PBSA and GBSA) have been widely used in SFE calculations because they can achieve good balance between accuracy and efficiency. However, the accuracy of these methods can be affected by several factors such as the charge models, polar and nonpolar SFE calculation methods and the atom radii used in the calculation. In this work, the performance of the ABCG2 (AM1‐BCC‐GAFF2) charge model as well as other two charge models, that is, RESP (Restrained Electrostatic Potential) and AM1‐BCC (Austin Model 1‐bond charge corrections), on the SFE prediction of 544 small molecules in water by PBSA/GBSA was evaluated. In order to improve the performance of the PBSA prediction based on the ABCG2 charge, we further explored the influence of atom radii on the prediction accuracy and yielded a set of atom radius parameters for more accurate SFE prediction using PBSA based on the ABCG2/GAFF2 by reproducing the thermodynamic integration (TI) calculation results. The PB radius parameters of carbon, oxygen, sulfur, phosphorus, chloride, bromide and iodine, were adjusted. New atom types,on,oi,hn1,hn2,hn3, were introduced to further improve the fitting performance. Then, we tuned the parameters in the nonpolar SFE model using the experimental SFE data and the PB calculation results. By adopting the new radius parameters and new nonpolar SFE model, the root mean square error (RMSE) of the SFE calculation for the 544 molecules decreased from 2.38 to 1.05 kcal/mol. Finally, the new radius parameters were applied in the prediction of protein‐ligand binding free energies using the MM‐PBSA method. For the eight systems tested, we could observe higher correlation between the experiment data and calculation results and smaller prediction errors for the absolute binding free energies, demonstrating that our new radius parameters can improve the free energy calculation using the MM‐PBSA method.more » « less
-
Ices in the interstellar medium largely exist as amorphous solids composed of small molecules including ammonia, water, and carbon dioxide. Describing gas-phase molecules can be readily accomplished with current high-level quantum chemical calculations with the description of crystalline solids becoming more readily accomplished. Differently, amorphous solids require more novel approaches. The present work describes a method for generating amorphous structures and constructing electronic spectra through a combination of quantum chemical calculations and statistical mechanics. The structures are generated through a random positioning program and DFT methods, such as ωB97-XD and CAM-B3LYP. A Boltzmann distribution weights the excitations to compile a final spectrum from a sampling of molecular clusters. Three ice analogs are presented herein consisting of ammonia, carbon dioxide, and water. Ammonia and carbon dioxide provide semi-quantitative agreement with experiment for CAM-B3LYP/6-311++G(2d,2p) from 30 clusters of 8 molecules. Meanwhile, the amorphous water description improves when the sample size is increased in cluster size and count to as many as 105 clusters of 32 water molecules. The described methodology can produce highly comparative descriptions of electronic spectra for ice analogs and can be used to predict electronic spectra for other ice analogs.more » « less
-
Methods which accurately predict protein – ligand binding strengths are critical for drug discovery. In the last two decades, advances in chemical modelling have enabled steadily accelerating progress in the discovery and optimization of structure-based drug design. Most computational methods currently used in this context are based on molecular mechanics force fields that often have deficiencies in describing the quantum mechanical (QM) aspects of molecular binding. In this study, we show the competitiveness of our QM-based Molecules-in-Molecules (MIM) fragmentation method for characterizing binding energy trends for seven different datasets of protein – ligand complexes. By using molecular fragmentation, the MIM method allows for accelerated QM calculations. We demonstrate that for classes of structurally similar ligands bound to a common receptor, MIM provides excellent correlation to experiment, surpassing the more popular Molecular Mechanics Poisson-Boltzmann Surface Area (MM/PBSA) and Molecular Mechanics Generalized Born Surface Area (MM/GBSA) methods. The MIM method offers a relatively simple, well-defined protocol by which binding trends can be ascertained at the QM level and is suggested as a promising option for lead optimization in structure-based drug design.more » « less
An official website of the United States government
