Abstract Vibrational spectroscopy enables critical insight into the structural and dynamic properties of molecules. Presently, the majority of theoretical approaches to spectroscopy employ wavefunction‐basedab initioor density functional methods that rely on the harmonic approximation. This approximation breaks down for large molecules with strongly anharmonic bonds or for molecules with large internuclear separations. An alternative to these methods involves generating molecular anharmonic potential energy surfaces (potentials) and using them to extrapolate the vibrational frequencies. This study examines the efficacy of density functional theory (DFT) and the correlation consistent Composite Approach (ccCA) in generating anharmonic frequencies from potentials of small main group molecules. Vibrational self‐consistent field Theory (VSCF) and post‐VSCF methods were used to calculate the fundamental frequencies of these molecules from their potentials. Functional choice, basis set selection, and mode‐coupling are also examined as factors in influencing accuracy. The absolute deviations for the calculated frequencies using potentials at the ccCA level of theory were lower than the potentials at the DFT level. With DFT resulting in bending modes that are better described than those of ccCA, a multilevel DFT:ccCA approach where DFT potentials are used for single vibrational mode potentials and ccCA is used for vibrational mode‐mode couplings can be utilized for larger polyatomic systems. The frequencies obtained with this multilevel approach using VCIPSI‐PT2 were closer to experimental frequencies than the scaled harmonic frequencies, indicating the success of utilizing post‐VSCF methods to generate more accurate representations of computed infrared spectra.
more »
« less
Benchmarking computational methods to calculate the octanol/water partition coefficients of a diverse set of organic molecules
In the discovery process of new drugs and the development of novel therapies in medicine, computational modeling is a complementary tool for the design of new molecules by predicting for example their solubility in different solvents. Here, we benchmarked several computational methods to calculate the partition coefficients of a diverse set of 161 organic molecules with experimental logP values obtained from the literature. In general, density functional theory methods yielded the best correlations and lower average deviations. Although results are obtained faster with semiempirical and molecular mechanics methodologies, these methods yielded higher average deviations and lower correlation coefficients than hybrid density functional theory methods. We recommend the use of an empirical formula to correct the calculated values with each methodology tested.
more »
« less
- Award ID(s):
- 1757365
- PAR ID:
- 10315508
- Date Published:
- Journal Name:
- ChemRxiv
- ISSN:
- 2573-2293
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
The deviations from linearity of the energy as a function of the number of electrons that arise with current approximations to the exchange–correlation (XC) energy functional have important consequences for the frontier eigenvalues of molecules and the corresponding valence-band maxima for solids. In this work, we present an analysis of the exact theory that allows one to infer the effects of such approximations on the highest occupied and lowest unoccupied molecular orbital eigenvalues. Then, we show the importance of the asymptotic behavior of the XC potential in the generalized gradient approximation (GGA) in the case of the NCAPR functional (nearly correct asymptotic potential revised) for determining the shift of the frontier orbital eigenvalues toward the exact values. Thereby we establish a procedure at the GGA level of refinement that allows one to make a single calculation to determine the ionization potential, the electron affinity, and the hardness of molecules (and its solid counterpart, the bandgap) with an accuracy equivalent to that obtained for those properties through energy differences, a procedure that requires three calculations. For solids, the accuracy achieved for the bandgap lies rather close to that which is obtained through hybrid XC energy functionals, but those also demand much greater computational effort than what is required with the simple NCAPR GGA calculation.more » « less
-
Polarizabilities and London dispersion forces are important to many chemical processes. Force fields for classical atomistic simulations can be constructed using atom-in-material polarizabilities and C n ( n = 6, 8, 9, 10…) dispersion coefficients. This article addresses the key question of how to efficiently assign these parameters to constituent atoms in a material so that properties of the whole material are better reproduced. We develop a new set of scaling laws and computational algorithms (called MCLF) to do this in an accurate and computationally efficient manner across diverse material types. We introduce a conduction limit upper bound and m -scaling to describe the different behaviors of surface and buried atoms. We validate MCLF by comparing results to high-level benchmarks for isolated neutral and charged atoms, diverse diatomic molecules, various polyatomic molecules ( e.g. , polyacenes, fullerenes, and small organic and inorganic molecules), and dense solids (including metallic, covalent, and ionic). We also present results for the HIV reverse transcriptase enzyme complexed with an inhibitor molecule. MCLF provides the non-directionally screened polarizabilities required to construct force fields, the directionally-screened static polarizability tensor components and eigenvalues, and environmentally screened C 6 coefficients. Overall, MCLF has improved accuracy compared to the TS-SCS method. For TS-SCS, we compared charge partitioning methods and show DDEC6 partitioning yields more accurate results than Hirshfeld partitioning. MCLF also gives approximations for C 8 , C 9 , and C 10 dispersion coefficients and quantum Drude oscillator parameters. This method should find widespread applications to parameterize classical force fields and density functional theory (DFT) + dispersion methods.more » « less
-
Kohn–Sham density functional theory has been the most popular method in electronic structure calculations. To fulfill the increasing accuracy requirements, new approximate functionals are needed to address key issues in existing approximations. It is well known that nonlocal components are crucial. Current nonlocal functionals mostly require orbital dependence such as in Hartree–Fock exchange and many-body perturbation correlation energy, which, however, leads to higher computational costs. Deviating from this pathway, we describe functional nonlocality in a new approach. By partitioning the total density to atom-centered local densities, a many-body expansion is proposed. This many-body expansion can be truncated at one-body contributions, if a base functional is used and an energy correction is approximated. The contribution from each atom-centered local density is a single finite-range nonlocal functional that is universal for all atoms. We then use machine learning to develop this universal atom-centered functional. Parameters in this functional are determined by fitting to data that are produced by high-level theories. Extensive tests on several different test sets, which include reaction energies, reaction barrier heights, and non-covalent interaction energies, show that the new functional, with only the density as the basic variable, can produce results comparable to the best-performing double-hybrid functionals, (for example, for the thermochemistry test set selected from the GMTKN55 database, BLYP based machine learning functional gives a weighted total mean absolute deviations of 3.33 kcal/mol, while DSD-BLYP-D3(BJ) gives 3.28 kcal/mol) with a lower computational cost. This opens a new pathway to nonlocal functional development and applications.more » « less
-
Discovering novel molecules with targeted properties remains a formidable challenge in materials science, often likened to finding a needle in a haystack. Traditional experimental approaches are slow, costly, and inefficient. In this study, we present an inverse design framework based on a molecular graph conditional variational autoencoder (CVAE) that enables the generation of new molecules with user-specified optical properties, particularly molar extinction coefficient ($$\varepsilon$$). Our model encodes molecular graphs, derived from SMILES strings, into a structured latent space, and then decodes them into valid molecular structures conditioned on a target $$\varepsilon$$ value. Trained on a curated dataset of known molecules with corresponding extinction coefficients, the CVAE learns to generate chemically valid structures, as verified by RDKit. Subsequent Density Functional Theory (DFT) simulations confirm that many of the generated molecules exhibit the electronic structures similar to those molecules with desired $$\varepsilon$$ values. We have also verified the $$\varepsilon$$ values of the generated molecules using a graph neural network (GNN) and the synthesizability of those molecules using an open-source module named ASKCOS. This approach demonstrates the potential of CVAEs to accelerate molecular discovery by enabling user-guided, property-driven molecule generation -- offering a scalable, data-driven alternative to traditional trial-and-error synthesis.more » « less
An official website of the United States government

