NSF PAR Search | NSF Public Access Repository

The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules

https://doi.org/10.1038/s41597-020-0473-z

Smith, Justin S.; Zubatyuk, Roman; Nebgen, Benjamin; Lubbers, Nicholas; Barros, Kipton; Roitberg, Adrian E.; Isayev, Olexandr; Tretiak, Sergei (May 2020, Scientific Data)

Abstract: Maximum diversification of data is a central theme in building generalized and accurate machine learning (ML) models. In chemistry, ML has been used to develop models for predicting molecular properties, for example, quantum mechanics (QM) calculated potential energy surfaces and atomic charge models. The ANI-1x and ANI-1ccx ML-based general-purpose potentials for organic molecules were developed through active learning, an automated data diversification process. Here, we describe the ANI-1x and ANI-1ccx data sets. To demonstrate data diversity, we visualize it with a dimensionality reduction scheme, and contrast it against existing data sets. The ANI-1x data set contains multiple QM properties from 5 M density functional theory calculations, while the ANI-1ccx data set contains 500 k data points obtained with an accurate CCSD(T)/CBS extrapolation. Approximately 14 million CPU core-hours were expended to generate this data. Multiple QM calculated properties for the chemical elements C, H, N, and O are provided: energies, atomic forces, multipole moments, atomic charges, etc. We provide this data to the community to aid research and development of ML models for chemistry.
Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning

https://doi.org/10.1038/s41467-019-10827-4

Smith, Justin S.; Nebgen, Benjamin T.; Zubatyuk, Roman; Lubbers, Nicholas; Devereux, Christian; Barros, Kipton; Tretiak, Sergei; Isayev, Olexandr; Roitberg, Adrian E. (July 2019, Nature Communications)

Abstract: Computational modeling of chemical and biological systems at atomic resolution is a crucial tool in the chemist’s toolset. The use of computer simulations requires a balance between cost and accuracy: quantum-mechanical methods provide high accuracy but are computationally expensive and scale poorly to large systems, while classical force fields are cheap and scalable, but lack transferability to new systems. Machine learning can be used to achieve the best of both approaches. Here we train a general-purpose neural network potential (ANI-1ccx) that approaches CCSD(T)/CBS accuracy on benchmarks for reaction thermochemistry, isomerization, and drug-like molecular torsions. This is achieved by training a network on DFT data and then using transfer learning techniques to retrain on a dataset of gold standard QM calculations (CCSD(T)/CBS) that optimally spans chemical space. The resulting potential is broadly applicable to materials science, biology, and chemistry, and billions of times faster than CCSD(T)/CBS calculations.
Auto3D: Automatic Generation of the Low-Energy 3D Structures with ANI Neural Network Potentials

https://doi.org/10.1021/acs.jcim.2c00817

Liu, Zhen; Zubatiuk, Tetiana; Roitberg, Adrian; Isayev, Olexandr (November 2022, Journal of Chemical Information and Modeling)

Full Text Available
TorchANI: A Free and Open Source PyTorch-Based Deep Learning Implementation of the ANI Neural Network Potentials

https://doi.org/10.1021/acs.jcim.0c00451

Gao, Xiang; Ramezanghorbani, Farhad; Isayev, Olexandr; Smith, Justin S.; Roitberg, Adrian E. (July 2020, Journal of Chemical Information and Modeling)

Full Text Available
Extending the Applicability of the ANI Deep Learning Molecular Potential to Sulfur and Halogens

https://doi.org/10.1021/acs.jctc.0c00121

Devereux, Christian; Smith, Justin S.; Davis, Kate K.; Barros, Kipton; Zubatyuk, Roman; Isayev, Olexandr; Roitberg, Adrian E. (July 2020, Journal of Chemical Theory and Computation)

Full Text Available
QSAR without borders

https://doi.org/10.1039/D0CS00098A

Muratov, Eugene N.; Bajorath, Jürgen; Sheridan, Robert P.; Tetko, Igor V.; Filimonov, Dmitry; Poroikov, Vladimir; Oprea, Tudor I.; Baskin, Igor I.; Varnek, Alexandre; Roitberg, Adrian; et al. (June 2020, Chemical Society Reviews)

Prediction of chemical bioactivity and physical properties has been one of the most important applications of statistical and, more recently, machine learning and artificial intelligence methods in chemical sciences. This field of research, broadly known as quantitative structure–activity relationships (QSAR) modeling, has developed many important algorithms and has found a broad range of applications in physical organic and medicinal chemistry in the past 55+ years. This Perspective summarizes recent technological advances in QSAR modeling, but it also highlights the applicability of algorithms, modeling methods, and validation practices developed in QSAR to a wide range of research areas outside of traditional QSAR boundaries, including synthesis planning, nanotechnology, materials science, biomaterials, and clinical informatics. As modern research methods generate rapidly increasing amounts of data, the knowledge of robust data-driven modeling methods professed within the QSAR field can become essential for scientists working both within and outside of chemical research. We hope that this contribution highlighting the generalizable components of QSAR modeling will serve to address this challenge.
Full Text Available
Transforming Computational Drug Discovery with Machine Learning and AI

https://doi.org/10.1021/acsmedchemlett.8b00437

Smith, Justin S.; Roitberg, Adrian E.; Isayev, Olexandr (October 2018, ACS Medicinal Chemistry Letters)

Full Text Available
