Title: A data-driven and topological mapping approach for the a priori prediction of stable molecular crystalline hydrates
Predictions of the structures of stoichiometric, fractional, or nonstoichiometric hydrates of organic molecular crystals are immensely challenging due to the extensive search space of different water contents, host molecular placements throughout the crystal, and internal molecular conformations. However, the dry frameworks of these hydrates, especially for nonstoichiometric or isostructural dehydrates, can often be predicted from a standard anhydrous crystal structure prediction (CSP) protocol. Inspired by developments in the field of drug binding, we introduce an efficient data-driven and topologically aware approach for predicting organic molecular crystal hydrate structures through a mapping of water positions within the crystal structure. The method does not require a priori specification of water content and can, therefore, predict stoichiometric, fractional, and nonstoichiometric hydrate structures. This approach, which we term a mapping approach for crystal hydrates (MACH), establishes a set of rules for systematic determination of favorable positions for water insertion within predicted or experimental crystal structures based on considerations of the chemical features of local environments and void regions. The proposed approach is tested on hydrates of three pharmaceutically relevant compounds that exhibit diverse crystal packing motifs and void environments characteristic of hydrate structures. Overall, we show that our mapping approach introduces an advance in the efficient performance of hydrate CSP through generation of stable hydrate stoichiometries at low cost and should be considered an integral component for CSP workflows.
Award ID(s):
1955381
NSF-PAR ID:
10401653
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
119
Issue:
43
ISSN:
0027-8424
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The accurate prediction of suitable chiral stationary phases (CSPs) for resolving the enantiomers of a given compound poses a significant challenge in chiral chromatography. Previous attempts at developing machine learning models for structure-based CSP prediction have primarily relied on 1D SMILES strings (the simplified molecular-input line-entry system, a line notation for describing the structure of chemical species using short ASCII strings) or 2D graphical representations of molecular structures, and have met with only limited success. In this study, we apply the recently developed 3D molecular conformation representation learning algorithm, which uses rapid conformational analysis and point clouds of atom positions in 3D space, enabling efficient chemical structure-based machine learning. By harnessing the power of the rapid 3D molecular representation learning and a dataset comprising over 300,000 chromatographic enantioseparation records sourced from the literature, our models afford notable improvements for the chemical structure-based choice of appropriate CSP for enantioseparation, paving the way for more efficient and informed decision-making in the field of chiral chromatography.
  2. Hydrate formation is often unavoidable during crystallization, leading to performance degradation of pharmaceuticals and energetics. In some cases, water molecules trapped within crystal lattices can be replaced by hydrogen peroxide, improving the solubility of drugs and detonation performance of explosives. The present work compares hydrates and hydrogen peroxide solvates in two ways: (1) analyzing structural motifs present in crystal structures accessed from the Cambridge Structural Database and (2) developing potential energy surfaces for water and hydrogen peroxide interacting with functional groups of interest at geometries relevant to the solid state. By elucidating fundamental differences in local interactions that can be formed with molecules of hydrogen peroxide and/or water, the analyses presented here provide a foundation for the design and selection of candidate molecules for the formation of hydrogen peroxide solvates.
  3. Clathrate hydrates form and grow at interfaces. Understanding the relevant molecular processes is crucial for developing hydrate-based technologies. Many computational studies focus on hydrate growth within the aqueous phase using the ‘direct coexistence method’, which is limited in its ability to investigate hydrate film growth at hydrocarbon–water interfaces. To overcome this shortcoming, a new simulation setup is presented here, which allows us to study the growth of a methane hydrate nucleus in a system where oil–water, hydrate–water, and hydrate–oil interfaces are all simultaneously present, thereby mimicking experimental setups. Using this setup, hydrate growth is studied here under the influence of two additives, a polyvinylcaprolactam oligomer and sodium dodecyl sulfate, at varying concentrations. Our results confirm that hydrate films grow along the oil–water interface, in general agreement with visual experimental observations; growth, albeit slower, also occurs at the hydrate–water interface, the interface most often interrogated via simulations. The results obtained demonstrate that the additives present within curved interfaces control the solubility of methane in the aqueous phase, which correlates with hydrate growth rate. Building on our simulation insights, we suggest that by combining data for the potential of mean force profile for methane transport across the oil–water interface and for the average free energy required to perturb a flat interface, it is possible to predict the performance of additives used to control hydrate growth. These insights could be helpful to achieve optimal methane storage in hydrates, one of many applications which are attracting significant fundamental and applied interest.
  4. Identifying local structure in molecular simulations is of utmost importance. The most common existing approach to identify local structure is to calculate some geometrical quantity referred to as an order parameter. In simple cases, order parameters are physically intuitive and trivial to develop (e.g., ion-pair distance); however, in most cases, order parameter development becomes a much more difficult endeavor (e.g., crystal structure identification). Using ideas from computer vision, we adapt a specific type of neural network called a PointNet to identify local structural environments in molecular simulations. A primary challenge in applying machine learning techniques to simulation is selecting the appropriate input features. This challenge is system-specific and requires significant human input and intuition. In contrast, our approach is a generic framework that requires no system-specific feature engineering and operates on the raw output of the simulations, i.e., atomic positions. We demonstrate the method on crystal structure identification in Lennard-Jones (four different phases), water (eight different phases), and mesophase (six different phases) systems. The method achieves as high as 99.5% accuracy in crystal structure identification. The method is applicable to heterogeneous nucleation and it can even predict the crystal phases of atoms near external interfaces. We demonstrate the versatility of our approach by using our method to identify surface hydrophobicity based solely upon positions and orientations of surrounding water molecules. Our results suggest the approach will be broadly applicable to many types of local structure in simulations.
  5. The syntheses and crystal structures of two bimetallic molecular compounds, namely, bis[bis(6,6′-dimethyl-2,2′-bipyridine)copper(I)] hexafluoridozirconate(IV) 1.134-hydrate, [Cu(dmbpy)2]2[ZrF6]·1.134H2O (dmbpy = 6,6′-dimethyl-2,2′-bipyridyl, C12H12N2), (I), and bis[bis(6,6′-dimethyl-2,2′-bipyridine)copper(I)] hexafluoridohafnate(IV) 0.671-hydrate, [Cu(dmbpy)2]2[HfF6]·0.671H2O, (II), are reported. Apart from a slight site occupancy difference for the water molecule of crystallization, compounds (I) and (II) are isostructural, featuring isolated tetrahedral cations of copper(I) ions coordinated by two dmbpy ligands and centrosymmetric, octahedral anions of fluorinated early transition metals. The tetrahedral environments of the copper complexes are distorted owing to the steric effects of the dmbpy ligands. The extended structures are built up through Coulombic interactions between cations and anions and π–π stacking interactions between heterochiral Δ- and Λ-[Cu(dmbpy)2]+ complexes. A comparison between the title compounds and other [Cu(dmbpy)2]+ compounds with monovalent and bivalent anions reveals a significant influence of the cation-to-anion ratio on the resulting crystal packing architectures, providing insights for future crystal design of distorted tetrahedral copper compounds.