skip to main content

Attention:

The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 11:00 PM ET on Thursday, October 10 until 2:00 AM ET on Friday, October 11 due to maintenance. We apologize for the inconvenience.


Title: Evolutionary niching in the GAtor genetic algorithm for molecular crystal structure prediction
The goal of molecular crystal structure prediction (CSP) is to find all the plausible polymorphs for a given molecule. This requires performing global optimization over a high-dimensional search space. Genetic algorithms (GAs) perform global optimization by starting from an initial population of structures and generating new candidate structures by breeding the fittest structures in the population. Typically, the fitness function is based on relative lattice energies, such that structures with lower energies have a higher probability of being selected for mating. GAs may be adapted to perform multi-modal optimization by using evolutionary niching methods that support the formation of several stable subpopulations and suppress the over-sampling of densely populated regions. Evolutionary niching is implemented in the GAtor molecular crystal structure prediction code by using techniques from machine learning to dynamically cluster the population into niches of structural similarity. A cluster-based fitness function is constructed such that structures in less populated clusters have a higher probability of being selected for breeding. Here, the effects of evolutionary niching are investigated for the crystal structure prediction of 1,3-dibromo-2-chloro-5-fluorobenzene. Using the cluster-based fitness function increases the success rate of generating the experimental structure and additional low-energy structures with similar packing motifs.  more » « less
Award ID(s):
1554428
NSF-PAR ID:
10198084
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Faraday Discussions
Volume:
211
ISSN:
1359-6640
Page Range / eLocation ID:
61 to 77
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Crystal structure prediction is now playing an increasingly important role in the discovery of new materials or crystal engineering. Global optimization methods such as genetic algorithms (GAs) and particle swarm optimization have been combined with first-principles free energy calculations to predict crystal structures given the composition or only a chemical system. While these approaches can exploit certain crystal patterns such as symmetry and periodicity in their search process, they usually do not exploit the large amount of implicit rules and constraints of atom configurations embodied in the large number of known crystal structures. They currently can only handle crystal structure prediction of relatively small systems. Inspired by the knowledge-rich protein structure prediction approach, herein we explore whether known geometric constraints such as the atomic contact map of a target crystal material can help predict its structure given its space group information. We propose a global optimization-based algorithm, CMCrystal, for crystal structure (atomic coordinates) reconstruction based on atomic contact maps. Based on extensive experiments using six global optimization algorithms, we show that it is viable to reconstruct the crystal structure given the atomic contact map for some crystal materials, but more geometric or physicochemical constraints are needed to achieve the successful reconstruction of other materials. 
    more » « less
  2. Molecular crystal structure prediction is increasingly being applied to study the solid form landscapes of larger, more flexible pharmaceutical molecules. Despite many successes in crystal structure prediction, van der Waals-inclusive density functional theory (DFT) methods exhibit serious failures predicting the polymorph stabilities for a number of systems exhibiting conformational polymorphism, where changes in intramolecular conformation lead to different intermolecular crystal packings. Here, the stabilities of the conformational polymorphs of o -acetamidobenzamide, ROY, and oxalyl dihydrazide are examined in detail. DFT functionals that have previously been very successful in crystal structure prediction perform poorly in all three systems, due primarily to the poor intramolecular conformational energies, but also due to the intermolecular description in oxalyl dihydrazide. In all three cases, a fragment-based dispersion-corrected second-order Møller–Plesset perturbation theory (MP2D) treatment of the crystals overcomes these difficulties and predicts conformational polymorph stabilities in good agreement with experiment. These results highlight the need for methods which go beyond current-generation DFT functionals to make crystal polymorph stability predictions truly reliable. 
    more » « less
  3. Metastable materials are abundant in nature and technology, showcasing remarkable properties that inspire innovative materials design. However, traditional crystal structure prediction methods, which rely solely on energetic factors to determine a structure’s fitness, are not suitable for predicting the vast number of potentially synthesizable phases that represent a local minimum corresponding to a state in thermodynamic equilibrium. Here, we present a new approach for the prediction of metastable phases with specific structural features and interface this method with the XTALOPT evolutionary algorithm. Our method relies on structural features that include the local crystalline order (e.g, the coordination number or chemical environment), and symmetry (e.g, Bravais lattice and space group) to filter the breeding pool of an evolutionary crystal structure search. The effectiveness of this approach is benchmarked on three known metastable systems: XeN8, with a two-dimensional polymeric nitrogen sublattice, brookite TiO2, and a high pressure BaH4 phase, which was recently characterized. Additionally, a newly predicted metastable melaminate salt, P1̅ WC3N6, was found to possess an energy that is lower than that of two phases proposed in a recent computational study. The method presented here could help in identifying the structures of compounds that have already been synthesized, and in developing new synthesis targets with desired properties. 
    more » « less
  4. Abstract

    Many cooperatively breeding species live in groups with complex structure—large group sizes, low and variable kin structure, and multiple breeding pairs. Since these mixed‐kin groups typically form because of immigration of unrelated individuals of both sexes in addition to limited offspring dispersal, differences in patterns of dispersal can generate variation in group structure, even within the same species or population.

    Here, we examine how environmentally mediated dispersal patterns influence variation in group structure in the plural breeding superb starling (Lamprotornis superbus), an avian cooperative breeder that inhabits a spatiotemporally variable savanna environment and forms mixed‐kin groups with variable group sizes and more than one breeding pair per group.

    Using 4068 genome‐wide polymorphic loci and fine‐scale, remotely sensed ecological data from 22 groups sampled across a nearly 200 km2environmental gradient in central Kenya, we find evidence of not only frequent and long‐distance dispersal in both sexes (low isolation‐by‐distance and weak genetic structure), but also directional dispersal from small groups in lower quality habitat with low normalised difference vegetation index (NDVI) to large groups in higher quality habitat with high NDVI.

    Additionally, we find stronger genetic structure among groups in lower quality habitat, and higher genetic diversity and lower relatedness of groups in higher quality habitat. Previous work using long‐term data from groups in the same population has shown that groups with lower relatedness are larger and have more breeding pairs.

    Long‐distance, directional dispersal to maximise individual fitness can thus lead to smaller and simpler kin‐based social groups in lower quality habitat, but larger and more complex mixed‐kin groups in higher quality habitat.

    Such intraspecific, within‐population variation in group structure, including variation in kin structure of social groups, could have profound implications for the relative importance of the evolutionary mechanisms (i.e. direct vs. indirect fitness benefits) underlying the formation of cooperative societies.

     
    more » « less
  5. null (Ed.)
    Module for ab initio structure evolution (MAISE) is an open-source package for materials modeling and prediction. The code’s main feature is an automated generation of neural network (NN) interatomic potentials for use in global structure searches. The systematic construction of Behler–Parrinello-type NN models approximating ab initio energy and forces relies on two approaches introduced in our recent studies. An evolutionary sampling scheme for generating reference structures improves the NNs’ mapping of regions visited in unconstrained searches, while a stratified training approach enables the creation of standardized NN models for multiple elements. A more flexible NN architecture proposed here expands the applicability of the stratified scheme for an arbitrary number of elements. The full workflow in the NN development is managed with a customizable ‘MAISE-NET’ wrapper written in Python. The global structure optimization capability in MAISE is based on an evolutionary algorithm applicable for nanoparticles, films, and bulk crystals. A multitribe extension of the algorithm allows for an efficient simultaneous optimization of nanoparticles in a given size range. Implemented structure analysis functions include fingerprinting with radial distribution functions and finding space groups with the SPGLIB tool. This work overviews MAISE’s available features, constructed models, and confirmed predictions. 
    more » « less