skip to main content

Title: Large scale dataset of real space electronic charge density of cubic inorganic materials from density functional theory (DFT) calculations
Abstract Driven by the big data science, material informatics has attracted enormous research interests recently along with many recognized achievements. To acquire knowledge of materials by previous experience, both feature descriptors and databases are essential for training machine learning (ML) models with high accuracy. In this regard, the electronic charge density ρ ( r ), which in principle determines the properties of materials at their ground state, can be considered as one of the most appropriate descriptors. However, the systematic electronic charge density ρ ( r ) database of inorganic materials is still in its infancy due to the difficulties in collecting raw data in experiment and the expensive first-principles based computational cost in theory. Herein, a real space electronic charge density ρ ( r ) database of 17,418 cubic inorganic materials is constructed by performing high-throughput density functional theory calculations. The displayed ρ ( r ) patterns show good agreements with those reported in previous studies, which validates our computations. Further statistical analysis reveals that it possesses abundant and diverse data, which could accelerate ρ ( r ) related machine learning studies. Moreover, the electronic charge density database will also assists chemical bonding identifications and promotes new crystal discovery in experiments.  more » « less
Award ID(s):
1655740 1905775 2030128
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
Scientific Data
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Density functional theory (DFT) has been a critical component of computational materials research and discovery for decades. However, the computational cost of solving the central Kohn–Sham equation remains a major obstacle for dynamical studies of complex phenomena at-scale. Here, we propose an end-to-end machine learning (ML) model that emulates the essence of DFT by mapping the atomic structure of the system to its electronic charge density, followed by the prediction of other properties such as density of states, potential energy, atomic forces, and stress tensor, by using the atomic structure and charge density as input. Our deep learning model successfully bypasses the explicit solution of the Kohn-Sham equation with orders of magnitude speedup (linear scaling with system size with a small prefactor), while maintaining chemical accuracy. We demonstrate the capability of this ML-DFT concept for an extensive database of organic molecules, polymer chains, and polymer crystals.

    more » « less
  2. Discovering new materials with desired properties has been a dominant and crucial topic of interest in the field of materials science in the past few decades. In this work, novel carbon allotropes and ternary B–C–N structures were generated using the state-of-the-art RG 2 code. All structures were fully optimized using density functional theory with first-principles calculations. Several hundred carbon allotropes and ternary B–C–N structures were identified to be superhard materials. The thermodynamic stability of some randomly selected superhard materials was confirmed by evaluating the full phonon dispersions in the Brillouin zone. The new carbon allotropes and ternary B–C–N structures possess a wide range of mechanical properties generally and Vickers hardness specifically. Through 2D Pearson's correlation map, we first reproduced the well-accepted explanation and relationship of the Vickers hardness of the generated structures with other mechanical properties such as shear modulus, bulk modulus, Pugh's ratio, universal anisotropy, and Poisson's ratio. We then propose two fundamentally new descriptors from the electronic level, namely local potential and electron localization function averaged over a unit cell, both of which exhibit a strong correlation with Vickers hardness. More importantly, these descriptors are easy to access from first-principles calculations (at least two orders of magnitude faster than the traditional calculation of elastic constants), and thus can serve as a fast and accurate approach for screening superhard materials. We also combined these new descriptors with known composition and structural descriptors in the machine learning training process. The new descriptors significantly enhance the performance of the trained machine learning model in predicting the Vickers hardness of unknown materials, which provides strong evidence for local potential and electron localization function to be considered in future high-throughput computation. This work unravels more fundamental but previously unexplored knowledge about superhard materials and the newly proposed electronic level descriptors are expected to accelerate the discovery of new superhard materials. 
    more » « less
  3. Abstract

    Halide perovskites have attracted great interest as promising next‐generation materials in optoelectronics, ranging from solar cells to light‐emitting diodes. Despite their exceptional optoelectronic properties and low cost, the prototypical organic–inorganic hybrid lead halide perovskites suffer from toxicity and low stability. Therefore, it is of high demand to search for stable and nontoxic alternatives to the hybrid lead halide perovskites. Recently, high‐throughput computational materials design has emerged as a powerful approach to accelerate the discovery of new halide perovskite compositions or even novel compounds beyond perovskites. In this review, we discuss how this approach discovers halide perovskites and beyond for optoelectronics. We first overview the background of halide perovskites and methodologies in high‐throughput computational design. Then, we focus on materials properties for different optoelectronic applications, and how they are assessed with materials descriptors. Finally, we review different studies in terms of specific materials types to discuss their design principles, screening results, and experimental verification.

    This article is categorized under:

    Structure and Mechanism > Computational Materials Science

    Electronic Structure Theory > Density Functional Theory

    more » « less
  4. Abstract

    The field of plasmonics aims to manipulate and control light through nanoscale structuring and choice of materials. Finding materials with low‐loss response to an applied optical field while exhibiting collective oscillations due to intraband transitions is an outstanding challenge. This is viewed as a materials selection problem that bridges the gap between the large number of candidate materials and the high computational cost to accurately compute their individual optical properties. To address this, online databases that compile computational data for numerous properties of tens to hundreds of thousands of materials are combined with first‐principles simulations and the Drude model. By means of density functional theory (DFT), a training set of geometry‐dependent plasmonic quality factors for ≈1000 materials is computed and subsequently random‐forest regressors are trained on these data. Descriptors are limited to symmetry, quantities obtained using the chemical formula, and the Mendeleev database, which allows to rapidly screen 7445 candidates on Materials Project. Using DFT to compute quality factors for the 233 most promising materials, AlCu3, ZnCu, and ZnGa3are identified as excellent potential new plasmonic metals. This finding is substantiated by analyzing their electronic structure and interband optical properties in detail.

    more » « less
  5. UV absorption is widely used for characterizing proteins structures. The mapping of UV spectra to atomic structure of proteins relies on expensive theoretical simulations, circumventing the heavy computational cost which involves repeated quantum-mechanical simulations of excited-state properties of many fluctuating protein geometries, which has been a long-time challenge. Here we show that a neural network machine-learning technique can predict electronic absorption spectra of N -methylacetamide (NMA), which is a widely used model system for the peptide bond. Using ground-state geometric parameters and charge information as descriptors, we employed a neural network to predict transition energies, ground-state, and transition dipole moments of many molecular-dynamics conformations at different temperatures, in agreement with time-dependent density-functional theory calculations. The neural network simulations are nearly 3,000× faster than comparable quantum calculations. Machine learning should provide a cost-effective tool for simulating optical properties of proteins. 
    more » « less