skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Dataset of solution-based inorganic materials synthesis procedures extracted from the scientific literature
Abstract The development of a materials synthesis route is usually based on heuristics and experience. A possible new approach would be to apply data-driven approaches to learn the patterns of synthesis from past experience and use them to predict the syntheses of novel materials. However, this route is impeded by the lack of a large-scale database of synthesis formulations. In this work, we applied advanced machine learning and natural language processing techniques to construct a dataset of 35,675 solution-based synthesis procedures extracted from the scientific literature. Each procedure contains essential synthesis information including the precursors and target materials, their quantities, and the synthesis actions and corresponding attributes. Every procedure is also augmented with the reaction formula. Through this work, we are making freely available the first large dataset of solution-based inorganic materials synthesis procedures.  more » « less
Award ID(s):
1922372
PAR ID:
10367521
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
Nature Publishing Group
Date Published:
Journal Name:
Scientific Data
Volume:
9
Issue:
1
ISSN:
2052-4463
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Applying AI power to predict syntheses of novel materials requires high-quality, large-scale datasets. Extraction of synthesis information from scientific publications is still challenging, especially for extracting synthesis actions, because of the lack of a comprehensive labeled dataset using a solid, robust, and well-established ontology for describing synthesis procedures. In this work, we propose the first unified language of synthesis actions (ULSA) for describing inorganic synthesis procedures. We created a dataset of 3040 synthesis procedures annotated by domain experts according to the proposed ULSA scheme. To demonstrate the capabilities of ULSA, we built a neural network-based model to map arbitrary inorganic synthesis paragraphs into ULSA and used it to construct synthesis flowcharts for synthesis procedures. Analysis of the flowcharts showed that (a) ULSA covers essential vocabulary used by researchers when describing synthesis procedures and (b) it can capture important features of synthesis protocols. The present work focuses on the synthesis protocols for solid-state, sol–gel, and solution-based inorganic synthesis, but the language could be extended in the future to include other synthesis methods. This work is an important step towards creating a synthesis ontology and a solid foundation for autonomous robotic synthesis. 
    more » « less
  2. Chalcogenide perovskites are promising semiconductor materials with attractive optoelectronic properties and appreciable stability, making them enticing candidates for photovoltaics and related electronic applications. Traditional synthesis methods for these materials have long suffered from high‐temperature requirements of 800–1000 °C. However, the recently developed solution processing route provides a way to circumvent this. By utilizing barium thiolate and ZrH2, this method is capable of synthesizing BaZrS3perovskite at modest temperatures (500–600 °C), generating crystalline domains on the order of hundreds of nanometers in size. Herein, a systematic study of this solution processing route is done to gain a mechanistic understanding of the process and to supplement the development of device quality fabrication methodologies. A barium polysulfide liquid flux is identified as playing a key role in the rapid synthesis of large‐grain BaZrS3perovskite at modest temperatures. Additionally, this mechanism is successfully extended to the related BaHfS3perovskite. The reported findings identify viable precursors, key temperature regimes, and reaction conditions that are likely to enable the large‐grain chalcogenide perovskite growth, essential toward the formation of device‐quality thin films. 
    more » « less
  3. Abstract Despite the groundbreaking advancements in the synthesis of inorganic lead halide perovskite (LHP) nanocrystals (NCs), stimulated from their intriguing size‐, composition‐, and morphology‐dependent optical and optoelectronic properties, their formation mechanism through the hot‐injection (HI) synthetic route is not well‐understood. In this work, for the first time, in‐flow HI synthesis of cesium lead iodide (CsPbI3) NCs is introduced and a comprehensive understanding of the interdependent competing reaction parameters controlling the NC morphology (nanocube vs nanoplatelet) and properties is provided. Utilizing the developed flow synthesis strategy, a change in the CsPbI3NC formation mechanism at temperatures higher than 150 °C, resulting in different CsPbI3morphologies is revealed. Through comparison of the flow‐ versus flask‐based synthesis, deficiencies of batch reactors in reproducible and scalable synthesis of CsPbI3NCs with fast formation kinetics are demonstrated. The developed modular flow chemistry route provides a new frontier for high‐temperature studies of solution‐processed LHP NCs and enables their consistent and reliable continuous nanomanufacturing for next‐generation energy technologies. 
    more » « less
  4. Phase change materials (PCMs) are important building blocks in solid-state memory and photonic devices. Solution-based processing promises large-area, cost-effective, conformal coating of optical PCMs (O-PCMs) for photonic applications. In this work, a solution processing route was developed for Ge2Sb2Se4Te1(GSST), a target PCM of interest due to its large optical contrast, broadband transparency, and improved glass-forming capability. An alkahest solvent mixture of ethanedithiol and ethylenediamine was used as a solvent system to fabricate solution-derived GSST thin films and films from these solutions were prepared and characterized using SEM, XRD, and Raman spectroscopy. 
    more » « less
  5. null (Ed.)
    In this work, an inverse design method for multi-input multi-output (MIMO) metastructured devices is developed. Large-scale inverse design problems are difficult to solve directly and often require heuristic methods or design optimization to find a solution. Inherent errors introduced by heuristic methods makes design optimization a more promising route to the realization of high performance devices. Here, a fast frequency domain solver for grids of Y-parameter matrices is developed. The solver is used together with an adjoint-based optimization routine to solve inverse metastructured design problems. The design procedure is demonstrated through the realization of a planar beamforming network for a multi-beam antenna. 
    more » « less