Abstract The development of a materials synthesis route is usually based on heuristics and experience. A possible new approach would be to apply data-driven approaches to learn the patterns of synthesis from past experience and use them to predict the syntheses of novel materials. However, this route is impeded by the lack of a large-scale database of synthesis formulations. In this work, we applied advanced machine learning and natural language processing techniques to construct a dataset of 35,675 solution-based synthesis procedures extracted from the scientific literature. Each procedure contains essential synthesis information including the precursors and target materials, their quantities, and the synthesis actions and corresponding attributes. Every procedure is also augmented with the reaction formula. Through this work, we are making freely available the first large dataset of solution-based inorganic materials synthesis procedures.
more »
« less
Precursor recommendation for inorganic synthesis by machine learning materials similarity from scientific literature
Synthesis prediction is a key accelerator for the rapid design of advanced materials. However, determining synthesis variables such as the choice of precursor materials is challenging for inorganic materials because the sequence of reactions during heating is not well understood. In this work, we use a knowledge base of 29,900 solid-state synthesis recipes, text-mined from the scientific literature, to automatically learn which precursors to recommend for the synthesis of a novel target material. The data-driven approach learns chemical similarity of materials and refers the synthesis of a new target to precedent synthesis procedures of similar materials, mimicking human synthesis design. When proposing five precursor sets for each of 2654 unseen test target materials, the recommendation strategy achieves a success rate of at least 82%. Our approach captures decades of heuristic synthesis data in a mathematical form, making it accessible for use in recommendation engines and autonomous laboratories.
more »
« less
- Award ID(s):
- 1922372
- PAR ID:
- 10472899
- Publisher / Repository:
- American Association for the Advancement of Science
- Date Published:
- Journal Name:
- Science Advances
- Volume:
- 9
- Issue:
- 23
- ISSN:
- 2375-2548
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract A machine learning (ML) guided approach is presented for the accelerated optimization of chemical vapor deposition (CVD) synthesis of 2D materials toward the highest quality, starting from low‐quality or unsuccessful synthesis conditions. Using 26 sets of these synthesis conditions as the initial training dataset, our method systematically guides experimental synthesis towards optoelectronic‐grade monolayer MoS2flakes. A‐exciton linewidth (σA) as narrow as 38 meV could be achieved in 2D MoS2flakes after only an additional 35 trials (reflecting 15% of the full factorial design dataset for training purposes). In practical terms, this reflects a decrease of the possible experimental time to optimize the parameters from up to one year to about two months. This remarkable efficiency was achieved by formulating a constrained sequencing optimization problem solved via a combination of constraint learning and Bayesian Optimization with the narrowness of σAas the single target metric. By employing graph‐based semi‐supervised learning with data acquired through a multi‐criteria sampling method, the constraint model effectively delineates and refines the feasible design space for monolayer flake production. Additionally, the Gaussian Process regression effectively captures the relationships between synthesis parameters and outcomes, offering high predictive capability along with a measure of prediction uncertainty. This method is scalable to a higher number of synthesis parameters and target metrics and is transferrable to other materials and types of reactors. This study envisions that this method will be fundamental for CVD and similar techniques in the future.more » « less
-
Abstract Autonomous experimentation–or self-driving labs–offers a systematic approach to accelerate materials discovery by integrating automated synthesis, characterization, and data-driven decision-making. We present a closed-loop workflow for the on-demand synthesis and structural characterization of colloidal gold nanoparticles, enabling direct mapping from composition to nanoscale structure. Our framework leverages differentiable models of spectral shape to address two central tasks in self-driving labs: (a) phase mapping, or identifying compositional regions with distinct structural behavior; and (b) material retrosynthesis, or optimizing compositions for target structure. Using functional data analysis, we develop a data-driven model with generative pre-training, active learning, and high-throughput experiments to predict spectral responses across composition space. We demonstrate the approach on seed-mediated growth of gold nanoparticles, showcasing its ability to extract design rules, reveal secondary interactions, and efficiently navigate morphology space. Gradient-based optimization of the models enables inverse design, making this a unified platform.more » « less
-
Current synthesis techniques for metal oxide (MOx)-supported catalysts have certain limitations of undesired target loading, ineffective dispersion of active species over the surface, uncontrolled particle size of active species, and complicated synthesis steps. We developed a one-pot chemical vapor deposition (OP-CVD) methodology; by using which a solid metal precursor forms a vapor in a controlled condition and gets supported over the surrounding matrix. The theoretical stability followed by experimental validation using TGA is crucial for selecting the metal precursors. Three simple steps viz. premixing, dispersion, and rapid fixation by calcination are involved in the catalyst development via the OP-CVD approach. This study solely focused on the synthesis of 3d transition MOx over ceria support. The physicochemical characterizations of the prepared catalysts were performed by XRD, ICP-OES, SEM-EDX, CO pulse chemisorption, XANES, and EXAFS analyses to understand the crystal structure of involved species, target metal loading, dispersion, and particle size and prove the feasibility and viability of OP-CVD. The prepared catalysts were further tested for reverse water gas shift (RWGS) reaction to link their structural information with activity. The RWGS reaction data showed that the CO activity and CO selectivity were metal - and metal precursor-dependent. Higher CO activity of > 0.1 mol/h g-cat was observed for Cu and Co-based catalysts, with CO selectivity of ~100 %. This study provides an opportunity to produce effcient supported catalysts in a convenient way, providing effective catalytic activity.more » « less
-
Programming structures to realize any prescribed mechanical response under large deformation is highly desired for various functionalities, such as actuation and energy trapping. Yet, the use of a single material phase and heuristically developed structural patterns leads to restricted design space and potential failure to achieve specific target behaviors. Here, through a free-form inverse design approach, multiple hyperelastic materials with distinct properties are optimally synthesized into composite structures to precisely achieve arbitrary and extreme prescribed responses under large deformations. The digitally synthesized structures exhibit organic shapes and motions with irregular distributions of material phases. Within the structures, different materials play distinct roles yet seamlessly collaborate through sophisticated deformation mechanisms to produce the target behaviors, some of which are unachievable by a single material. While complex in geometry and material heterogeneity, the discovered structures are effectively manufactured via multimaterial fabrication with different polydimethylsiloxane (PDMS) elastomers with distinct behaviors and their highly nonlinear responses are physically and accurately realized in experiments. To enhance programmability, the synthesized structures are heteroassembled into architectures that exhibit highly complex yet navigable responses. The proposed synthesis, multimaterial fabrication, and heteroassembly strategy can be utilized to design function-oriented and situation-specific mechanical devices for a wide range of applications.more » « less
An official website of the United States government

