ULSA: unified language of synthesis actions for the representation of inorganic synthesis protocols

Wang, Zheren; Cruse, Kevin; Fei, Yuxing; Chia, Ann; Zeng, Yan; Huo, Haoyan; He, Tanjin; Deng, Bowen; Kononova, Olga; Ceder, Gerbrand

doi:10.1039/d1dd00034a

Citation Details

ULSA: unified language of synthesis actions for the representation of inorganic synthesis protocols

Applying AI power to predict syntheses of novel materials requires high-quality, large-scale datasets. Extraction of synthesis information from scientific publications is still challenging, especially for extracting synthesis actions, because of the lack of a comprehensive labeled dataset using a solid, robust, and well-established ontology for describing synthesis procedures. In this work, we propose the first unified language of synthesis actions (ULSA) for describing inorganic synthesis procedures. We created a dataset of 3040 synthesis procedures annotated by domain experts according to the proposed ULSA scheme. To demonstrate the capabilities of ULSA, we built a neural network-based model to map arbitrary inorganic synthesis paragraphs into ULSA and used it to construct synthesis flowcharts for synthesis procedures. Analysis of the flowcharts showed that (a) ULSA covers essential vocabulary used by researchers when describing synthesis procedures and (b) it can capture important features of synthesis protocols. The present work focuses on the synthesis protocols for solid-state, sol–gel, and solution-based inorganic synthesis, but the language could be extended in the future to include other synthesis methods. This work is an important step towards creating a synthesis ontology and a solid foundation for autonomous robotic synthesis. more »

Award ID(s):: 1922372

PAR ID:: 10397930

Author(s) / Creator(s):: Wang, Zheren; Cruse, Kevin; Fei, Yuxing; Chia, Ann; Zeng, Yan; Huo, Haoyan; He, Tanjin; Deng, Bowen; Kononova, Olga; Ceder, Gerbrand

Date Published:: 2022-06-13

Journal Name:: Digital Discovery

Volume:: 1

Issue:: 3

ISSN:: 2635-098X

Page Range / eLocation ID:: 313 to 324

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1039/d1dd00034a

More Like this