Identifying Chemical Reactions and Their Associated Attributes in Patents

Mahendran, Darshini; Gurdin, Gabrielle; Lewinski, Nastassja; Tang, Christina; McInnes, Bridget T.

doi:10.3389/frma.2021.688353

Citation Details

Identifying Chemical Reactions and Their Associated Attributes in Patents

Chemical patents are an essential source of information about novel chemicals and chemical reactions. However, with the increasing volume of such patents, mining information about these chemicals and chemical reactions has become a time-intensive and laborious endeavor. In this study, we present a system to extract chemical reaction events from patents automatically. Our approach consists of two steps: 1) named entity recognition (NER)—the automatic identification of chemical reaction parameters from the corresponding text, and 2) event extraction (EE)—the automatic classifying and linking of entities based on their relationships to each other. For our NER system, we evaluate bidirectional long short-term memory (BiLSTM)-based and bidirectional encoder representations from transformer (BERT)-based methods. For our EE system, we evaluate BERT-based, convolutional neural network (CNN)-based, and rule-based methods. We evaluate our NER and EE components independently and as an end-to-end system, reporting the precision, recall, and F 1 score. Our results show that the BiLSTM-based method performed best at identifying the entities, and the CNN-based method performed best at extracting events. more »

Award ID(s):: 1651957

PAR ID:: 10312160

Author(s) / Creator(s):: Mahendran, Darshini; Gurdin, Gabrielle; Lewinski, Nastassja; Tang, Christina; McInnes, Bridget T.

Date Published:: 2021-07-12

Journal Name:: Frontiers in Research Metrics and Analytics

Volume:: 6

ISSN:: 2504-0537

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.3389/frma.2021.688353

More Like this