Title: An Analysis of Deep Contextual Word Embeddings and Neural Architectures for Toponym Mention Detection in Scientific Publications
Toponym detection in scientific papers is an open task and a key first step in enriching documents with place entities. We examine three common neural architectures in NLP: 1) a convolutional neural network, 2) a multi-layer perceptron (both applied in a sliding-window context), and 3) a bidirectional LSTM, and we apply contextual and non-contextual word embedding layers to each model. We find that deep contextual word embeddings improve the performance of the bi-LSTM with CRF architecture, which achieves its best results when multiple layers of deep contextual embeddings are concatenated. Our best-performing model achieves an average F1 of 0.910 under overlap macro evaluation, exceeding previous state-of-the-art models on the toponym detection task.
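As a rough sketch of the best-performing configuration described above (not the authors' released code), the PyTorch fragment below concatenates several layers of precomputed contextual embeddings before a bidirectional LSTM tagger. The layer count, dimensions, tag set, and the omission of the CRF output layer are all simplifying assumptions.

```python
# Minimal sketch: concatenating multiple contextual-embedding layers
# before a BiLSTM tagger. Hyperparameters are illustrative assumptions.
import torch
import torch.nn as nn

class BiLSTMTagger(nn.Module):
    def __init__(self, embed_dim=1024, num_layers_concat=3, hidden=256, num_tags=3):
        super().__init__()
        # Input is the concatenation of all contextual embedding layers.
        self.lstm = nn.LSTM(embed_dim * num_layers_concat, hidden,
                            batch_first=True, bidirectional=True)
        # A CRF layer would normally sit on top of these per-token scores;
        # it is omitted here to keep the sketch dependency-free.
        self.scorer = nn.Linear(2 * hidden, num_tags)

    def forward(self, layer_embeddings):
        # layer_embeddings: (batch, seq_len, num_layers, embed_dim),
        # e.g. the three ELMo layers for each token.
        batch, seq_len, n_layers, dim = layer_embeddings.shape
        x = layer_embeddings.reshape(batch, seq_len, n_layers * dim)
        out, _ = self.lstm(x)
        return self.scorer(out)  # per-token tag scores (e.g. B/I/O)

# Example: 2 sentences, 10 tokens, 3 embedding layers of width 1024.
scores = BiLSTMTagger()(torch.randn(2, 10, 3, 1024))
print(scores.shape)  # torch.Size([2, 10, 3])
```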
Award ID(s):
1846017
PAR ID:
10120524
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Proceedings of the Workshop on Extracting Structured Knowledge from Scientific Publications
Page Range / eLocation ID:
48 to 56
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. SemEval-2019 Task 12 is toponym resolution in scientific papers. We focus on Subtask 1: Toponym Detection, the identification of spans of text for place names mentioned in a document. We propose two methods: 1) a sliding-window convolutional neural network using ELMo embeddings (cnn-elmo), and 2) a sliding-window multi-layer perceptron using ELMo embeddings (mlp-elmo). We also submit a bidirectional LSTM with Conditional Random Fields (bi-LSTM) as a strong baseline, given its state-of-the-art performance on the Named Entity Recognition (NER) task. Our best-performing model is cnn-elmo, with an F1 of 0.844, below the bi-LSTM's F1 of 0.862 when evaluated on overlap macro detection. Eight teams participated in this subtask, with a total of 21 submissions.
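For illustration only, a minimal sliding-window CNN classifier of the kind described in this entry could look like the sketch below; the window size, embedding width, channel count, and binary output are assumptions, not the submitted system's actual hyperparameters.

```python
# Hedged sketch of a sliding-window CNN over contextual embeddings:
# classify the window's center token as toponym vs. not.
import torch
import torch.nn as nn

class WindowCNN(nn.Module):
    def __init__(self, embed_dim=1024, window=5, channels=64):
        super().__init__()
        self.conv = nn.Conv1d(embed_dim, channels, kernel_size=3, padding=1)
        self.out = nn.Linear(channels * window, 2)  # toponym vs. not

    def forward(self, window_embeds):
        # window_embeds: (batch, window, embed_dim) -- embeddings for the
        # target token and its neighbors.
        x = window_embeds.transpose(1, 2)        # (batch, dim, window)
        x = torch.relu(self.conv(x)).flatten(1)  # (batch, channels*window)
        return self.out(x)                       # score for the center token

logits = WindowCNN()(torch.randn(8, 5, 1024))
print(logits.shape)  # torch.Size([8, 2])
```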
  2. In this paper, we analyze several neural network designs (and their variations) for sentence pair modeling and compare their performance extensively across eight datasets, including paraphrase identification, semantic textual similarity, natural language inference, and question answering tasks. Although most of these models have claimed state-of-the-art performance, the original papers often reported on only one or two selected datasets. We provide a systematic study and show that (i) encoding contextual information with an LSTM and modeling inter-sentence interactions are critical, (ii) Tree-LSTM does not help as much as previously claimed but surprisingly improves performance on Twitter datasets, (iii) the Enhanced Sequential Inference Model is the best so far for larger datasets, while the Pairwise Word Interaction Model achieves the best performance when less data is available. We release our implementations as an open-source toolkit.
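A minimal sentence-pair model in the spirit of the architectures compared in this entry might pair a shared BiLSTM encoder with simple interaction features ([u; v; |u-v|; u*v]); all dimensions, the pooling choice, and the class count here are illustrative assumptions.

```python
# Sketch of a shared-encoder sentence-pair classifier with
# element-wise interaction features. Not any specific paper's model.
import torch
import torch.nn as nn

class PairModel(nn.Module):
    def __init__(self, embed_dim=300, hidden=256, num_classes=3):
        super().__init__()
        self.encoder = nn.LSTM(embed_dim, hidden, batch_first=True,
                               bidirectional=True)
        self.classifier = nn.Linear(8 * hidden, num_classes)

    def encode(self, x):
        out, _ = self.encoder(x)      # (batch, seq, 2*hidden)
        return out.max(dim=1).values  # max-pool over time

    def forward(self, sent_a, sent_b):
        u, v = self.encode(sent_a), self.encode(sent_b)
        feats = torch.cat([u, v, (u - v).abs(), u * v], dim=-1)
        return self.classifier(feats)

# Two batches of word embeddings with different sequence lengths.
logits = PairModel()(torch.randn(4, 12, 300), torch.randn(4, 9, 300))
print(logits.shape)  # torch.Size([4, 3])
```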
  3. Transformer-based language models such as BERT and its variants have found widespread use in natural language processing (NLP). A common way of using these models is to fine-tune them to improve their performance on a specific task. However, it is currently unclear how the fine-tuning process affects the underlying structure of the word embeddings from these models. We present TopoBERT, a visual analytics system for interactively exploring the fine-tuning process of various transformer-based models – across multiple fine-tuning batch updates, subsequent layers of the model, and different NLP tasks – from a topological perspective. The system uses the mapper algorithm from topological data analysis (TDA) to generate a graph that approximates the shape of a model’s embedding space for an input dataset. TopoBERT enables its users (e.g. experts in NLP and linguistics) to (1) interactively explore the fine-tuning process across different model-task pairs, (2) visualize the shape of embedding spaces at multiple scales and layers, and (3) connect linguistic and contextual information about the input dataset with the topology of the embedding space. Using TopoBERT, we provide various use cases to exemplify its applications in exploring fine-tuned word embeddings. We further demonstrate the utility of TopoBERT, which enables users to generate insights about the fine-tuning process and provides support for empirical validation of these insights. 
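As a hedged sketch of the mapper step that TopoBERT builds on, the snippet below runs KeplerMapper (an open-source mapper implementation from topological data analysis) over a placeholder embedding matrix. The projection, cover parameters, and clusterer are illustrative choices, not the system's actual settings.

```python
# Approximate the "shape" of an embedding space with the mapper algorithm.
import numpy as np
import kmapper as km
from sklearn.cluster import DBSCAN

# Placeholder for contextual word embeddings (e.g. one BERT layer).
embeddings = np.random.randn(500, 768)

mapper = km.KeplerMapper(verbose=0)
# Project (lens) each point to a low-dimensional summary.
lens = mapper.fit_transform(embeddings, projection="l2norm")
# Cover the lens with overlapping bins, cluster within each bin,
# and connect clusters that share points to form the mapper graph.
graph = mapper.map(lens, embeddings,
                   cover=km.Cover(n_cubes=15, perc_overlap=0.4),
                   clusterer=DBSCAN(eps=5.0, min_samples=3))
mapper.visualize(graph, path_html="embedding_shape.html")
```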
  4. Lateral movement is a key stage of system compromise used by advanced persistent threats. Detecting it is no simple task. When network host logs are abstracted into discrete temporal graphs, the problem can be reframed as anomalous edge detection in an evolving network. Research in modern deep graph learning techniques has produced many creative and complicated models for this task. However, as is the case in many machine learning fields, the generality of models is of paramount importance for accuracy and scalability during training and inference. In this article, we propose a formalized approach to this problem with a framework we call Euler. It consists of a model-agnostic graph neural network stacked upon a model-agnostic sequence encoding layer, such as a recurrent neural network. Models built according to the Euler framework can easily distribute their graph convolutional layers across multiple machines for large performance improvements. Additionally, we demonstrate that Euler-based models are as good as, or better than, every state-of-the-art approach to anomalous link detection and prediction that we tested. As anomaly-based intrusion detection systems, our models efficiently identified anomalous connections between entities with high precision and outperformed all other unsupervised techniques for anomalous lateral movement detection. We also show that, as a piece of a larger anomaly detection pipeline, Euler models perform well enough for use in real-world systems. With more advanced, yet still lightweight, alerting mechanisms ingesting the embeddings produced by Euler models, precision is boosted from 0.243 to 0.986 on real-world network traffic.
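The Euler pattern (a snapshot-wise graph encoder feeding a sequence model, with inner-product scoring of candidate edges) can be sketched as below; the stand-in graph convolution, dimensions, and scoring rule are assumptions rather than the authors' implementation.

```python
# Sketch of the GNN-over-snapshots + RNN-over-time pattern for
# anomalous edge detection in a discrete temporal graph.
import torch
import torch.nn as nn

class EulerStyle(nn.Module):
    def __init__(self, feat_dim=32, hidden=64):
        super().__init__()
        self.gcn = nn.Linear(feat_dim, hidden)  # stand-in graph conv weight
        self.rnn = nn.GRU(hidden, hidden, batch_first=True)

    def forward(self, adjs, feats):
        # adjs: (T, N, N) row-normalized adjacency per snapshot
        # feats: (T, N, feat_dim) node features per snapshot
        per_step = torch.relu(self.gcn(adjs @ feats))  # (T, N, hidden)
        # Each node's embeddings over time form a sequence for the GRU.
        seq = per_step.transpose(0, 1)                 # (N, T, hidden)
        out, _ = self.rnn(seq)
        return out.transpose(0, 1)                     # (T, N, hidden)

    def edge_score(self, z_t, src, dst):
        # Lower inner product -> more anomalous under this toy scoring.
        return (z_t[src] * z_t[dst]).sum(-1)

T, N = 6, 20
model = EulerStyle()
# softmax rows stand in for a normalized adjacency matrix.
z = model(torch.softmax(torch.randn(T, N, N), -1), torch.randn(T, N, 32))
print(model.edge_score(z[-1], torch.tensor([0, 1]), torch.tensor([2, 3])))
```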
  5. Although models using contextual word embeddings have achieved state-of-the-art results on a host of NLP tasks, little is known about exactly what information these embeddings encode about the context words that they are understood to reflect. To address this question, we introduce a suite of probing tasks that enable fine-grained testing of contextual embeddings for encoding of information about surrounding words. We apply these tasks to examine the popular BERT, ELMo and GPT contextual encoders, and find that each of our tested information types is indeed encoded as contextual information across tokens, often with near-perfect recoverability -- but the encoders vary in which features they distribute to which tokens, how nuanced their distributions are, and how robust the encoding of each feature is to distance. We discuss implications of these results for how different types of models break down and prioritize word-level context information when constructing token embeddings. 
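A probing classifier of the kind described in this entry can be as simple as a linear model trained to recover a feature of a neighboring token from the current token's contextual embedding. The sketch below uses random stand-in data, so the probe, the target feature, and the dimensions are all assumptions rather than the paper's released suite.

```python
# Illustrative probing setup: a linear probe over contextual embeddings.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 768))   # stand-in contextual token embeddings
y = rng.integers(0, 2, size=2000)  # stand-in binary feature of token i+1

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
probe = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
# Recoverability of the neighbor's feature = the probe's held-out accuracy.
print(f"probe accuracy: {probe.score(X_te, y_te):.3f}")
```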