Attending to Long-Distance Document Context for Sequence Labeling

Jörke, Matthew; Gillick, Jon; Sims, Matthew; Bamman, David

doi:10.18653/v1/2020.findings-emnlp.330

Citation Details

Attending to Long-Distance Document Context for Sequence Labeling

We present in this work a method for incorporating global context in long documents when making local decisions in sequence labeling problems like NER. Inspired by work in featurized log-linear models (Chieu and Ng, 2002; Sutton and McCallum, 2004), our model learns to attend to multiple mentions of the same word type in generating a representation for each token in context, extending that work to learning representations that can be incorporated into modern neural models. Attending to broader context at test time provides complementary information to pretraining (Gururangan et al., 2020), yields strong gains over equivalently parameterized models lacking such context, and performs best at recognizing entities with high TF-IDF scores (i.e., those that are important within a document). more »

Award ID(s):: 1813470

PAR ID:: 10274011

Author(s) / Creator(s):: Jörke, Matthew; Gillick, Jon; Sims, Matthew; Bamman, David

Date Published:: 2020-01-01

Journal Name:: Findings of the Association for Computational Linguistics: EMNLP 2020

Page Range / eLocation ID:: 3692 to 3704

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2020.findings-emnlp.330

More Like this