Entity Linking via Explicit Mention-Mention Coreference Modeling

Agarwal, Dhruv; Angell, Rico; Monath, Nicholas; McCallum, Andrew

doi:10.18653/v1/2022.naacl-main.343

Learning representations of entity mentions is a core component of modern entity linking systems, both for candidate generation and for making linking predictions. In this paper, we present and empirically analyze a novel training approach for learning mention and entity representations that is based on building minimum spanning arborescences (i.e., directed spanning trees) over mentions and entities across documents to explicitly model mention coreference relationships. We demonstrate the efficacy of our approach by showing significant improvements in both candidate generation recall and linking accuracy on the Zero-Shot Entity Linking dataset and MedMentions, the largest publicly available biomedical dataset. In addition, we show that our improvements in candidate generation yield higher quality re-ranking models downstream, setting a new state-of-the-art result in linking accuracy on MedMentions. Finally, we demonstrate that our improved mention representations are also effective for the discovery of new entities via cross-document coreference.
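As a rough illustration of the structure named in the abstract, the sketch below builds a directed spanning tree rooted at one entity over a handful of mentions. It is not the paper's training procedure. The node names, the `WEIGHTS` table, and the `cost` and `spanning_arborescence` functions are all hypothetical, and the dissimilarities are assumed to be symmetric. Under that assumption, running Prim's algorithm from the entity and orienting edges away from it gives a minimum spanning arborescence; asymmetric costs would need a general algorithm such as Chu-Liu/Edmonds.

```python
import heapq

# Toy dissimilarities, made up for illustration: lower means more similar.
# "E" is an entity; "m1".."m3" are mentions assumed to refer to it.
WEIGHTS = {
    frozenset(("E", "m1")): 0.2,
    frozenset(("E", "m2")): 0.9,
    frozenset(("E", "m3")): 0.8,
    frozenset(("m1", "m2")): 0.3,
    frozenset(("m2", "m3")): 0.1,
    frozenset(("m1", "m3")): 0.7,
}


def cost(u, v):
    """Hypothetical symmetric dissimilarity between two nodes."""
    return WEIGHTS[frozenset((u, v))]


def spanning_arborescence(root, nodes, cost):
    """Prim's algorithm from `root`; returns {child: parent} for non-root nodes.

    Assumes symmetric costs, so the tree oriented away from `root`
    is a minimum spanning arborescence.
    """
    parent = {}
    visited = {root}
    heap = [(cost(root, v), root, v) for v in nodes if v != root]
    heapq.heapify(heap)
    while heap and len(visited) < len(nodes):
        _, u, v = heapq.heappop(heap)
        if v in visited:
            continue  # stale entry: v was already attached more cheaply
        visited.add(v)
        parent[v] = u
        for x in nodes:
            if x not in visited:
                heapq.heappush(heap, (cost(v, x), v, x))
    return parent


tree = spanning_arborescence("E", ["E", "m1", "m2", "m3"], cost)
for child, par in tree.items():
    print(f"{par} -> {child}")
```

In this toy graph, m2 and m3 are far from the entity but close to other mentions, so they attach to the entity through a chain of mentions rather than directly. That is the kind of mention-mention link that an explicit coreference structure can capture.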

Award ID(s): 1763618

NSF-PAR ID: 10356092

Author(s) / Creator(s): Agarwal, Dhruv; Angell, Rico; Monath, Nicholas; McCallum, Andrew

Date Published: 2022-07-01

Journal Name: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Page Range / eLocation ID: 4644 to 4658

Sponsoring Org: National Science Foundation

Free publicly accessible full text (conference paper): https://doi.org/10.18653/v1/2022.naacl-main.343
