Revisiting the Effects of Leakage on Dependency Parsing

Krasner, Nathaniel; Wanner, Miriam; Anastasopoulos, Antonios

Recent work by Søgaard (2020) showed that, treebank size aside, overlap between training and test graphs (termed leakage) explains more of the observed variation in dependency parsing performance than other explanations. In this work we revisit this claim, testing it on more models and languages. We find that it only holds for zero-shot cross-lingual settings. We then propose a more fine-grained measure of such leakage which, unlike the original measure, not only explains but also correlates with observed performance variation.

Award ID(s): 1757064

PAR ID: 10327898

Author(s) / Creator(s): Krasner, Nathaniel; Wanner, Miriam; Anastasopoulos, Antonios

Date Published: 2022-05-01

Journal Name: Transactions of the Association for Computational Linguistics

Volume: ACL 2022

ISSN: 2307-387X

Page Range / eLocation ID: 2925-2934

Format(s): Medium: X

Sponsoring Org: National Science Foundation
