Minimally-Supervised Morphological Segmentation using Adaptor Grammars with Linguistic Priors

Eskander, Ramy; Lowry, Cass; Khandagale, Sujay; Callejas, Francesca; Klavans, Judith; Polinsky, Maria; Muresan, Smaranda

doi:10.18653/v1/2021.findings-acl.347

Citation Details

Minimally-Supervised Morphological Segmentation using Adaptor Grammars with Linguistic Priors

With the increasing interest in low-resource languages, unsupervised morphological segmentation has become an active area of research, where approaches based on Adaptor Grammars achieve state-of-the-art results. We demonstrate the power of harnessing linguistic knowledge as priors within Adaptor Grammars in a minimally-supervised learning fashion. We introduce two types of priors: 1) grammar definition, where we design language-specific grammars; and 2) linguistprovided affixes, collected by an expert in the language and seeded into the grammars. We use Japanese and Georgian as respective case studies for the two types of priors and introduce new datasets for these languages, with gold morphological segmentation for evaluation. We show that the use of priors results in error reductions of 8.9 % and 34.2 %, respectively, over the equivalent state-of-the-art unsupervised system more »

Award ID(s):: 1941733 1941742

PAR ID:: 10320447

Author(s) / Creator(s):: Eskander, Ramy; Lowry, Cass; Khandagale, Sujay; Callejas, Francesca; Klavans, Judith; Polinsky, Maria; Muresan, Smaranda

Date Published:: 2021-08-01

Journal Name:: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2021.findings-acl.347

More Like this