Cross-lingual annotation: a road map for low- and no-resource languages

Vigus, Meagan; Van Gysel, Jens E.; O'Gorman, Tim; Cowell, Andres; Vallejos, Rosa; Croft, William

Citation Details

This paper presents a “road map” for the annotation of semantic categories in typologically diverse languages, with potentially few linguistic resources, and often no existing computational resources. Past semantic annotation efforts have focused largely on high-resource languages, or relatively low-resource languages with a large number of native speakers. However, there are certain typological traits, namely the synthesis of multiple concepts into a single word, that are more common in languages with a smaller speech community. For example, what is expressed as a sentence in a more analytic language like English, may be expressed as a single word in a more synthetic language like Arapaho. This paper proposes solutions for annotating analytic and synthetic languages in a comparable way based on existing typological research, and introduces a road map for the annotation of languages with a dearth of resources. more »

Award ID(s):: 1764091

PAR ID:: 10401357

Author(s) / Creator(s):: Vigus, Meagan; Van Gysel, Jens E.; O'Gorman, Tim; Cowell, Andres; Vallejos, Rosa; Croft, William

Date Published:: 2020-01-01

Journal Name:: Proceedings of the Second International Workshop on Designing Meaning Representations (DMR 2020)

Page Range / eLocation ID:: 30-40

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this