Introducing a Parsed Corpus of Historical High German

Sapp, Christopher D; Evans, Elliott; Sprouse, Rex; Dakota, Daniel

Citation Details

We outline the ongoing development of the Indiana Parsed Corpus of (Historical) High German. Once completed, this corpus will fill the gap in Penn-style treebanks for Germanic languages by spanning High German from 1050 to 1950. This paper describes the process of building the corpus: selection of texts, decisions on part-of-speech tags and other labels, the process of annotation, and illustrative annotation issues unique to historical High German. The construction of the corpus has led to a refinement of the Penn labels, tailored to the particulars of this language. more »

Award ID(s):: 2314522

PAR ID:: 10530419

Author(s) / Creator(s):: Sapp, Christopher D; Evans, Elliott; Sprouse, Rex; Dakota, Daniel

Editor(s):: Calzolari, Nicoletta; Kan, Min-Yen; Hoste, Veronique; Lenci, Alessandro; Sakti, Sakriani; Xue, Nianwen

Publisher / Repository:: ACL Anthology

Date Published:: 2024-03-27

Format(s):: Medium: X

Location:: https://aclanthology.org/2024.lrec-main.807/

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
The DOI is not currently available.

More Like this