An Evaluation of Croatian ASR Models for Čakavian Transcription

Zhang, Shulin; Hale, John; Renwick, Margaret; Vrzić, Zvjezdana; Langston, Keith

Citation Details

To assist in the documentation of Čakavian, an endangered language variety closely related to Croatian, we test four currently available ASR models that are trained with Croatian data and assess their performance in the transcription of Čakavian audio data. We compare the models’ word error rates, analyze the word-level error types, and showcase the most frequent Deletion and Substitution errors. The evaluation results indicate that the best-performing system for transcribing Čakavian was a CTC-based variant of the Conformer model. more »

Award ID(s):: 2220425

PAR ID:: 10525927

Author(s) / Creator(s):: Zhang, Shulin; Hale, John; Renwick, Margaret; Vrzić, Zvjezdana; Langston, Keith

Publisher / Repository:: ACL Anthology

Date Published:: 2024-05-31

Subject(s) / Keyword(s):: Croatian Čakavian ASR transcription endangered language documentation

Format(s):: Medium: X

Location:: Torino, Italy

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
The DOI is not currently available.

More Like this