Title: Adapting Large Language Models for Character-based Augmentative and Alternative Communication
Users of Augmentative and Alternative Communication (AAC) may write letter-by-letter via an interface that uses a character language model. However, most state-of-the-art large pretrained language models predict subword tokens of variable length. We investigate how to practically use such models to make accurate and efficient character predictions. Our algorithm for producing character predictions from a subword large language model (LLM) provides more accurate predictions than using a classification layer, a byte-level LLM, or an n-gram model. Additionally, we investigate a domain adaptation procedure based on a large dataset of sentences we curated by scoring how useful each sentence might be for spoken or written AAC communication. We find our procedure further improves model performance on simple, conversational text.
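To make the prediction task concrete, here is a minimal sketch (not the paper's exact algorithm) of one way to derive a next-character distribution from a subword LLM: marginalize the next-token distribution by each candidate token's first character. It assumes a Hugging Face-style causal LM; the gpt2 checkpoint and the helper name are illustrative only, and the paper's method additionally handles tokenizations that span the character boundary.

    from collections import defaultdict
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained("gpt2")
    lm = AutoModelForCausalLM.from_pretrained("gpt2")

    def next_char_probs(context: str) -> dict:
        # One forward pass for the next-token distribution.
        ids = tok(context, return_tensors="pt").input_ids
        with torch.no_grad():
            logits = lm(ids).logits[0, -1]
        probs = torch.softmax(logits, dim=-1)
        # Group token probability mass by each token string's first character.
        # This single-step grouping is an approximation; a full treatment
        # marginalizes over all tokenizations of the continuation.
        char_probs = defaultdict(float)
        for token_id, p in enumerate(probs.tolist()):
            s = tok.decode([token_id])
            if s:
                char_probs[s[0]] += p
        return dict(char_probs)

    # Example: top five character predictions after a conversational prefix.
    top = sorted(next_char_probs("I want to go").items(), key=lambda kv: -kv[1])[:5]
    print(top)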
Award ID(s):
1750193 2402876
PAR ID:
10648029
Author(s) / Creator(s):
Publisher / Repository:
Association for Computational Linguistics
Date Published:
Page Range / eLocation ID:
15273 to 15291
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Making good letter or word predictions can help accelerate the communication of users of high-tech AAC devices. This is particularly important for real-time person-to-person conversations. We investigate whether performing speech recognition on the speaking side of a conversation can improve language-model-based predictions. We compare the accuracy of three plausible microphone deployment options and the accuracy of two commercial speech recognition engines (Google and IBM Watson). We found that despite recognition word error rates of 7-16%, our ensemble of N-gram and recurrent neural network language models (interpolation sketched after this list) made predictions nearly as good as when they used the reference transcripts.
  2. What are the units of text that we want to model? From bytes to multi-word expressions, text can be analyzed and generated at many granularities. Until recently, most natural language processing (NLP) models operated over words, treating those as discrete and atomic tokens, but starting with byte-pair encoding (BPE), subword-based approaches have become dominant in many areas, enabling small vocabularies while still allowing for fast inference. Are character-level models, or even byte-level processing, the end of the road? In this survey, we connect several lines of work from the pre-neural and neural eras, showing how hybrid word-and-character approaches as well as subword approaches based on learned segmentation have been proposed and evaluated. We conclude that there is, and likely never will be, a single silver-bullet solution for all applications, and that thinking seriously about tokenization remains important for many applications.
  3. Uncertainty decomposition refers to the task of decomposing the total uncertainty of a predictive model into aleatoric (data) uncertainty, resulting from inherent randomness in the data-generating process, and epistemic (model) uncertainty, resulting from missing information in the model's training data. In large language models (LLMs) specifically, identifying sources of uncertainty is an important step toward improving reliability, trustworthiness, and interpretability, but it remains an open research question. In this paper, we introduce an uncertainty decomposition framework for LLMs, called input clarification ensembling, which can be applied to any pre-trained LLM. Our approach generates a set of clarifications for the input, feeds them into an LLM, and ensembles the corresponding predictions. We show that, when aleatoric uncertainty arises from ambiguity or under-specification in LLM inputs, this approach makes it possible to factor an (un-clarified) LLM's predictions into separate aleatoric and epistemic terms, using a decomposition similar to the one employed by Bayesian neural networks (see the entropy-decomposition sketch after this list). Empirical evaluations demonstrate that input clarification ensembling provides accurate and reliable uncertainty quantification on several language processing tasks.
  4. Calzolari, Nicoletta; Huang, Chu-Ren; Kim, Hansaem; Pustejovsky, James; Wanner, Leo; Choi, Key-Sun; Ryu, Pum-Mo; Chen, Hsin-Hsi; Donatelli, Lucia; Ji, Heng (Ed.)
    "Multilingual neural machine translation (MNMT) jointly trains a shared model for translation with multiple language pairs. However, traditional subword-based MNMT approaches suffer from out-of-vocabulary (OOV) issues and representation bottleneck, which often degrades translation performance on certain language pairs. While byte tokenization is used to tackle the OOV problems in neural machine translation (NMT), until now its capability has not been validated in MNMT. Additionally, existing work has not studied how byte encoding can benefit endangered language translation to our knowledge. We propose a byte-based multilingual neural machine translation system (BMNMT) to alleviate the representation bottleneck and improve translation performance in endangered languages. Furthermore, we design a random byte mapping method with an ensemble prediction to enhance our model robustness. Experimental results show that our BMNMT consistently and significantly outperforms subword/word-based baselines on twelve language pairs up to +18.5 BLEU points, an 840{\%} relative improvement.", 
    more » « less
  5. The increasing popularity of outdoor recreational activities (such as hiking and biking) has boosted demand for a conversational AI system that provides informative and personalized suggestions about outdoor trails. Challenges arise in (1) providing accurate outdoor trail information via conversational AI, and (2) enabling usable and efficient recommendation services. To address these challenges, this paper discusses preliminary and practical lessons learned from developing Judy, an outdoor trail recommendation chatbot based on a large language model (LLM) with retrieval-augmented generation (RAG; the core loop is sketched after this list). To gain concrete system insights, we performed case studies with outdoor trails in Connecticut (CT), US. We conducted web-based data collection, outdoor trail data management, and LLM performance studies of the RAG-based recommendation. Our experimental results demonstrate the accuracy, effectiveness, and usability of Judy in recommending outdoor trails with the RAG-based LLM.
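For item 1, a standard way to ensemble an n-gram and a recurrent neural network language model is linear interpolation of their per-character distributions. A minimal sketch, with a placeholder weight rather than the paper's tuned value:

    def ensemble_char_probs(ngram_probs: dict, rnn_probs: dict, lam: float = 0.5) -> dict:
        # Linear interpolation: lam weights the n-gram model, (1 - lam) the RNN.
        chars = set(ngram_probs) | set(rnn_probs)
        return {c: lam * ngram_probs.get(c, 0.0) + (1 - lam) * rnn_probs.get(c, 0.0)
                for c in chars}

The recognized partner turn from the ASR engine would simply be prepended to each component model's conditioning context before computing ngram_probs and rnn_probs.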
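For item 3, the Bayesian-neural-network-style decomposition splits the entropy of the ensemble's mean prediction into an expected-entropy term and a disagreement (mutual information) term. A toy sketch under that framing, with invented distributions:

    import numpy as np

    def entropy(p: np.ndarray) -> float:
        p = p[p > 0]
        return float(-(p * np.log(p)).sum())

    def decompose(clarified_preds: np.ndarray):
        # clarified_preds: (n_clarifications, n_classes), rows summing to 1.
        mean_pred = clarified_preds.mean(axis=0)
        total = entropy(mean_pred)                      # entropy of ensemble mean
        expected = np.mean([entropy(p) for p in clarified_preds])
        disagreement = total - expected                 # mutual information
        # Under the paper's framing, disagreement across clarifications tracks
        # aleatoric (input-ambiguity) uncertainty; the remainder is epistemic.
        return total, expected, disagreement

    preds = np.array([[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]])  # toy ensemble
    print(decompose(preds))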
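For item 4, byte tokenization maps every string to its UTF-8 byte ids, fixing the vocabulary at 256 symbols so that no input is out-of-vocabulary. The sketch below also shows a seeded byte permutation as a simplified stand-in for the paper's random byte mapping method:

    import random

    def to_bytes(text: str) -> list:
        # UTF-8 bytes as token ids; the vocabulary size is always 256.
        return list(text.encode("utf-8"))

    def random_byte_mapping(seed: int) -> dict:
        # A seeded permutation of byte values (a simplified illustration).
        rng = random.Random(seed)
        perm = list(range(256))
        rng.shuffle(perm)
        return dict(enumerate(perm))

    ids = to_bytes("ŋàʔ")  # characters from low-resource scripts pose no OOV issue
    mapped = [random_byte_mapping(0)[b] for b in ids]
    print(ids, mapped)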
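For item 5, the core RAG loop scores stored trail descriptions against the user query and prepends the best matches to the LLM prompt. This sketch uses a toy word-overlap scorer in place of an embedding-based retriever, and the trail snippets are invented examples, not data from the paper:

    def score(query: str, doc: str) -> float:
        # Toy relevance score: fraction of query words that appear in the doc.
        q, d = set(query.lower().split()), set(doc.lower().split())
        return len(q & d) / (len(q) or 1)

    TRAILS = [
        "Sleeping Giant Tower Trail: moderate 3.2 mi loop with a stone tower view.",
        "Bluff Point Loop: easy coastal 3.6 mi hike near Groton.",
    ]

    def build_prompt(query: str, k: int = 1) -> str:
        # Retrieve the top-k trail snippets and prepend them as context.
        top = sorted(TRAILS, key=lambda d: -score(query, d))[:k]
        return "Context:\n" + "\n".join(top) + f"\n\nUser question: {query}\nAnswer:"

    print(build_prompt("easy coastal hike"))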