Multilingual Simplification of Medical Texts

Joseph, Sebastian; Kazanas, Kathryn; Reina, Keziah; Ramanathan, Vishnesh; Xu, Wei; Wallace, Byron; Li, Junyi

doi:10.18653/v1/2023.emnlp-main.1037

Citation Details

Multilingual Simplification of Medical Texts

Automated text simplification aims to produce simple versions of complex texts. This task is especially useful in the medical domain, where the latest medical findings are typically communicated via complex and technical articles. This creates barriers for laypeople seeking access to up-to-date medical findings, consequently impeding progress on health literacy. Most existing work on medical text simplification has focused on monolingual settings, with the result that such evidence would be available only in just one language (most often, English). This work addresses this limitation via multilingual simplification, i.e., directly simplifying complex texts into simplified texts in multiple languages. We introduce MultiCochrane, the first sentence-aligned multilingual text simplification dataset for the medical domain in four languages: English, Spanish, French, and Farsi. We evaluate fine-tuned and zero-shot models across these languages with extensive human assessments and analyses. Although models can generate viable simplified texts, we identify several outstanding challenges that this dataset might be used to address. more »

Award ID(s):: 2144493 2145479

PAR ID:: 10511650

Author(s) / Creator(s):: Joseph, Sebastian; Kazanas, Kathryn; Reina, Keziah; Ramanathan, Vishnesh; Xu, Wei; Wallace, Byron; Li, Junyi

Publisher / Repository:: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

Date Published:: 2023-12-01

Journal Name:: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

Page Range / eLocation ID:: 16662 to 16692

Format(s):: Medium: X

Location:: Singapore

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2023.emnlp-main.1037

More Like this