NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Do Text Simplification Systems Preserve Meaning? A Human Evaluation via Reading Comprehension

https://doi.org/10.1162/tacl_a_00653

Agrawal, Sweta; Carpuat, Marine (January 2024, Transactions of the Association for Computational Linguistics)

Abstract Automatic text simplification (TS) aims to automate the process of rewriting text to make it easier for people to read. A pre-requisite for TS to be useful is that it should convey information that is consistent with the meaning of the original text. However, current TS evaluation protocols assess system outputs for simplicity and meaning preservation without regard for the document context in which output sentences occur and for how people understand them. In this work, we introduce a human evaluation framework to assess whether simplified texts preserve meaning using reading comprehension questions. With this framework, we conduct a thorough human evaluation of texts by humans and by nine automatic systems. Supervised systems that leverage pre-training knowledge achieve the highest scores on the reading comprehension tasks among the automatic controllable TS systems. However, even the best-performing supervised system struggles with at least 14% of the questions, marking them as “unanswerable” based on simplified content. We further investigate how existing TS evaluation metrics and automatic question-answering systems approximate the human judgments we obtained.
more » « less
Full Text Available
Sustaining Human Agency, Attending to Its Cost: An Investigation into Generative AI Design for Non-Native Speakers' Language Use

https://doi.org/10.1145/3706598.3713626

Xiao, Yimin; Hancock, Cartor; Agrawal, Sweta; Mehandru, Nikita; Salehi, Niloufar; Carpuat, Marine; Gao, Ge (April 2025, ACM)

Free, publicly-accessible full text available April 25, 2026
Designing AI-Based Language Tools for Non-Native Speakers' Language Use and Development

https://doi.org/10.1145/3706599.3721094

Xiao, Yimin (April 2025, ACM)

Free, publicly-accessible full text available April 25, 2026
Automatic Input Rewriting Improves Translation with Large Language Models

https://doi.org/10.18653/v1/2025.naacl-long.542

Ki, Dayeon; Carpuat, Marine (January 2025, Association for Computational Linguistics)

Full Text Available
Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

https://doi.org/10.18653/v1/2024.findings-naacl.265

Ki, Dayeon; Carpuat, Marine (January 2024, Association for Computational Linguistics)

Full Text Available
Explaining with Contrastive Phrasal Highlighting: A Case Study in Assisting Humans to Detect Translation Differences

https://doi.org/10.18653/v1/2023.emnlp-main.690

Briakou, Eleftheria; Goyal, Navita; Carpuat, Marine (December 2023, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing)

Explainable NLP techniques primarily explain by answering “Which tokens in the input are responsible for this prediction?”. We argue that for NLP models that make predictions by comparing two input texts, it is more useful to explain by answering “What differences between the two inputs explain this prediction?”. We introduce a technique to generate contrastive phrasal highlights that explain the predictions of a semantic divergence model via phrase alignment guided erasure. We show that the resulting highlights match human rationales of cross-lingual semantic differences better than popular post-hoc saliency techniques and that they successfully help people detect fine-grained meaning differences in human translations and critical machine translation errors.
more » « less
Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors

https://doi.org/10.18653/v1/2023.emnlp-main.712

Mehandru, Nikita; Agrawal, Sweta; Xiao, Yimin; Gao, Ge; Khoong, Elaine; Carpuat, Marine; Salehi, Niloufar (January 2023, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing)

A major challenge in the practical use of Machine Translation (MT) is that users lack information on translation quality to make informed decisions about how to rely on outputs. Progress in quality estimation research provides techniques to automatically assess MT quality, but these techniques have primarily been evaluated in vitro by comparison against human judgments outside of a specific context of use. This paper evaluates quality estimation feedback in vivo with a human study in realistic high-stakes medical settings. Using Emergency Department discharge instructions, we study how interventions based on quality estimation versus backtranslation assist physicians in deciding whether to show MT outputs to a patient. We find that quality estimation improves appropriate reliance on MT, but backtranslation helps physicians detect more clinically harmful errors that QE alone often misses.
more » « less
Bridging Background Knowledge Gaps in Translation with Automatic Explicitation

https://doi.org/10.18653/v1/2023.emnlp-main.603

Han, HyoJung; Boyd-Graber, Jordan; Carpuat, Marine (January 2023, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing)

Translations help people understand content written in another language. However, even correct literal translations do not fulfill that goal when people lack the necessary background to understand them. Professional translators incorporate explicitations to explain the missing context by considering cultural differences between source and target audiences. Despite its potential to help users, NLP research on explicitation is limited because of the dearth of adequate evaluation methods. This work introduces techniques for automatically generating explicitations, motivated by WikiExpl: a dataset that we collect from Wikipedia and annotate with human translators. The resulting explicitations are useful as they help answer questions more accurately in a multilingual question answering framework.
more » « less
Controlling Pre-trained Language Models for Grade-Specific Text Simplification

https://doi.org/10.18653/v1/2023.emnlp-main.790

Agrawal, Sweta; Carpuat, Marine (January 2023, Association for Computational Linguistics)

Full Text Available
Quality Estimation via Backtranslation at the WMT 2022 Quality Estimation Task

Sweta Agrawal, Nikita Mehandru (December 2022, Proceedings of the Seventh Conference on Machine Translation (WMT))

Full Text Available

« Prev Next »

Search for: All records