NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Explaining with Contrastive Phrasal Highlighting: A Case Study in Assisting Humans to Detect Translation Differences

https://doi.org/10.18653/v1/2023.emnlp-main.690

Briakou, Eleftheria; Goyal, Navita; Carpuat, Marine (December 2023, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing)

Explainable NLP techniques primarily explain by answering “Which tokens in the input are responsible for this prediction?”. We argue that for NLP models that make predictions by comparing two input texts, it is more useful to explain by answering “What differences between the two inputs explain this prediction?”. We introduce a technique to generate contrastive phrasal highlights that explain the predictions of a semantic divergence model via phrase alignment guided erasure. We show that the resulting highlights match human rationales of cross-lingual semantic differences better than popular post-hoc saliency techniques and that they successfully help people detect fine-grained meaning differences in human translations and critical machine translation errors.
more » « less
Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection

https://doi.org/10.1162/tacl_a_00563

Xu, Weijia; Agrawal, Sweta; Briakou, Eleftheria; Martindale, Marianna J; Carpuat, Marine (January 2023, Transactions of the Association for Computational Linguistics)

Neural sequence generation models are known to “hallucinate”, by producing outputs that are unrelated to the source text. These hallucinations are potentially harmful, yet it remains unclear in what conditions they arise and how to mitigate their impact. In this work, we first identify internal model symptoms of hallucinations by analyzing the relative token contributions to the generation in contrastive hallucinated vs. non-hallucinated outputs generated via source perturbations. We then show that these symptoms are reliable indicators of natural hallucinations, by using them to design a lightweight hallucination detector which outperforms both model-free baselines and strong classifiers based on quality estimation or large pre-trained models on manually annotated English-Chinese and German-English translation test beds.
more » « less
Full Text Available
Bridging Background Knowledge Gaps in Translation with Automatic Explicitation

https://doi.org/10.18653/v1/2023.emnlp-main.603

Han, HyoJung; Boyd-Graber, Jordan; Carpuat, Marine (January 2023, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing)

Translations help people understand content written in another language. However, even correct literal translations do not fulfill that goal when people lack the necessary background to understand them. Professional translators incorporate explicitations to explain the missing context by considering cultural differences between source and target audiences. Despite its potential to help users, NLP research on explicitation is limited because of the dearth of adequate evaluation methods. This work introduces techniques for automatically generating explicitations, motivated by WikiExpl: a dataset that we collect from Wikipedia and annotate with human translators. The resulting explicitations are useful as they help answer questions more accurately in a multilingual question answering framework.
more » « less
SimQA: Detecting Simultaneous MT Errors through Word-by-Word Question Answering

HyoJung Han; Marine Carpuat; Jordan Boyd-Graber (December 2022, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
Can Synthetic Translations Improve Bitext Quality?

https://doi.org/10.18653/v1/2022.acl-long.326

Briakou, Eleftheria; Carpuat, Marine (May 2022, Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

Synthetic translations have been used for a wide range of NLP tasks primarily as a means of data augmentation. This work explores, instead, how synthetic translations can be used to revise potentially imperfect reference translations in mined bitext. We find that synthetic samples can improve bitext quality without any additional bilingual supervision when they replace the originals based on a semantic equivalence classifier that helps mitigate NMT noise. The improved quality of the revised bitext is confirmed intrinsically via human evaluation and extrinsically through bilingual induction and MT tasks.
more » « less
Full Text Available
Beyond Noise: Mitigating the Impact of Fine-grained Semantic Divergences on Neural Machine Translation

https://doi.org/10.18653/v1/2021.acl-long.562

Briakou, Eleftheria; Carpuat, Marine (January 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers))

Full Text Available
Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank

https://doi.org/10.18653/v1/2020.emnlp-main.121

Briakou, Eleftheria; Carpuat, Marine (November 2020, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP))
null (Ed.)
Detecting fine-grained differences in content conveyed in different languages matters for cross-lingual NLP and multilingual corpora analysis, but it is a challenging machine learning problem since annotation is expensive and hard to scale. This work improves the prediction and annotation of fine-grained semantic divergences. We introduce a training strategy for multilingual BERT models by learning to rank synthetic divergent examples of varying granularity. We evaluate our models on the Rationalized English-French Semantic Divergences, a new dataset released with this work, consisting of English-French sentence-pairs annotated with semantic divergence classes and token-level rationales. Learning to rank helps detect fine-grained sentence-level divergences more accurately than a strong sentence-level similarity model, while token-level predictions have the potential of further distinguishing between coarse and fine-grained divergences.
more » « less
Full Text Available
Weakly Supervised Cross-lingual Semantic Relation Classification via Knowledge Distillation

https://doi.org/10.18653/v1/D19-1532

Vyas, Yogarshi; Carpuat, Marine (November 2019, 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP))

Full Text Available

Search for: All records