Improving Simultaneous Translation by Incorporating Pseudo-References with Fewer Reorderings

Chen, Junkun; Zheng, Renjie; Kita, Atsuhito; Ma, Mingbo; Huang, Liang

doi:10.18653/v1/2021.emnlp-main.473

Citation Details

Improving Simultaneous Translation by Incorporating Pseudo-References with Fewer Reorderings

Simultaneous translation is vastly different from full-sentence translation, in the sense that it starts translation before the source sentence ends, with only a few words delay. However, due to the lack of large-scale, high-quality simultaneous translation datasets, most such systems are still trained on conventional full-sentence bitexts. This is far from ideal for the simultaneous scenario due to the abundance of unnecessary long-distance reorderings in those bitexts. We propose a novel method that rewrites the target side of existing full-sentence corpora into simultaneous-style translation. Experiments on Zh→En and Ja→En simultaneous translation show substantial improvements (up to +2.7 BLEU) with the addition of these generated pseudo-references. more »

Award ID(s):: 2009071 1817231

PAR ID:: 10398231

Author(s) / Creator(s):: Chen, Junkun; Zheng, Renjie; Kita, Atsuhito; Ma, Mingbo; Huang, Liang

Date Published:: 2021-01-01

Journal Name:: Proceedings of EMNLP 2021

Page Range / eLocation ID:: 5857 to 5864

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2021.emnlp-main.473

More Like this