NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Augmented Spanish-Persian Neural Machine Translation [Augmented Spanish-Persian Neural Machine Translation]

https://doi.org/10.5220/0010369804820488

Ahmadnia, Benyamin; Aranovich, Raul (January 2021, Proceedings of the 13th International Conference on Agents and Artificial Intelligence (ICAART 2021))
null (Ed.)
Neural Machine Translation (NMT) performs training of a neural network employing an encoder-decoder architecture. However, the quality of the neural-based translations predominantly depends on the availability of a large amount of bilingual training dataset. In this paper, we explore the performance of translations predicted by attention-based NMT systems for Spanish to Persian low-resource language pairs. We analyze the errors of NMT systems that occur in the Persian language and provide an in-depth comparison of the performance of the system based on variations in sentence length and size of the training dataset. We evaluate our translation results using BLEU and human evaluation measures based on the adequacy, fluency, and overall rating.
more » « less
Full Text Available
Beyond NVD: Cybersecurity meets the Semantic Web.

https://doi.org/10.1145/3498891.3501259

Aranovich, Raúl; Wu, Muting; Yu, Dian; Katsy, Katya; Ahmadnia, Benyamin; Bishop, Matthew; Filkov, Vladimir; Sagae, Kenji (October 2021, NSPW '21: New Security Paradigms Workshop)

Full Text Available
Strengthening Low-resource Neural Machine Translation through Joint Learning: The Case of Farsi-Spanish [Strengthening Low-resource Neural Machine Translation through Joint Learning: The Case of Farsi-Spanish]

https://doi.org/10.5220/0010362604750481

Ahmadnia, Benyamin; Aranovich, Raul; Dorr, Bonnie (January 2021, Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 1: NLPinAI)
null (Ed.)
This paper describes a systematic study of an approach to Farsi-Spanish low-resource Neural Machine Translation (NMT) that leverages monolingual data for joint learning of forward and backward translation models. As is standard for NMT systems, the training process begins using two pre-trained translation models that are iteratively updated by decreasing translation costs. In each iteration, either translation model is used to translate monolingual texts from one language to another, to generate synthetic datasets for the other translation model. Two new translation models are then learned from bilingual data along with the synthetic texts. The key distinguishing feature between our approach and standard NMT is an iterative learning process that improves the performance of both translation models, simultaneously producing a higher-quality synthetic training dataset upon each iteration. Our empirical results demonstrate that this approach outperforms baselines.
more » « less
Full Text Available
Impact of Filtering Generated Pseudo Bilingual Texts in Low-Resource Neural Machine Translation Enhancement: The Case of Persian-Spanish

https://doi.org/10.1016/j.procs.2021.05.093

Ahmadnia, Benyamin; Dorr, Bonnie J.; Aranovich, Raul (January 2021, Procedia Computer Science)
null (Ed.)
Full Text Available

Search for: All records