Neural machine translation for low-resource languages: A Survey
Loading...
Date
2023
Journal Title
Journal ISSN
Volume Title
Publisher
Association for Computing Machinery
Abstract
Neural Machine Translation (NMT) has seen tremendous growth in the last ten years since the early 2000s and has already entered a mature phase. While considered the most widely used solution for Machine Translation, its performance on low-resource language pairs remains sub-optimal compared to the high-resource counterparts due to the unavailability of large parallel corpora. Therefore, the implementation of NMT techniques for low-resource language pairs has been receiving the spotlight recently, thus leading to substantial research on this topic. This article presents a detailed survey of research advancements in low-resource language NMT (LRL-NMT) and quantitative analysis to identify the most popular techniques. We provide guidelines to select the possible NMT technique for a given LRL data setting based on our findings. We also present a holistic view of the LRL-NMT research landscape and provide recommendations to enhance the research efforts further.
Description
Keywords
Low-Resource Languages, Unsupervised NMT, Neural Machine Translation, Semi-supervised NMT, Multilingual NMT, Transfer Learning, Data Augmentation, Zero-shot Translation, Pivoting
Citation
Ranathunga, S., Lee, E.-S. A., Prifti Skenduli, M., Shekhar, R., Alam, M., & Kaur, R. (2023). Neural Machine Translation for Low-resource Languages: A Survey. ACM Computing Surveys, 55(11), 229 (1-37). https://doi.org/10.1145/3567592