
Article Information

  • Title: Improving English-to-Indian Language Neural Machine Translation Systems
  • Authors: Akshara Kandimalla; Pintu Lohar; Souvik Kumar Maji
  • Journal: Information
  • Electronic ISSN: 2078-2489
  • Publication year: 2022
  • Volume: 13
  • Issue: 5
  • Pages: 245
  • DOI: 10.3390/info13050245
  • Language: English
  • Publisher: MDPI Publishing
  • Abstract: Most Indian languages lack sufficient parallel data for Machine Translation (MT) training. In this study, we build English-to-Indian language Neural Machine Translation (NMT) systems using the state-of-the-art transformer architecture. In addition, we investigate the utility of back-translation and its effect on system performance. Our experimental evaluation reveals that back-translation helps to improve the BLEU scores of both the English-to-Hindi and English-to-Bengali NMT systems. We also observe that back-translation is more useful for improving the quality of weaker baseline MT systems. Finally, we perform a manual evaluation of the translation outputs and observe that the BLEU metric cannot always assess MT quality as well as humans can. Our analysis shows that the MT outputs for the English–Bengali pair are actually better than their BLEU scores suggest.