首页    期刊浏览 2024年11月30日 星期六
登录注册

文章基本信息

  • 标题:How to Translate Dialects: A Segmentation-Centric Pivot Translation Approach
  • 本地全文:下载
  • 作者:Michael Paul ; Andrew Finch ; Eiichiro Sumita
  • 期刊名称:Information and Media Technologies
  • 电子版ISSN:1881-0896
  • 出版年度:2013
  • 卷号:8
  • 期号:4
  • 页码:1166-1186
  • DOI:10.11185/imt.8.1166
  • 出版社:Information and Media Technologies Editorial Board
  • 摘要:Recent research on multilingual statistical machine translation (SMT) focuses on the usage of pivot languages in order to overcome resource limitations for certain language pairs. This paper proposes a new method to translate a dialect language into a foreign language by integrating transliteration approaches based on Bayesian alignment (BA) models with pivot-based SMT approaches. The advantages of the proposed method with respect to standard SMT approaches are threefold: (1) it uses a standard language as the pivot language and acquires knowledge about the relation between dialects and a standard language automatically, (2) it avoids segmentation mismatches between the input and the translation model by mapping the character sequences of the dialect language to the word segmentation of the standard language, and (3) it reduces the translation task complexity by using monotone decoding techniques. Experiment results translating five Japanese dialects (Kumamoto, Kyoto, Nagoya, Okinawa, Osaka) into four Indo-European languages (English, German, Russian, Hindi) and two Asian languages (Chinese, Korean) revealed that the proposed method improves the translation quality of dialect translation tasks and outperforms standard pivot translation approaches concatenating SMT engines for the majority of the investigated language pairs.
  • 关键词:Dialect Languages;Pivot Translation;Word Segmentation
国家哲学社会科学文献中心版权所有