首页    期刊浏览 2024年12月03日 星期二
登录注册

文章基本信息

  • 标题:Analyzing Error Types in English-Czech Machine Translation
  • 作者:Ondřej Bojar
  • 期刊名称:The Prague Bulletin of Mathematical Linguistics
  • 印刷版ISSN:0032-6585
  • 电子版ISSN:1804-0462
  • 出版年度:2011
  • 卷号:95
  • 期号:1
  • 页码:63-76
  • DOI:10.2478/v10108-011-0005-2
  • 语种:English
  • 出版社:Walter de Gruyter GmbH
  • 摘要:This paper examines two techniques of manual evaluation that can be used to identify error types of individual machine translation systems. The first technique of "blind post-editing" is being used in WMT evaluation campaigns since 2009 and manually constructed data of this type are available for various language pairs. The second technique of explicit marking of errors has been used in the past as well. We propose a method for interpreting blind post-editing data at a finer level and compare the results with explicit marking of errors. While the human annotation of either of the techniques is not exactly reproducible (relatively low agreement), both techniques lead to similar observations of differences of the systems. Specifically, we are able to suggest which errors in MT output are easy and hard to correct with no access to the source, a situation experienced by users who do not understand the source language.
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有