首页    期刊浏览 2024年12月12日 星期四
登录注册

文章基本信息

  • 标题:Summarising Historical Text in Modern Languages
  • 本地全文:下载
  • 作者:Xutan Peng ; Yi Zheng ; Chenghua Lin
  • 期刊名称:Conference on European Chapter of the Association for Computational Linguistics (EACL)
  • 出版年度:2021
  • 卷号:2021
  • 页码:3123-3142
  • DOI:10.18653/v1/2021.eacl-main.273
  • 语种:English
  • 出版社:ACL Anthology
  • 摘要:We introduce the task of historical text summarisation, where documents in historical forms of a language are summarised in the corresponding modern language. This is a fundamentally important routine to historians and digital humanities researchers but has never been automated. We compile a high-quality gold-standard text summarisation dataset, which consists of historical German and Chinese news from hundreds of years ago summarised in modern German or Chinese. Based on cross-lingual transfer learning techniques, we propose a summarisation model that can be trained even with no cross-lingual (historical to modern) parallel data, and further benchmark it against state-of-the-art algorithms. We report automatic and human evaluations that distinguish the historic to modern language summarisation task from standard cross-lingual summarisation (i.e., modern to modern language), highlight the distinctness and value of our dataset, and demonstrate that our transfer learning approach outperforms standard cross-lingual benchmarks on this task.
国家哲学社会科学文献中心版权所有