Basic Article Information

  • Title: Representing Discourse Coherence: A Corpus-Based Study
  • Authors: Florian Wolf; Edward Gibson
  • Journal: Computational Linguistics
  • Print ISSN: 0891-2017
  • Electronic ISSN: 1530-9312
  • Publication year: 2005
  • Volume: 31
  • Issue: 2
  • Pages: 249-287
  • DOI: 10.1162/0891201054223977
  • Language: English
  • Publisher: MIT Press
  • Abstract: This article aims to present a set of discourse structure relations that are easy to code and to develop criteria for an appropriate data structure for representing these relations. Discourse structure here refers to informational relations that hold between sentences in a discourse. The set of discourse relations introduced here is based on Hobbs (1985). We present a method for annotating discourse coherence structures that we used to manually annotate a database of 135 texts from the Wall Street Journal and the AP Newswire. All texts were independently annotated by two annotators. Kappa values of greater than 0.8 indicated good interannotator agreement. We furthermore present evidence that trees are not a descriptively adequate data structure for representing discourse structure: In coherence structures of naturally occurring texts, we found many different kinds of crossed dependencies, as well as many nodes with multiple parents. The claims are supported by statistical results from our hand-annotated database of 135 texts.
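
The two quantitative ideas in the abstract, interannotator agreement and nodes with multiple parents, can be illustrated with a minimal Python sketch. It is not the authors' code. It assumes that agreement is measured with Cohen's kappa over per-item relation labels (the abstract says only "Kappa") and that a coherence structure is stored as directed (parent, child, relation) triples. The function names, the edge direction, and the toy data are all hypothetical.

```python
from collections import Counter, defaultdict


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same items.

    Assumption: the abstract does not say which kappa variant was used.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equal-length, non-empty label sequences")
    n = len(labels_a)
    # Proportion of items on which the two annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each annotator's label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[k] * freq_b[k] for k in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used a single identical label throughout
    return (observed - expected) / (1 - expected)


def nodes_with_multiple_parents(edges):
    """Return the nodes that have more than one distinct parent.

    edges: iterable of (parent, child, relation) triples (hypothetical format).
    A tree allows at most one parent per node, so any node returned here
    means the structure cannot be represented as a tree.
    """
    parents = defaultdict(set)
    for parent, child, _relation in edges:
        parents[child].add(parent)
    return sorted(child for child, ps in parents.items() if len(ps) > 1)


# Invented toy data, not taken from the annotated corpus.
annotator_1 = ["elaboration", "elaboration", "cause-effect", "similarity"]
annotator_2 = ["elaboration", "elaboration", "cause-effect", "cause-effect"]
kappa = cohen_kappa(annotator_1, annotator_2)

toy_edges = [
    (1, 2, "elaboration"),
    (1, 3, "cause-effect"),
    (2, 3, "elaboration"),  # segment 3 now has two parents: 1 and 2
]
multi_parent = nodes_with_multiple_parents(toy_edges)
```

In this toy structure segment 3 is attached to both segment 1 and segment 2, which is the kind of configuration that a graph with labelled directed arcs can store directly and a tree cannot.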