
Article Information

  • Title: A New Method for Calculating Similarity between Sentences and Application on Automatic Abstracting
  • Authors: Wenqian JI; Zhoujun LI; Wenhan CHAO
  • Journal: Intelligent Information Management
  • Print ISSN: 2150-8194
  • Electronic ISSN: 2150-8208
  • Publication year: 2009
  • Volume: 1
  • Issue: 1
  • Pages: 36-42
  • DOI: 10.4236/iim.2009.11007
  • Publisher: Scientific Research Publishing
  • Abstract: Sentence similarity computation plays an important role in machine question-answering systems, machine translation systems, information retrieval, and automatic abstracting systems. This article first summarizes several methods for calculating the similarity between sentences, then proposes a new method that jointly considers critical words, semantic information, sentential form, and sentence length. On this basis, an automatic abstracting system based on the LexRank algorithm is implemented, with several improvements in both sentence weight computation and redundancy resolution. The system can handle single-document and multi-document summarization in both English and Chinese. In evaluations on two corpora, the system produced better summaries to a certain degree. We also show that the system is quite insensitive to noise in the data that may result from imperfect topical clustering of documents. Finally, existing problems and development trends in automatic summarization technology are discussed.
  • Keywords: sentence similarity; automatic abstracting; LexRank; sentence-weight computing; redundancy resolution