首页    期刊浏览 2025年03月03日 星期一
登录注册

文章基本信息

  • 标题:A Graph-Theoretic Framework for Semantic Distance
  • 本地全文:下载
  • 作者:Vivian Tsang ; Suzanne Stevenson
  • 期刊名称:Computational Linguistics
  • 印刷版ISSN:0891-2017
  • 电子版ISSN:1530-9312
  • 出版年度:2010
  • 卷号:36
  • 期号:1
  • 页码:31-69
  • DOI:10.1162/coli.2010.36.1.36101
  • 语种:English
  • 出版社:MIT Press
  • 摘要:Many NLP applications entail that texts are classified based on their semantic distance (how similar or different the texts are). For example, comparing the text of a new document to that of documents of known topics can help identify the topic of the new text. Typically, a distributional distance is used to capture the implicit semantic distance between two pieces of text. However, such approaches do not take into account the semantic relations between words. In this article, we introduce an alternative method of measuring the semantic distance between texts that integrates distributional information and ontological knowledge within a network flow formalism. We first represent each text as a collection of frequency-weighted concepts within an ontology. We then make use of a network flow method which provides an efficient way of explicitly measuring the frequency-weighted ontological distance between the concepts across two texts. We evaluate our method in a variety of NLP tasks, and find that it performs well on two of three tasks. We develop a new measure of semantic coherence that enables us to account for the performance difference across the three data sets, shedding light on the properties of a data set that lends itself well to our method.
国家哲学社会科学文献中心版权所有