首页    期刊浏览 2024年12月04日 星期三
登录注册

文章基本信息

  • 标题:Web Search Engine Based Semantic Similarity Measure Between Words Using Pattern Retrieval Algorithm
  • 本地全文:下载
  • 作者:Pushpa C N ; Thriveni J ; Venugopal K R
  • 期刊名称:Computer Science & Information Technology
  • 电子版ISSN:2231-5403
  • 出版年度:2013
  • 卷号:3
  • 期号:1
  • 页码:01-11
  • DOI:10.5121/csit.2013.3101
  • 出版社:Academy & Industry Research Collaboration Center (AIRCC)
  • 摘要:Semantic Similarity measures plays an important role in information retrieval, natural language processing and various tasks on web such as relation extraction, community mining, document clustering, and automatic meta-data extraction. In this paper, we have proposed a Pattern Retrieval Algorithm [PRA] to compute the semantic similarity measure between the words by combining both page count method and web snippets method. Four association measures are used to find semantic similarity between words in page count method using web search engines. We use a Sequential Minimal Optimization (SMO) support vector machines (SVM) to find the optimal combination of page counts-based similarity scores and top-ranking patterns from the web snippets method. The SVM is trained to classify synonymous word-pairs and non- synonymous word-pairs. The proposed approach aims to improve the Correlation values, Precision, Recall, and F-measures, compared to the existing methods. The proposed algorithm outperforms by 89.8 % of correlation value.
  • 关键词:Information Retrieval; Semantic Similarity; Support Vector Machine; Web Mining; Web Search ;Engine; Web Snippets
国家哲学社会科学文献中心版权所有