首页    期刊浏览 2024年12月02日 星期一
登录注册

文章基本信息

  • 标题:A Novel Kernel for Text Classification Based on Semantic and Statistical Information
  • 作者:Yao, Haipeng ; Zhang, Bo ; Zhang, Peiying
  • 期刊名称:COMPUTING AND INFORMATICS
  • 印刷版ISSN:1335-9150
  • 出版年度:2018
  • 卷号:37
  • 期号:4
  • 页码:992-1010
  • 语种:English
  • 出版社:COMPUTING AND INFORMATICS
  • 其他摘要:In text categorization, a document is usually represented by a vector space model which can accomplish the classification task, but the model cannot deal with Chinese synonyms and polysemy phenomenon. This paper presents a novel approach which takes into account both the semantic and statistical information to improve the accuracy of text classification. The proposed approach computes semantic information based on HowNet and statistical information based on a kernel function with class-based weighting. According to our experimental results, the proposed approach could achieve state-of-the-art or competitive results as compared with traditional approaches such as the k-Nearest Neighbor (KNN), the Naive Bayes and deep learning models like convolutional networks.
  • 关键词:Text categorization; semantic information; statistical information; support vector machine
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有