首页    期刊浏览 2024年12月04日 星期三
登录注册

文章基本信息

  • 标题:Genetic Algorithm and Confusion Matrix for Document Clustering
  • 本地全文:下载
  • 作者:A. K. Santra ; C. Josephine Christy
  • 期刊名称:International Journal of Computer Science Issues
  • 印刷版ISSN:1694-0784
  • 电子版ISSN:1694-0814
  • 出版年度:2012
  • 卷号:9
  • 期号:1
  • 出版社:IJCSI Press
  • 摘要:Text mining is one of the most important tools in Information Retrieval. Text clustering is the process of classifying documents into predefined categories according to their content. Existing supervised learning algorithms to automatically classify text requires sufficient documentation to learn exactly. In this paper, Niching memetic algorithm and Genetic algorithm (GA) is presented in which feature selection an integral part of the global clustering search procedure that attempts to overcome the problem of finding optimal solutions at the local less promising in both clustering and feature selection. The concept of confusion matrix is then used for derivative works, and finally, hybrid GA is included for the final classification. Experimental results show benefits by using the proposed method which evaluates F-measure, purity and results better performance in terms of False positive, False negative, True positive and True negative.
  • 关键词:Text mining; GA; Confusion matrix; F;measure
国家哲学社会科学文献中心版权所有