Article Information

  • Title: Improving Performance of the k-Nearest Neighbor Classifier by Combining Feature Selection with Feature Weighting
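  • Authors: Yongguang Bao ; Xiaoyong Du ; Naohiro Ishii
  • Journal: 人工知能学会論文誌 (Transactions of the Japanese Society for Artificial Intelligence)
  • Print ISSN: 1346-0714
  • Online ISSN: 1346-8030
  • Year of publication: 2002
  • Volume: 17
  • Issue: 3
  • Pages: 209-216
  • DOI: 10.1527/tjsai.17.209
  • Publisher: The Japanese Society for Artificial Intelligence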
  • Abstract: The k-nearest neighbor (k-NN) classifier is a simple and effective classification approach. However, it is overly sensitive to irrelevant and noisy features. There are two ways to reduce this sensitivity: one is to assign each feature a weight, and the other is to select a subset of relevant features. Existing research has shown that both approaches can improve generalization accuracy, but it is impossible to predict which one will work better for a specific dataset. In this paper, we propose an algorithm that improves the effectiveness of k-NN by combining the two approaches. Specifically, we first select all relevant features and then assign a weight to each of them. Experiments were conducted on 14 datasets from the UCI Machine Learning Repository, and the results show that our algorithm achieves the highest or near-highest accuracy on all test datasets. It increases generalization accuracy by 8.68% on average. It also achieves higher generalization accuracy than the well-known machine learning algorithms IB1-4 and C4.5.
  • Keywords: machine learning ; classification ; feature selection ; feature weighting ; rough sets ; information theory
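
The two-stage idea in the abstract (select relevant features first, then weight the survivors) can be illustrated with a minimal sketch. This is not the authors' algorithm: the keywords point to rough sets and information theory, whereas the sketch below assumes plain mutual information for both the selection and the weighting step, discrete features, and an overlap distance. The function names, the `threshold` parameter, and the default `k` are all hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def mutual_information(column, labels):
    """I(feature; class) for one discrete feature column."""
    n = len(labels)
    conditional = 0.0
    for value in set(column):
        subset = [y for x, y in zip(column, labels) if x == value]
        conditional += len(subset) / n * entropy(subset)
    return entropy(labels) - conditional


def select_and_weight(X, y, threshold=0.01):
    """Stage 1: keep features scoring above `threshold` (an assumed cutoff).
    Stage 2: give each kept feature a weight proportional to its score."""
    scores = [mutual_information([row[j] for row in X], y)
              for j in range(len(X[0]))]
    selected = [j for j, s in enumerate(scores) if s > threshold]
    total = sum(scores[j] for j in selected)
    return {j: scores[j] / total for j in selected}


def knn_predict(X, y, weights, query, k=3):
    """Majority vote among the k training rows closest to `query`,
    using a weighted overlap distance over the selected features only."""
    def distance(row):
        return sum(w for j, w in weights.items() if row[j] != query[j])

    nearest = sorted(range(len(X)), key=lambda i: distance(X[i]))[:k]
    return Counter(y[i] for i in nearest).most_common(1)[0][0]


# Toy data (made up for illustration): feature 0 determines the class,
# feature 1 is pure noise.
X = [(0, 0), (0, 1), (1, 0), (1, 1), (0, 0), (0, 1), (1, 0), (1, 1)]
y = ["a", "a", "b", "b", "a", "a", "b", "b"]
weights = select_and_weight(X, y)
prediction = knn_predict(X, y, weights, query=(1, 0))
```

On this toy data the noise feature carries no information about the class, so it is dropped in the first stage and never enters the distance computation. With several informative features, the second stage would additionally scale each one's contribution to the distance.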