Article Information

Title: Improving Performance of the k-Nearest Neighbor Classifier by Combining Feature Selection with Feature Weighting
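Authors: Yongguang Bao; Xiaoyong Du; Naohiro Ishii et al.
Journal: 人工知能学会論文誌 (Transactions of the Japanese Society for Artificial Intelligence)
Print ISSN: 1346-0714
Online ISSN: 1346-8030
Year: 2002
Volume: 17
Issue: 3
Pages: 209-216
DOI: 10.1527/tjsai.17.209
Publisher: The Japanese Society for Artificial Intelligence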
Abstract: k-nearest neighbor (k-NN) classification is a simple and effective classification approach. However, it is overly sensitive to irrelevant and noisy features. There are two ways to reduce this sensitivity: one is to assign each feature a weight, and the other is to select a subset of relevant features. Existing research has shown that both approaches can improve generalization accuracy, but it is impossible to predict which one is better for a specific dataset. In this paper, we propose an algorithm that improves the effectiveness of k-NN by combining the two approaches. Specifically, we first select all relevant features and then assign a weight to each of them. Experiments were conducted on 14 datasets from the UCI Machine Learning Repository, and the results show that our algorithm achieves the highest or near-highest accuracy on all test datasets. On average, it increases generalization accuracy by 8.68%. It also achieves higher generalization accuracy than the well-known machine learning algorithms IB1-4 and C4.5.
Keywords: machine learning; classification; feature selection; feature weighting; rough sets; information theory
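
The two-stage idea described in the abstract (first select the relevant features, then weight each selected feature before running k-NN) can be illustrated with a minimal sketch. This is not the authors' procedure: the record gives no algorithmic detail, so the sketch assumes discrete features, uses information gain both as the relevance test and as the weight, and uses a weighted overlap distance. The function names, the `threshold` parameter, and the toy data are all hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(column, labels):
    """Reduction in label entropy after splitting on one discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(column, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def select_and_weight(X, y, threshold=0.0):
    """Stage 1 (assumed rule): keep features whose gain exceeds `threshold`.
    Stage 2 (assumed rule): use each kept feature's gain as its weight.
    Returns a dict mapping feature index -> weight."""
    gains = [information_gain([row[j] for row in X], y) for j in range(len(X[0]))]
    return {j: g for j, g in enumerate(gains) if g > threshold}


def knn_predict(X, y, weights, query, k=3):
    """Majority vote among the k nearest rows under a weighted overlap distance:
    each selected feature that differs from the query adds its weight."""
    def distance(row):
        return sum(w for j, w in weights.items() if row[j] != query[j])

    nearest = sorted(range(len(X)), key=lambda i: distance(X[i]))[:k]
    return Counter(y[i] for i in nearest).most_common(1)[0][0]


# Toy data: feature 0 determines the class, feature 1 is pure noise.
X = [[0, 0], [0, 1], [1, 0], [1, 1]]
y = ["a", "a", "b", "b"]
weights = select_and_weight(X, y)           # feature 1 has zero gain and is dropped
prediction = knn_predict(X, y, weights, [1, 0], k=3)
```

On the toy data the noisy feature is discarded in the selection stage, so the distance depends only on the informative feature. A real implementation would need a relevance criterion and weighting scheme chosen for the data at hand, and handling for continuous features.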