Article Information

  • Title: Imbalanced Data SVM Classification Method Based on Cluster Boundary Sampling and DT-KNN Pruning
  • Authors: Li Peng; Yu Xiao-yang; Bi Ting-ting
  • Journal: International Journal of Signal Processing, Image Processing and Pattern Recognition
  • Print ISSN: 2005-4254
  • Year of publication: 2014
  • Volume: 7
  • Issue: 2
  • Pages: 61-68
  • DOI: 10.14257/ijsip.2014.7.2.06
  • Publisher: SERSC
  • Abstract: This paper presents an SVM classification method for imbalanced data sets based on cluster boundary sampling and sample pruning. It addresses the imbalanced classification problem from two directions: data re-sampling and algorithm improvement. First, we propose cluster boundary sampling, which uses a clustering density threshold and a boundary density threshold to determine the cluster boundaries, so that these boundaries can guide re-sampling more accurately. Second, we put forward a new sample pruning algorithm based on dynamic-threshold KNN (DT-KNN) to deal with data complexity and class overlap in imbalanced data sets, both of which reduce the classification performance and generalization ability of an SVM classifier. Experiments show that the method clearly improves classification performance on a variety of imbalanced data sets, which supports its validity and stability.
  • Keywords: Imbalanced Data Sets; Support Vector Machine; Cluster Sampling; Sample Pruning; Classification
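
The pruning idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' DT-KNN algorithm: the abstract does not say how the dynamic threshold is computed, so the hypothetical function `knn_prune` below takes a fixed `threshold` parameter, prunes only majority-class samples, and uses plain Euclidean distance. All names and parameters are assumptions made for illustration.

```python
import math
from collections import Counter


def knn_prune(points, labels, k=3, threshold=0.5):
    """Return indices of samples kept after a simple KNN overlap pruning.

    Illustrative assumption, not the paper's method: a majority-class
    sample is dropped when the fraction of its k nearest neighbours that
    belong to another class reaches `threshold`.
    """
    majority = Counter(labels).most_common(1)[0][0]
    keep = []
    for i, (point, label) in enumerate(zip(points, labels)):
        if label != majority:
            keep.append(i)  # minority samples are never pruned in this sketch
            continue
        # Indices of the k nearest other samples by Euclidean distance.
        neighbours = sorted(
            (j for j in range(len(points)) if j != i),
            key=lambda j: math.dist(point, points[j]),
        )[:k]
        foreign = sum(1 for j in neighbours if labels[j] != majority)
        # Keep the sample unless its neighbourhood is dominated by other classes.
        if not neighbours or foreign / len(neighbours) < threshold:
            keep.append(i)
    return keep


# Five majority samples near the origin, one majority sample (index 5)
# sitting inside a small minority cluster around (5, 5).
points = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5),
          (5, 5), (5, 4.9), (5.1, 5), (4.9, 5)]
labels = [0, 0, 0, 0, 0, 0, 1, 1, 1]
kept = knn_prune(points, labels, k=3, threshold=0.5)
```

In this toy data set only the majority sample at index 5 is surrounded by minority neighbours, so it is the one removed before an SVM would be trained on the remaining samples.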