首页    期刊浏览 2024年12月05日 星期四
登录注册

文章基本信息

  • 标题:DTO-SMOTE: Delaunay Tessellation Oversampling for Imbalanced Data Sets
  • 本地全文:下载
  • 作者:Alexandre M. de Carvalho ; Ronaldo C. Prati
  • 期刊名称:Information
  • 电子版ISSN:2078-2489
  • 出版年度:2020
  • 卷号:11
  • 期号:12
  • 页码:557-578
  • DOI:10.3390/info11120557
  • 出版社:MDPI Publishing
  • 摘要:One of the significant challenges in machine learning is the classification of imbalanced data. In many situations, standard classifiers cannot learn how to distinguish minority class examples from the others. Since many real problems are unbalanced, this problem has become very relevant and deeply studied today. This paper presents a new preprocessing method based on Delaunay tessellation and the preprocessing algorithm SMOTE (Synthetic Minority Over-sampling Technique), which we call DTO-SMOTE (Delaunay Tessellation Oversampling SMOTE). DTO-SMOTE constructs a mesh of simplices (in this paper, we use tetrahedrons) for creating synthetic examples. We compare results with five preprocessing algorithms (GEOMETRIC-SMOTE, SVM-SMOTE, SMOTE-BORDERLINE-1, SMOTE-BORDERLINE-2, and SMOTE), eight classification algorithms, and 61 binary-class data sets. For some classifiers, DTO-SMOTE has higher performance than others in terms of Area Under the ROC curve (AUC), Geometric Mean (GEO), and Generalized Index of Balanced Accuracy (IBA).
  • 关键词:machine learning; SMOTE; oversampling; DTO-SMOTE; imbalanced data machine learning ; SMOTE ; oversampling ; DTO-SMOTE ; imbalanced data
国家哲学社会科学文献中心版权所有