首页    期刊浏览 2024年12月03日 星期二
登录注册

文章基本信息

  • 标题:Performance Analysis of Machine Learning Algorithms for Missing Value Imputation
  • 作者:Nadzurah Zainal Abidin ; Amelia Ritahani Ismail ; Nurul A. Emran
  • 期刊名称:International Journal of Advanced Computer Science and Applications(IJACSA)
  • 印刷版ISSN:2158-107X
  • 电子版ISSN:2156-5570
  • 出版年度:2018
  • 卷号:9
  • 期号:6
  • DOI:10.14569/IJACSA.2018.090660
  • 出版社:Science and Information Society (SAI)
  • 摘要:Data mining requires a pre-processing task in which the data are prepared, cleaned, integrated, transformed, reduced and discretized for ensuring the quality. Missing values is a universal problem in many research domains that is commonly encountered in the data cleaning process. Missing values usually occur when a value of stored data absent for a variable of an observation. Missing values problem imposes undesirable effect on analysis results, especially when it leads to biased parameter estimates. Data imputation is a common way to deal with missing values where the missing value’s substitutes are discovered through statistical or machine learning techniques. Nevertheless, examining the strengths (and limitations) of these techniques is important to aid understanding its characteristics. In this paper, the performance of three machine learning classifiers (K-Nearest Neighbors (KNN), Decision Tree, and Bayesian Networks) are compared in terms of data imputation accuracy. The results shows that among the three classifiers, Bayesian has the most promising performance.
  • 关键词:Data Mining; Imputation; Machine Learning; KNearest Neighbors; Decision Tree; Bayesian Networks
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有