期刊名称:International Journal of Advanced Research In Computer Science and Software Engineering
印刷版ISSN:2277-6451
电子版ISSN:2277-128X
出版年度:2013
卷号:3
期号:5
出版社:S.S. Mishra
摘要:The presence of missing data in a datasets can affect the performance of classifier which leads to difficulty of extracting useful information from datasets .Dataset taken for this study is student records of university system that contains some missing values. To compute these missing values three technique are used named as Litwise deletion, Mean/mode imputation and KNN imputation, which result in imputed datasets. On these resulting datasets C4.5 classification algorithm is applied individually. This work analyzes the performance of imputation methods using C4.5 classifier on the basis of accuracy for handling missing data. Weka data mining tool is used for this experimental