
Basic article information

  • Title: An Efficient Implementation of Re-Sampling Technique for High Performance Multiple Classifier Systems
  • Authors: Sathiyabama, S.; Thyagarajah, K.; Ayyamuthukumar, D.
  • Journal: Journal of Computer Science
  • Print ISSN: 1549-3636
  • Year: 2007
  • Volume: 3
  • Issue: 4
  • Pages: 195-198
  • DOI: 10.3844/jcssp.2007.195.198
  • Publisher: Science Publications
  • Abstract: Because of the large size of the database, the entire training dataset cannot be used to construct the classifiers. One popular solution is to separate the stream data into chunks, learn a base classifier from each chunk, and then integrate all base classifiers to form a multiple classifier system (MCS). Sometimes these chunks do not include all the classes in the same proportions as the entire training dataset. We therefore introduce a re-sampling method based on the statistical value of the class attribute. In the proposed method, the probability of occurrence of every class is estimated over the entire training dataset. Based on these probabilities, a threshold is fixed for each class. When a sample is selected at random, the class probabilities within the sample are checked against the thresholds. A sample that satisfies all the thresholds is used to construct the model. Otherwise, re-sampling is performed, and the process is repeated until the sample satisfies the thresholds for all classes. The proposed method yields higher accuracy than random sampling without class thresholds. We also compare the accuracy of different classifiers. Experimental results and comparative studies demonstrate the efficiency and efficacy of our method.
  • Keywords: accuracy; classifier; Euclidean distance; sampling; threshold; normalization
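
The re-sampling loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how the thresholds are derived from the class probabilities, so the rule used here (each class must reach at least `(1 - tolerance)` times its overall proportion) is an assumption, as are the function names, the `tolerance` and `max_tries` parameters, and the fixed random seed.

```python
import random
from collections import Counter


def class_probabilities(labels):
    """Return the relative frequency of each class label."""
    counts = Counter(labels)
    total = len(labels)
    return {cls: count / total for cls, count in counts.items()}


def satisfies_thresholds(sample_labels, thresholds):
    """True if every class reaches its minimum proportion in the sample."""
    probs = class_probabilities(sample_labels)
    return all(probs.get(cls, 0.0) >= t for cls, t in thresholds.items())


def resample_until_valid(labels, sample_size, tolerance=0.2,
                         max_tries=1000, seed=0):
    """Draw random samples until one satisfies all class thresholds.

    Assumption: the threshold for a class is its overall proportion
    scaled by (1 - tolerance). The source does not specify this rule.
    Returns the chosen row indices and the number of attempts used.
    """
    rng = random.Random(seed)
    overall = class_probabilities(labels)
    thresholds = {cls: p * (1 - tolerance) for cls, p in overall.items()}
    for attempt in range(1, max_tries + 1):
        # Sample row indices without replacement.
        chosen = rng.sample(range(len(labels)), sample_size)
        if satisfies_thresholds([labels[i] for i in chosen], thresholds):
            return chosen, attempt
    # max_tries is a safeguard added here; the abstract repeats indefinitely.
    raise RuntimeError("no sample satisfied the class thresholds")


if __name__ == "__main__":
    # Hypothetical imbalanced label column: 70% a, 20% b, 10% c.
    labels = ["a"] * 70 + ["b"] * 20 + ["c"] * 10
    chosen, attempts = resample_until_valid(labels, sample_size=50,
                                            tolerance=0.5)
    print(attempts, class_probabilities([labels[i] for i in chosen]))
```

The accepted indices would then be used to train one base classifier. A tighter `tolerance` keeps each sample closer to the overall class distribution at the cost of more re-sampling attempts.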