Basic article information

  • Title: A multiple clustering combination approach based on iterative voting process
  • Authors: Soufiane Khedairia; Mohamed Tarek Khadir
  • Journal: Journal of King Saud University - Computer and Information Sciences
  • Print ISSN: 1319-1578
  • Publication year: 2022
  • Volume: 34
  • Issue: 1
  • Pages: 1370-1380
  • Language: English
  • Publisher: Elsevier
  • Abstract: This paper addresses the clustering ensemble problem, which aims to combine multiple clusterings into a single solution that is likely to be better in terms of robustness, novelty and stability. The proposed Iterative Combining Clusterings Method (ICCM) processes the entire dataset iteratively, with each iteration following a two-step framework. In the first step, different clustering algorithms process the common dataset individually; in the second step, a set of sub-clusters is extracted through a voting process among the data objects. To overcome the ambiguity due to voting, only objects that receive a majority vote are assigned to their corresponding sub-clusters. The remaining objects are collected and re-clustered in subsequent iterations. At the end of the iterative process, a clustering algorithm groups the obtained sub-cluster centres to extract the final clusters of the dataset. Two gene expression datasets and three real-life datasets were used to evaluate the proposed approach with external and internal criteria. The experimental results demonstrate the effectiveness and robustness of the proposed method, with an improvement in the DB index of up to 16.89% for the iris dataset and up to 14.98% for the wine dataset. The external validity metrics confirm the usefulness of the proposed approach, which achieves the highest average NMI score, 81.05%, across the datasets compared with different clustering ensemble methods.
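
The voting step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how cluster labels from different algorithms are matched or how "majority" is defined, so the sketch assumes greedy overlap matching against the first partition and a strict majority (more than half of the voters). The function names `align_labels` and `majority_vote` and the toy partitions are hypothetical.

```python
from collections import Counter


def align_labels(reference, labels):
    """Relabel `labels` so each cluster takes the reference label it overlaps most.

    Greedy overlap matching is an assumption of this sketch, not a detail
    taken from the paper.
    """
    overlap = Counter(zip(labels, reference))
    mapping, used = {}, set()
    # Visit (label, reference label) pairs from largest overlap to smallest.
    for (lab, ref), _ in overlap.most_common():
        if lab not in mapping and ref not in used:
            mapping[lab] = ref
            used.add(ref)
    # Unmatched clusters get a distinct tuple label so they never collide.
    return [mapping.get(lab, ("unmatched", lab)) for lab in labels]


def majority_vote(partitions):
    """One voting round: return (assigned, remaining).

    `assigned` maps object index -> agreed label; `remaining` lists the
    objects without a strict majority, which would be re-clustered in the
    next iteration.
    """
    reference = partitions[0]
    aligned = [list(reference)] + [align_labels(reference, p) for p in partitions[1:]]
    n_voters = len(aligned)
    assigned, remaining = {}, []
    for i in range(len(reference)):
        label, votes = Counter(p[i] for p in aligned).most_common(1)[0]
        if votes > n_voters / 2:  # strict majority (assumed threshold)
            assigned[i] = label
        else:
            remaining.append(i)
    return assigned, remaining


# Toy labelings of 7 objects by three hypothetical base clusterers.
partitions = [
    [0, 0, 1, 1, 2, 2, 0],
    [2, 2, 0, 0, 1, 1, 0],  # same grouping as the first, with permuted label names
    [0, 0, 1, 1, 2, 2, 2],
]
assigned, remaining = majority_vote(partitions)
print(assigned)   # objects 0-5 receive an agreed label
print(remaining)  # object 6 gets three different votes and is left over
```

In a full iteration, the objects in `remaining` would be passed back to the base clusterers, and the centres of the sub-clusters collected over all rounds would then be grouped by a final clustering algorithm, as the abstract outlines.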