首页    期刊浏览 2025年03月01日 星期六
登录注册

文章基本信息

  • 标题:A Reformed K-Nearest Neighbors Algorithm for Big Data Sets
  • 作者:Phu, Vo Ngoc ; Ngoc Tran, Vo Thi
  • 期刊名称:Journal of Computer Science
  • 印刷版ISSN:1549-3636
  • 出版年度:2018
  • 卷号:14
  • 期号:9
  • 页码:1213-1225
  • DOI:10.3844/jcssp.2018.1213.1225
  • 出版社:Science Publications
  • 摘要:A Data Mining Has Already Had Many Algorithms Which A K-Nearest Neighbors Algorithm, K-NN, Is A Famous Algorithm For Researchers. K-NN Is Very Effective On Small Data Sets, However It Takes A Lot Of Time To Run On Big Datasets. Today, Data Sets Often Have Millions Of Data Records, Hence, It Is Difficult To Implement K-NN On Big Data. In This Research, We Propose An Improvement To K-NN To Process Big Datasets In A Shortened Execution Time. The Reformed K-Nearest Neighbors Algorithm (R-K-NN) Can Be Implemented On Large Datasets With Millions Or Even Billions Of Data Records. R-K-NN Is Tested On A Data Set With 500,000 Records. The Execution Time Of R-K-NN Is Much Shorter Than That Of K-NN. In Addition, R-K-NN Is Implemented In A Parallel Network System With Hadoop Map (M) And Hadoop Reduce (R).
  • 关键词:K-Nearest Neighbors Algorithm; K-NN; Parallel Network Environment; Distributed System; Data Mining; Association Rules; Cloudera; Hadoop Map; Hadoop Reduce
Loading...
联系我们|关于我们|网站声明
国家哲学社会科学文献中心版权所有