首页    期刊浏览 2024年12月12日 星期四
登录注册

文章基本信息

  • 标题:PREDICTIVE DATA MINING TECHNIQUES FOR MANAGEMENT OF HIGH DIMENSIONAL BIG-DATA
  • 本地全文:下载
  • 作者:SONI LANKA ; RADHA MADHAVI M ; BASHIR SULEMAN ABUSAHMIN
  • 期刊名称:Journal of Industrial Pollution Control
  • 印刷版ISSN:0970-2083
  • 出版年度:2017
  • 卷号:33
  • 期号:1
  • 页码:1430-1436
  • 语种:English
  • 出版社:Research and Reviews
  • 摘要:Data mining is a technique, wherein the historical data is explored in search of a systematic relationship between variables and/or have a consistent pattern. This relationship is utilized to validate the outcomes by applying the identified patterns onto new data subsets. This paper compares three predictive data-mining techniques, namelymultiple linear regression, principal component regression and the partial least squares ona unique dataset. This data is unique, having a characteristics combination of presence of outliers, highly collinear variables,very redundant variables and predictor variables. In the initial step after pre-preparing information, negligible number of factors are chosen that can totally anticipate the reaction variable. These diverse information mining strategies, which has distinctive use techniques were actualized on the total informational index and the best strategy in every procedure was distinguished and this is utilized for worldwide examination with different systems for similar information.
  • 关键词:Multiple linear regressions; Principal component regression; Partial least squares
国家哲学社会科学文献中心版权所有