
Article Information

  • Title: Machine Learning Big Data Framework and Analytics for Big Data Problems
  • Authors: Shafaatunnur Hasan; Siti Mariyam Shamsuddin; Noel Lopes
  • Journal: International Journal of Advances in Soft Computing and Its Applications
  • Print ISSN: 2074-8523
  • Year of publication: 2014
  • Volume: 6
  • Issue: 2
  • Publisher: International Center for Scientific Research and Studies
  • Abstract: Big data computing generally deals with massive, high-dimensional data such as DNA microarray data, financial data, medical imagery, satellite imagery and hyperspectral imagery. It therefore needs advanced technologies or methods that reduce computational time while extracting valuable information without information loss. In this context, Machine Learning (ML) algorithms have been considered for learning and finding useful and valuable information from large volumes of data. However, ML algorithms such as Neural Networks are computationally expensive, and the central processing unit (CPU) is typically unable to cope with these requirements. Thus, high-performance hardware such as the Graphics Processing Unit (GPU) is needed to obtain faster solutions. GPUs provide remarkable performance gains compared to CPUs, and they are relatively inexpensive, widely available and scalable. Since 2006, NVIDIA has simplified the GPU programming model with the Compute Unified Device Architecture (CUDA), which supports accessible programming interfaces and industry-standard languages such as C and C++. Since then, ML algorithms running on General Purpose Graphics Processing Units (GPGPU) have been applied in various applications, including signal and image pattern classification in the biomedical area. The importance of fast analysis for detecting cancer or non-cancer is the motivation for this study. Accordingly, we propose a machine learning framework and analytics based on Self Organizing Map (SOM) and Multiple Back Propagation (MBP) for big biomedical data classification problems. Big data such as gene expression datasets are processed on a high performance computer and on Fermi architecture graphics hardware. Based on the experiments, MBP and SOM on a Tesla GPU achieve faster computing times than the high performance computer, with feasible results in terms of speed performance.
  • Keywords: GPGPU; big data framework; machine learning; soft computing; SOM; MBP; biomedical classification problems