期刊名称:International Journal of Advances in Soft Computing and Its Applications
印刷版ISSN:2074-8523
出版年度:2014
卷号:6
期号:2
出版社:International Center for Scientific Research and Studies
摘要:Generally, big data computing deals with massive and high dimensional data such as DNA microrray data, financial data, medical imagery, satellite imagery and hyperspectral imagery. Therefore, big data computing needs advanced technologies or methods to solve the issues of computational time to extract valuable information without information loss. In this context, generally, Machine Learning (ML) algorithms have been considered to learn and find useful and valuable information from large value of data. However, ML algorithms such as Neural Networks are computationally expensive, and typically the central processing unit (CPU) is unable to cope with these requirements. Thus, we need high performance computer to execute faster solutions such Graphical Processing Unit (GPU). GPUs provide remarkable performance gains compared to CPUs. The GPU is relatively inexpensive with affordable price, availability and scalability. Since 2006, NVIDIA provides simplification of the GPU programming model with the Compute Unified Device Architecture (CUDA), which supports for accessible programming interfaces and industry-standard languages, such as C and C++. Since then, General Purpose Graphical Processing U nit (GPGPU) using ML algorithms are applied on various applications; including signal and image pattern classification in biomedical area. The importance of fast analysis of detecting cancer or non-cancer becomes the motivation of this study. Accordingly , we proposed machine learning framework and analytics of Self Organizing Map (SOM) and Multiple Back Propagation (MBP) for big biomedical data classification problems. Big data such as gene expression datasets are executed on high performance computer and Fermi architecture graphical hardware. Based on the experiment, MBP and SOM with GPU - Tesla generates faster computing times than high performance computer with feasible results in terms of speed performance
关键词:GPGPU; big data framework; machine learning; soft computing; ; SOM; MBP; biomedical classification problems