Journal title: International Journal of Advanced Research In Computer Science and Software Engineering
Print ISSN: 2277-6451
Electronic ISSN: 2277-128X
Publication year: 2013
Volume: 3
Issue: 8
Publisher: S.S. Mishra
Abstract: Massive volumes of structured and unstructured data have grown so large that they are becoming difficult to process and manage with traditional databases and software techniques. These data sources may include structured data such as databases, device data, click streams and location information, as well as unstructured data such as email, HTML, social data and images. Social network databases (e.g. Facebook, Twitter, Google+, YouTube, Flickr) represent millions of petabytes of data, and these databases double in size every three months. Analysing such data is a very challenging problem. The overall goal of Big Data is to provide a scalable solution for vast quantities of data (terabytes, petabytes, exabytes) while maintaining reasonable processing times. Big Data offers a way to scale, diversify, and interactively analyse huge amounts of data held in tables with hundreds of billions of rows. To process such volumes efficiently, companies will need to intelligently incorporate big data into their existing information management systems and take advantage of the developing ecosystem of integration and analysis tools. This study gives an overview of Big Data and its importance.
Keywords: Big Data Analytic Tools; Data Mining; Hadoop and MapReduce; HBase and Hive tools; User-Friendly