Basic article information

  • Title: Processing unstructured documents and social media using Big Data techniques
  • Author: Diaconita, Vlad
  • Journal: Economic Research
  • Print ISSN: 1331-677X
  • Year of publication: 2015
  • Volume: 28
  • Issue: 1
  • Pages: 981-993
  • DOI: 10.1080/1331677X.2015.1095110
  • Language: English
  • Publisher: Juraj Dobrila University of Pula, Department of Economics and Tourism 'Dr. Mijo Mirkovic'
  • Abstract: Big Data technologies can be very useful when it comes to storing and processing, using sophisticated algorithms, terabytes or petabytes of data. With the latest advancements, such as Hadoop YARN, processing can be done not only in batch but also in real time. In this paper, we detail a methodology followed by a case study that investigates the power of machine learning algorithms used in a Hadoop environment in classifying unstructured data. We also investigate how to capture geolocated messages from social networks and how kriging can be used to see if there is a strong relationship between two or more such datasets.
  • Keywords: Hadoop; MapReduce; k-NN; social media; geolocated messages; large data sets