Article Information

  • Title: M-SANIT: A FRAMEWORK FOR EFFECTIVE BIG DATA SANITIZATION USING MAP REDUCE PROGRAMMING IN HADOOP
  • Authors: Y. SOWMYA; Dr. M NAGARATNA; Dr. C SHOBA BINDU
  • Journal: Journal of Theoretical and Applied Information Technology
  • Print ISSN: 1992-8645
  • Electronic ISSN: 1817-3195
  • Year: 2018
  • Volume: 96
  • Issue: 6
  • Publisher: Journal of Theoretical and Applied
  • Abstract: Sanitizing big data before it is mined or published is very important for privacy reasons. Although sanitization itself is not new, sanitizing big data based on a misusability score is a novel idea. We propose a framework, M-Sanit, to realize this idea. The framework sanitizes big data before it is processed. We propose an extended misusability score function that returns the misuse probability of a given dataset. This score plays an important role in determining the level of sanitization needed. Sanitization of this kind provides the expected level of anonymity and protects data from privacy attacks. The rationale is that outsourced data may be misused by insiders. To address this problem, the data is sanitized after its misusability score has been computed. Our contributions in this paper are two-fold. First, we provide a mathematical model for the extended misusability score. Second, we propose an algorithm that uses the misusability score to determine the level of sanitization. We built a prototype application on a locally configured Hadoop cluster as a proof of concept. Our results show the utility of M-Sanit for protecting big data from privacy problems.
  • Keywords: Big data; Hadoop; Map Reduce; Misusability measure; Sanitization