期刊名称:International Journal of Computer Science and Network Security
印刷版ISSN:1738-7906
出版年度:2013
卷号:13
期号:3
页码:112-119
出版社:International Journal of Computer Science and Network Security
摘要:Many abnormal topics or remarks on the world wide web may like crime, violence etc may disturb the public morality and cause social unrest. Most traditional methods filter a page as long as it contains a keyword in a predefined blacklist. Such methods cannot provide a quantitative measure of how sensitive the content is. In this paper, we propose a utility-based Web content sensitivity mining approach. Utility is viewed as the measure of how sensitive a page is. It allows the Internet regulators to take different operations according to different sensitivity values. We apply our approach on a real-world Web dataset. By varying the sensitive values of the keywords, different sets of high sensitivity keywords were discovered.