Basic Article Information

  • Title: Feature Weight Optimization Mechanism for Email Spam Detection based on Two-step Clustering Algorithm and Logistic Regression Method
  • Authors: Ahmed Hamza Osman; Hani Moetque Aljahdali
  • Journal: International Journal of Advanced Computer Science and Applications (IJACSA)
  • Print ISSN: 2158-107X
  • Electronic ISSN: 2156-5570
  • Publication year: 2017
  • Volume: 8
  • Issue: 10
  • DOI: 10.14569/IJACSA.2017.081054
  • Publisher: Science and Information Society (SAI)
  • Abstract: This research proposes an improved spam-filtering technique for suspect emails and messages, based on feature weighting combined with two-step clustering and logistic regression. Unique, important features serve as the optimal input to the proposed hybrid approach. The study adopts a spam detector model based on a distance measure and a threshold value. The aim of the model is to study and select distinct features for email filtering, using the feature weight method for dimension reduction. The two-step clustering algorithm generates a new feature called “Label” that clusters the diverse emails and groups them by inter-sample similarity. This simplifies the subsequent filtering step, in which a logistic regression classifier distinguishes the hidden patterns of spam and non-spam emails. Experiments were conducted on the UCI spam dataset. The results indicate that the email filtering is promising compared with other modern spam-filtering methods.
  • Keywords: Two-step clustering; spam filtering; classification; detection; feature weight; logistic regression
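
The pipeline outlined in the abstract (weight the features, keep the strongest ones, derive a cluster "Label" feature, then classify with logistic regression) can be illustrated with a minimal sketch. It is not the authors' implementation, and several parts are assumptions: the weight is taken to be the absolute gap between class means, plain two-cluster k-means stands in for the two-step clustering algorithm, the classifier is trained by simple batch gradient descent, and the toy data is invented rather than drawn from the UCI spam dataset. All function names are hypothetical.

```python
import math

def feature_weights(X, y):
    # Assumed scoring rule: absolute gap between the spam and non-spam class means.
    weights = []
    for j in range(len(X[0])):
        spam = [r[j] for r, t in zip(X, y) if t == 1]
        ham = [r[j] for r, t in zip(X, y) if t == 0]
        weights.append(abs(sum(spam) / len(spam) - sum(ham) / len(ham)))
    return weights

def select_top(X, weights, k):
    # Dimension reduction: keep the k highest-weighted columns.
    keep = sorted(sorted(range(len(weights)), key=lambda j: -weights[j])[:k])
    return [[r[j] for j in keep] for r in X], keep

def kmeans_labels(X, iters=20):
    # Two-cluster k-means, a simple substitute for two-step clustering.
    # Seeds are the first and last rows, so the result is deterministic.
    centers = [list(X[0]), list(X[-1])]
    labels = [0] * len(X)
    for _ in range(iters):
        labels = [
            min((0, 1), key=lambda c: sum((a - m) ** 2 for a, m in zip(r, centers[c])))
            for r in X
        ]
        for c in (0, 1):
            members = [r for r, lab in zip(X, labels) if lab == c]
            if members:  # keep the old centre if a cluster empties
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def train_logreg(X, y, lr=0.1, epochs=500):
    # Batch gradient descent on the mean log-loss.
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        errs = [sigmoid(sum(wi * xi for wi, xi in zip(w, r)) + b) - t
                for r, t in zip(X, y)]
        w = [wi - lr * sum(e * r[j] for e, r in zip(errs, X)) / n
             for j, wi in enumerate(w)]
        b -= lr * sum(errs) / n
    return w, b

def predict(X, w, b):
    # Class 1 (spam) when the linear score is positive.
    return [1 if sum(wi * xi for wi, xi in zip(w, r)) + b > 0 else 0 for r in X]

# Invented toy data: columns 0-1 separate the classes, columns 2-3 do not.
spam = [[3 + (i % 3) * 0.5, 2.5 + (i % 2), (i % 5) * 0.2, 1.0] for i in range(10)]
ham = [[(i % 3) * 0.5, (i % 2) * 0.5, (i % 5) * 0.2, 1.0] for i in range(10)]
X, y = spam + ham, [1] * 10 + [0] * 10

X_sel, keep = select_top(X, feature_weights(X, y), k=2)
clusters = kmeans_labels(X_sel)
X_aug = [r + [c] for r, c in zip(X_sel, clusters)]  # append the "Label" feature
w, b = train_logreg(X_aug, y)
accuracy = sum(p == t for p, t in zip(predict(X_aug, w, b), y)) / len(y)
```

Appending the cluster index as an extra column is one plausible reading of the "Label" feature; the paper itself should be consulted for the exact weighting scheme, distance measure, and threshold.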