首页    期刊浏览 2025年03月01日 星期六
登录注册

文章基本信息

  • 标题:Improved Term Weighting Technique for Automatic Web Page Classification
  • 本地全文:下载
  • 作者:Kathirvalavakumar Thangairulappan ; Aruna Devi Kanagavel
  • 期刊名称:Journal of Intelligent Learning Systems and Applications
  • 印刷版ISSN:2150-8402
  • 电子版ISSN:2150-8410
  • 出版年度:2016
  • 卷号:08
  • 期号:04
  • 页码:63-76
  • DOI:10.4236/jilsa.2016.84006
  • 语种:English
  • 出版社:Scientific Research Publishing
  • 摘要:Automatic web page classification has become inevitable for web directories due to the multitude of web pages in the World Wide Web. In this paper an improved Term Weighting technique is proposed for automatic and effective classification of web pages. The web documents are represented as set of features. The proposed method selects and extracts the most prominent features reducing the high dimensionality problem of classifier. The proper selection of features among the large set improves the performance of the classifier. The proposed algorithm is implemented and tested on a benchmarked dataset. The results show the better performance than most of the existing term weighting techniques.
  • 关键词:Web Page Classification;Term-Weighting Scheme;Feature Selection;Feature Extraction;Artificial Neural Network;Back Propagation
国家哲学社会科学文献中心版权所有