期刊名称:Journal of Intelligent Learning Systems and Applications
印刷版ISSN:2150-8402
电子版ISSN:2150-8410
出版年度:2016
卷号:08
期号:04
页码:63-76
DOI:10.4236/jilsa.2016.84006
语种:English
出版社:Scientific Research Publishing
摘要:Automatic web page classification has become inevitable for web directories due to the multitude of web pages in the World Wide Web. In this paper an improved Term Weighting technique is proposed for automatic and effective classification of web pages. The web documents are represented as set of features. The proposed method selects and extracts the most prominent features reducing the high dimensionality problem of classifier. The proper selection of features among the large set improves the performance of the classifier. The proposed algorithm is implemented and tested on a benchmarked dataset. The results show the better performance than most of the existing term weighting techniques.