Journal: Journal of Theoretical and Applied Information Technology
Print ISSN: 1992-8645
Electronic ISSN: 1817-3195
Publication year: 2014
Volume: 62
Issue: 2
Publisher: Journal of Theoretical and Applied
Abstract: Web Usage Mining (WUM) is the extraction of knowledge from web log data. Web log data contains a large amount of data that is irrelevant to WUM, so several preprocessing steps are required to obtain good-quality data, because the final result of WUM depends on the quality of the input data. In this paper we propose a new approach to this problem, called the two-level clustering approach. The first-level clustering is performed on the access-frequency data using a non-hierarchical clustering method. It is followed by a second-level clustering on the web log data in the form of user access, which combines hierarchical and non-hierarchical clustering methods. In the experiments, a web log data quality of 90.78% is reached.
Keywords: Data Quality Improvement; Two Level Clustering; Web Log Data; Web Mining; Web Usage Mining
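
The two-level idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the specific algorithms, so the sketch assumes k-means as the non-hierarchical method, centroid-linkage agglomerative clustering as the hierarchical method, and a second level in which the hierarchical result seeds k-means. The function names (`kmeans`, `agglomerate`, `two_level_clustering`), the `(user, page)` log format, and the demo data are all hypothetical.

```python
from collections import Counter
from math import dist


def mean(points):
    """Component-wise mean of a non-empty list of equal-length tuples."""
    return tuple(sum(c) / len(points) for c in zip(*points))


def kmeans(points, centroids, iters=50):
    """Plain k-means (non-hierarchical), started from the given centroids."""
    centroids = list(centroids)
    labels = []
    for _ in range(iters):
        # Assign every point to its nearest centroid.
        labels = [min(range(len(centroids)), key=lambda j: dist(p, centroids[j]))
                  for p in points]
        # Recompute centroids; an empty cluster keeps its old centroid.
        new = []
        for j, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == j]
            new.append(mean(members) if members else old)
        if new == centroids:  # converged
            break
        centroids = new
    return labels, centroids


def agglomerate(points, k):
    """Centroid-linkage agglomerative (hierarchical) clustering down to k groups."""
    groups = [[p] for p in points]
    while len(groups) > k:
        pairs = ((a, b) for a in range(len(groups)) for b in range(a + 1, len(groups)))
        i, j = min(pairs, key=lambda ab: dist(mean(groups[ab[0]]), mean(groups[ab[1]])))
        merged = groups.pop(j)  # j > i, so index i is unaffected
        groups[i].extend(merged)
    return groups


def two_level_clustering(log, k_users=2):
    """log: list of (user, page) pairs. Returns (kept_pages, user -> cluster id)."""
    # Level 1 (assumption): k-means with k=2 on page access frequency;
    # pages in the low-frequency cluster are treated as irrelevant.
    freq = Counter(page for _, page in log)
    pages = sorted(freq)
    pts = [(float(freq[p]),) for p in pages]
    labels, cents = kmeans(pts, [min(pts), max(pts)])
    high = max(range(2), key=lambda j: cents[j])
    kept = [p for p, lab in zip(pages, labels) if lab == high]

    # Level 2 (assumption): cluster users by their visit counts on kept pages;
    # hierarchical clustering supplies the seeds, k-means refines them.
    visits = Counter(log)
    users = sorted({u for u, _ in log})
    vecs = [tuple(float(visits[(u, p)]) for p in kept) for u in users]
    seeds = [mean(g) for g in agglomerate(vecs, k_users)]
    user_labels, _ = kmeans(vecs, seeds)
    return kept, dict(zip(users, user_labels))


# Hypothetical demo log: two browsing patterns plus two rarely hit resources.
DEMO_LOG = (
    [("u1", "/index"), ("u1", "/news")] * 2
    + [("u2", "/index"), ("u2", "/news"), ("u2", "/index")]
    + [("u3", "/shop"), ("u3", "/cart")] * 2
    + [("u4", "/shop"), ("u4", "/cart"), ("u4", "/shop")]
    + [("u1", "/favicon.ico"), ("u3", "/robots.txt")]
)

kept, groups = two_level_clustering(DEMO_LOG)
print(kept)
print(groups)
```

In this toy log the first level is expected to drop the two rarely requested resources, and the second level to separate the users who browse `/index` and `/news` from those who browse `/shop` and `/cart`. How the paper measures the reported data quality figure is not stated in the abstract, so no quality metric is sketched here.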