期刊名称:International Journal of Advanced Research In Computer Science and Software Engineering
印刷版ISSN:2277-6451
电子版ISSN:2277-128X
出版年度:2013
卷号:3
期号:5
出版社:S.S. Mishra
摘要:Web log mining can be described as the discovery and analysis of access patterns of users through mining of log files. For analyzing the customer's behavior, the data generated by the users visiting the website must be analyzed. The users' accesses to Web sites are stored in server log files. But the data stored in these log files do not present an accurate picture of the users' accesses to the Web site. So the preprocessing of web log data is a pre-requisite phase before it can be used for mining tasks. The preprocessed web data then is suitable for web mining. This paper presents various steps involved in preprocessing of web log files.
关键词:Web Server Logs; Data Preprocessing; Data cleaning; User Identification; Session Identification; Path ;Completion