Basic Article Information

  • Title: Study of Preprocessing Methods in Web Server Logs
  • Authors: Dr. Sanjeev Dhawan; Mamta Lathwal
  • Journal: International Journal of Advanced Research In Computer Science and Software Engineering
  • Print ISSN: 2277-6451
  • Electronic ISSN: 2277-128X
  • Publication year: 2013
  • Volume: 3
  • Issue: 5
  • Publisher: S.S. Mishra
  • Abstract: Web log mining is the discovery and analysis of users' access patterns through the mining of log files. To understand customer behavior, the data generated by users visiting a website must be analyzed. Users' accesses to websites are recorded in server log files, but the data stored in these files do not present an accurate picture of those accesses. Preprocessing of web log data is therefore a prerequisite before the data can be used for mining tasks; the preprocessed data are then suitable for web mining. This paper presents the steps involved in preprocessing web log files.
  • Keywords: Web Server Logs; Data Preprocessing; Data Cleaning; User Identification; Session Identification; Path Completion