期刊名称:International Journal on Computer Science and Engineering
印刷版ISSN:2229-5631
电子版ISSN:0975-3397
出版年度:2010
卷号:2
期号:7
页码:2447-2452
出版社:Engg Journals Publications
摘要:Today�s real world databases are highly susceptible to noisy, missing and inconsistent data due to their typically huge size data and their origin from multiple, heterogeneous sources. Hence, pre-processing of data is necessary to help improve the quality of data and consequently the mining results. There are number of data pre-processing techniques. In this paper, we would like to discuss two different approaches for data preprocessing one based on XML and other based on text file. But the basic algorithm and steps involved in pre-processing are considered same for both the approaches.
关键词:Pre-processing; Pattern Discovery; User Navigation Patter; weblogs