Journal: International Journal of Advanced Research in Computer Engineering & Technology (IJARCET)
Print ISSN: 2278-1323
Year of publication: 2015
Volume: 4
Issue: 6
Pages: 2681-2685
Publisher: Shri Pannalal Research Institute of Technology
Abstract: This paper focuses on an effective clustering and mining approach that makes use of side information. Many text mining applications have side information associated with their documents. This information may take various forms, such as provenance information of the documents, links in the documents, web logs that record user-access behavior, or other non-textual attributes embedded in the text documents. These attributes may contain a great deal of information that is useful for clustering. However, the relative importance of this side information can be hard to estimate, especially when some of it is noisy. In such cases, it can be risky to merge side information into the mining process, because it can either improve the quality of the representation or add noise to the system. The literature therefore suggests designing an efficient algorithm that combines a classical partitioning algorithm with a probabilistic model for effective clustering, so as to maximize the benefit of using side information.
Keywords: Data mining; Data clustering; Meta information; Text mining.
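
Illustrative sketch: the abstract describes combining a classical partitioning algorithm with a probabilistic model over side information, but gives no implementation details. The Python sketch below is one possible reading, not the paper's algorithm. It alternates a k-means-style step on term counts (cosine similarity to cluster centroids) with a naive-Bayes-style posterior over side attributes. The function names, the `weight` mixing parameter, the Laplace smoothing, the seed-based initialization, and the toy data are all assumptions made for illustration.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(v * b.get(t, 0) for t, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def side_posterior(attrs, sizes, counts, n_docs):
    """P(cluster | side attributes), naive-Bayes style with Laplace smoothing.

    Only attributes present on the document are scored (a simplification).
    """
    k = len(sizes)
    logp = []
    for c in range(k):
        lp = math.log((sizes[c] + 1) / (n_docs + k))
        for at in attrs:
            lp += math.log((counts[c][at] + 1) / (sizes[c] + 2))
        logp.append(lp)
    top = max(logp)
    expd = [math.exp(lp - top) for lp in logp]
    total = sum(expd)
    return [e / total for e in expd]


def cluster_with_side_info(docs, side, seeds, weight=0.5, iterations=10):
    """docs: list of token lists; side: list of attribute sets;
    seeds: indices of documents used as initial centroids (assumed scheme);
    weight: assumed mixing knob in [0, 1]; 0 ignores side information."""
    k = len(seeds)
    vecs = [Counter(d) for d in docs]
    cents = [Counter(vecs[s]) for s in seeds]
    # Initial assignment uses text similarity only.
    assign = [max(range(k), key=lambda c: cosine(v, cents[c])) for v in vecs]
    for _ in range(iterations):
        # Partitioning step: recompute centroids from the current assignment.
        for c in range(k):
            members = [v for v, a in zip(vecs, assign) if a == c]
            if members:
                cents[c] = sum(members, Counter())
        # Probabilistic step: count side attributes per cluster.
        sizes = [assign.count(c) for c in range(k)]
        counts = [Counter() for _ in range(k)]
        for a, attrs in zip(assign, side):
            counts[a].update(attrs)
        # Reassign using a weighted mix of text similarity and posterior.
        new_assign = []
        for v, attrs in zip(vecs, side):
            post = side_posterior(attrs, sizes, counts, len(docs))
            scores = [(1 - weight) * cosine(v, cents[c]) + weight * post[c]
                      for c in range(k)]
            new_assign.append(max(range(k), key=scores.__getitem__))
        if new_assign == assign:
            break
        assign = new_assign
    return assign


# Toy data (invented): the last document has ambiguous text, but its
# side attribute matches the sports documents.
docs = [
    ["clustering", "text", "mining"],
    ["text", "mining", "documents"],
    ["football", "match", "goal"],
    ["goal", "match", "league"],
    ["mining", "league"],
]
side = [{"src:cs"}, {"src:cs"}, {"src:sport"}, {"src:sport"}, {"src:sport"}]

text_led = cluster_with_side_info(docs, side, seeds=[0, 2], weight=0.5)
side_led = cluster_with_side_info(docs, side, seeds=[0, 2], weight=0.9)
print(text_led, side_led)
```

On this toy data the ambiguous fifth document stays with the text-mining cluster at `weight=0.5` and moves to the sports cluster at `weight=0.9`. This mirrors the trade-off the abstract raises: side information can improve the grouping or override the text signal, depending on how much it is trusted.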