期刊名称:International Journal of Advanced Research In Computer Science and Software Engineering
印刷版ISSN:2277-6451
电子版ISSN:2277-128X
出版年度:2013
卷号:3
期号:1
出版社:S.S. Mishra
摘要:Association Rule Mining (ARM) finds the interesting relationship between presences of various items in a given database. Apriori is the traditional algorithm for learning association rules. However, it is aff ected by number of database scan and higher generation of candidate itemsets. Each level of candidate itemsets requires separate memory locations. Hash Based Frequent Itemsets - Quadratic Probing (HBFI - QP) algorithm, which is based on hashing technique for mining the frequent itemsets. In order to stay away from collisions and primary clustering in hashing process, Quadratic Probing (QP) technique is used. Though the primary clustering and collisions are eliminated, secondary clustering is formed in all cases and the hash table occupies more space than the total number of items in the database. To avoid those problems, the H-Bit Array Hashing (H-BAH) algorithm is presented in this paper. H-BAH algorithm reduces hash table size required for placing items an d it also avoids hash collisions and secondary clustering. The H-Bit array that is added to the first or header bucket of the table gives the information about which buckets are hashed initially. At the time of collisions in the hashing process, the H -BAH algorithm works by finding the neighbourhood of buckets near the original hashed bucket, in order to place the collided items quickly. The H-BAH algorithm provides frequent itemsets with less computational time and memory than the existing algorithm.
关键词:Frequent itemsets; Hashing; Collisions; Linked List