A Comparative Analysis on Mining Frequent Itemsets

D.Kerana Hanirex

Abstract


Research on mining frequent itemsets is one of ` the emerging task in data mining.The purchasing of one product when another product is purchased represents an association rule. Association rules are useful for analyzing the  customer behavior. It takes an important part in shopping basket data analysis, clustering. The FP-Growth and Apriori  algorithm is the basic algorithm for mining association rules. This paper presents an efficient algorithm for mining frequent itemsets using  Two Dimensional Transactions Reduction(TDTR)  approach which  reduces the original  database(D) transactions to the reduced data base transactions D1 based on the min_sup count. Then for each item it finds the number of transactions that the item present  and hence find  the largest frequent itemset using the  two dimensional approach. Using the largest item set property ,it finds the subset of frequent item sets. Thus  TDTR approach  reduces the number of scans in the database and hence improve the efficiency & accuracy by finding the number of association rules and reduces time to find the rules.This proposed approach compares the efficiency with traditional Apriori and FP-Growth algorithm.


Keywords


Data mining,Association rule, Apriori Algorithm,FP-Growth algorithm, frequent Itemset, transaction reduction

References


Lee-Wen Huang ,Ye-In Chang,“A Graph-Based Approach for Mining Closed Large Itemsets” National Sun Yat-Sen University.

Sheng Chai, Jia Yang and Yang Cheng, “The Research of Improved Apriori Algorithm for Mining Association Rules” In Proceedings of the Service Systems and Service Management, 2007 International Conference, 9-11 June 2007

pages : 1 – 4.

Ja-HwungSu,Wen-Yang Lin “CBW: An Efficient Algorithm for Frequent Itemset Mining” In Proceedings of the 37th Hawaii International Conference on System Sciences – 2004.

R. Agrawal and R. Srikant, "Fast Algorithms for Mining Association Rules," in Proceedings of the 20th VLDB Conference, 1994, pp. 487−499.

Arun K Pujari. Data Mining Techniques (Edition 5):Hyderabad, India: Universities Press (India) Private Limited, 2003.

Margatet H. Dunham. Data Mining, Introductory and Advanced Topics: Upper Saddle River, New Jersey: Pearson Education Inc., 2003.

Jiawei Han. Data Mining, concepts and Techniques: San Francisco, CA: Morgan Kaufmann Publishers.,2004.

Akhilesh Tiwari, Rajendra K. Gupta, and Dev Prakash Agrawal “Cluster Based Partition Approach for Mining Frequent Itemsets” In Proceedings of the IJCSNS International Journal of computer Science and Network Security, VOL.9 No.6, June 2009

R.K. Gupta. Development of Algorithms for New Association Rule Mining System, Ph.D. Thesis, Submitted to ABV-Indian Institute of information Technology & Management, Gwalior, India, 2004.

M. Houtsma and A. Swami. Set Oriented Mining for Association Rules in Relational Databases. In Proceedings of 11th International conference on Data Engineering, 1995, pp 25-33,.

Agarwal R., Imielinski T., and Swami A. Mining associations between sets of items in massive databases. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Washington D.C. , May 1993, pp. 207-216.

M. Houtsma and A. Swami, Set Oriented Mining for Association Rules in RelationalDatabases. In Proceedings of 11th IEEE International Conference on Data Engineering, 1995, pp : 25-33.

Rakesh Agrawal and R . Srikant . Fast Algorithm for Mining Association Rules in Large Databases, Proceedings of the 20th International Conference on Very Large Databases, Santigo, Chile, 1994, pp 487-499.

Jiawei Han,Hong Cheng,Dong Xin and Xifeng Yan, “ Frequent pattern mining : current status and future directions “ ,Data Mining Knowledge Discovery(2007) 15:55-86.

Kerana Hanirex.D and M.A.Dorai Rengaswamy,”Efficient Algorithm for Mining Frequent Itemsets using Clustering techniques”,In Proceedings of the IJCSE Vol. 3 No. 3 Mar 2011,pp 1028-1032.


Full Text: PDF

Refbacks

  • There are currently no refbacks.




 


All Rights Reserved © 2012 IJARCSEE


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 Unported License.