Association Rule Mining through Matrix Manipulation using Transaction Patternbase
PDF

Keywords

Association Rules, Frequent Patterns, Patternbase, Transaction Base, Matrix, Algorithm.

How to Cite

Shahid Kamal, Roliana Ibrahim, & Zia-ud-Din. (2012). Association Rule Mining through Matrix Manipulation using Transaction Patternbase. Journal of Basic & Applied Sciences, 8(1), 187–195. https://doi.org/10.6000/1927-5129.2012.08.01.30

Abstract

In data mining studies, mining of frequent patterns in transaction databases has been a popular area of research. Many approaches are being used to solve the problem of discovering association rules among items in large databases. We also consider the same problem. We present a new approach for solving this problem that is fundamentally different from the known techniques. In this study, we propose a transactional patternbase where transactions with same pattern are added as their frequency is increased. Thus subsequent scanning requires only scanning this compact dataset which increases efficiency of the respective methods. We have implemented this technique by using two-dimensional matrix instead of using FP-Growth method, as used by most of the algorithms. Empirical evaluation shows that this technique outperforms the database approach, implemented with FP-Growth, in many situations and performs exceptionally well when the repetition of transaction patterns is higher. We have implemented it using Visual Basic which has substantially reduced coding and computational cost. Success of this method will open new directions.

https://doi.org/10.6000/1927-5129.2012.08.01.30
PDF

References

Agrawal R. Imielinski T, Swami AN, Mining association rules between sets of items in large databases. Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, of SIGMOD Record, ACM Press 1993 volume 22(2): pp. 207-216. http://dx.doi.org/10.1145/170035.170072

Zaki MJ, Mining Non-redundant Association Rules, Data Mining and Knowledge Discovery, 2004; 9: pp. 223-248. http://dx.doi.org/10.1023/B:DAMI.0000040429.96086.c7

Agrawal R, Srikant R. Fast algorithms for mining association rules. Proceedings 20th International Conference on Very Large Data Bases, Morgan Kaufmann 1994; pp. 487-499.

Brin S, Motwani R, Ullman JD, Tsur S, Dynamic itemset counting and implication rules for market basket data. Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data, of SIGMOD Record, ACM Press, 1997; volume 26(2): pp. 255-264. http://dx.doi.org/10.1145/253260.253325

Pei J, Han J, Mao R, CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery 2000, Dallas, TX, 2000; pp. 21-30.

Han J, Pei J, Yin Y, Mining frequent patterns without candidate generation. Proceeding of 2000 ACM SIGMOD Int. Conf. Management of Data (SIGMOD’00), Dallas, TX, May 2000; pp. 1-12.

Han J, Pei J, Yin Y, Mao R, Mining frequent patterns without candidate generation: A frequent-pattern tree approach. Data Mining and Knowledge Discovery 2003.

Markus Hegland, Lecture Notes in Computer Science 2003; volume 2600: pp. 226-234. http://dx.doi.org/10.1007/3-540-36434-X_7

Zaki MJ, Hsiao C-J, CHARM: An efficient algorithm for closed itemset mining. In R. Grossman. Proceedings of the Second SIAM International Conference on Data Mining 2002.

THE LUCS-KDD SOFTWARE LIBRARY (LIVERPOOL UNIVERSITY COMPUTER SCIENCE KNOWLEDGE DISCOVERY IN DATAS) http://www.csc.liv.ac.uk/~frans/KDD/Software/FPgrowth/