Synthesizing algorithm for mining composite-frequent item sets
Abstract: Finding frequent item sets is the key step in association rule mining. To find multi-valued frequent item sets quickly in transaction databases that contain numeric data, the definition of a transaction database is extended to cover multiple numeric values. On this basis, a tree is built in which every node holds a transaction item and its quantity. Combining the Apriori algorithm with artificial-intelligence search, the method FABCTA (Fast Algorithm By Candidate Transaction Tree and Apriori) is proposed, which finds frequent item sets within the individual, smaller branch paths of the tree. In comparative experiments on real data, FABCTA was markedly more efficient than the Apriori algorithm.
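For orientation, the Apriori baseline that FABCTA is compared against can be sketched as follows. This is a minimal illustration only, not the FABCTA method itself: the abstract does not give the tree construction or search details, so none are reproduced here. The representation of each transaction as a set of hypothetical `(item, quantity)` pairs, the sample data, and the function name `apriori` are all assumptions made for this sketch.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support count} for every frequent itemset.

    Each transaction is a frozenset of (item, quantity) pairs; this
    encoding of the "extended" transaction database is an assumption.
    """
    # Pass 1: count every single (item, quantity) pair.
    counts = {}
    for t in transactions:
        for pair in t:
            key = frozenset([pair])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(frequent)

    k = 2
    while frequent:
        # Join step: merge frequent (k-1)-itemsets into k-itemsets.
        # Prune step: keep a candidate only if all of its (k-1)-subsets
        # are themselves frequent.
        candidates = set()
        for a, b in combinations(list(frequent), 2):
            union = a | b
            if len(union) == k and all(
                frozenset(sub) in frequent
                for sub in combinations(union, k - 1)
            ):
                candidates.add(union)
        # Count support of each surviving candidate by scanning the data.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        frequent = {s: c for s, c in counts.items() if c >= min_support}
        result.update(frequent)
        k += 1
    return result


# Hypothetical sample data: four transactions with item quantities.
transactions = [
    frozenset({("milk", 2), ("bread", 1)}),
    frozenset({("milk", 2), ("bread", 1), ("beer", 6)}),
    frozenset({("milk", 1), ("bread", 1)}),
    frozenset({("milk", 2), ("beer", 6)}),
]
result = apriori(transactions, min_support=2)
```

Because every level requires a full scan of the transactions to count candidates, the cost grows quickly with database size, which is the overhead a tree-based approach such as the one described in the abstract aims to reduce.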
Key words:
- databases
- trees
- rules
- search theory
- Apriori algorithm