Computer and Modernization
Database and Data Mining
Abstract: When searching for frequent itemsets, the Apriori algorithm typically scans the database repeatedly and generates a large number of useless candidate sets. To address this problem, an improved Apriori algorithm based on matrix reduction is proposed. The algorithm scans the database only once, converts the database information into a Boolean matrix, and reduces this data structure using conclusions derived from the properties of frequent k-itemsets, which effectively limits the number of invalid candidate itemsets generated. A comparison with existing algorithms confirms that the proposed algorithm improves the efficiency of mining frequent itemsets.
Key words: data mining, association rules, Apriori algorithm, frequent itemsets, matrix reduction
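The abstract gives only the outline of the method (one scan, a Boolean matrix, reduction based on properties of frequent k-itemsets). The following Python sketch illustrates that general idea and is not the authors' implementation. The two reduction rules it uses are assumptions chosen for illustration: columns of infrequent items are dropped, and before counting k-itemsets, rows with fewer than k ones are dropped, since such a transaction cannot contain any k-itemset. The names `build_boolean_matrix` and `frequent_itemsets` are hypothetical.

```python
from itertools import combinations


def build_boolean_matrix(transactions):
    """Read the transactions once and return (items, matrix).

    matrix[t][j] is True when transaction t contains items[j].
    """
    rows = [set(t) for t in transactions]      # the only pass over the raw data
    items = sorted(set().union(*rows))
    matrix = [[item in row for item in items] for row in rows]
    return items, matrix


def frequent_itemsets(transactions, min_count):
    """Return {frozenset(itemset): support count} for all frequent itemsets.

    Assumed reduction rules (illustrative, not taken from the paper):
      1. drop columns whose item support is below min_count;
      2. before counting k-itemsets, drop rows with fewer than k ones.
    """
    items, matrix = build_boolean_matrix(transactions)

    # Reduction 1: keep only the columns of frequent single items.
    support = [sum(row[j] for row in matrix) for j in range(len(items))]
    keep = [j for j, s in enumerate(support) if s >= min_count]
    result = {frozenset([items[j]]): support[j] for j in keep}
    items = [items[j] for j in keep]
    matrix = [[row[j] for j in keep] for row in matrix]

    current = [(j,) for j in range(len(items))]  # frequent 1-itemsets as column indices
    k = 2
    while current:
        # Reduction 2: a row with fewer than k ones cannot support a k-itemset.
        matrix = [row for row in matrix if sum(row) >= k]

        # Standard Apriori join and prune on sorted index tuples.
        previous = set(current)
        candidates = set()
        for a in current:
            for b in current:
                if a[:-1] == b[:-1] and a[-1] < b[-1]:
                    c = a + (b[-1],)
                    if all(sub in previous for sub in combinations(c, k - 1)):
                        candidates.add(c)

        # Support counting is a row-wise AND over the candidate's columns.
        current = []
        for c in sorted(candidates):
            count = sum(all(row[j] for j in c) for row in matrix)
            if count >= min_count:
                result[frozenset(items[j] for j in c)] = count
                current.append(c)
        k += 1
    return result
```

For example, calling `frequent_itemsets([{"A", "B", "C"}, {"A", "B"}, {"A", "C"}, {"B", "C"}, {"A", "B", "C", "D"}], 3)` drops the column for `D` in the first reduction and shrinks the matrix to two rows before the 3-itemset pass, so later passes touch progressively less data without rereading the original transactions.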
REN Wei-jian, YU Bo-wen. Improved Apriori Algorithm Based on Matrix Reduction[J]. Computer and Modernization.
Link to this article: http://www.c-a-m.org.cn/CN/
http://www.c-a-m.org.cn/CN/Y2015/V0/I9/1