计算机与现代化

• 数据库与数据挖掘 •    下一篇

 基于矩阵约简的Apriori算法改进

  

  1. 东北石油大学电气信息工程学院,黑龙江大庆163318
  • 收稿日期:2015-04-27 出版日期:2015-09-21 发布日期:2015-09-24
  • 作者简介:任伟建(1963-),女,黑龙江大庆人,东北石油大学电气信息工程学院教授,博士,研究方向:复杂系统的建模与控制; 于博文(1988-),男,黑龙江大庆人,硕士研究生,研究方向:智能控制,数据挖掘。
  • 基金资助:
     国家自然科学基金资助项目(61374127); 黑龙江省博士后科研启动资金(LBH-Q12143)

Improved  Apriori Algorithm Based on Matrix Reduction

  1. Electrical Information Engineering Institute, Northeast Petroleum University, Daqing 163318, China
  • Received:2015-04-27 Online:2015-09-21 Published:2015-09-24

摘要:  Apriori算法在搜索频繁项集过程中,通常需要对数据库进行多次的重复扫描和产生大量无用的候选集,针对此问题提出一种基于矩阵约简的Apriori改进算法。该算法只需扫描一次数据库,将数据库信息转换成布尔矩阵,根据频繁k-项集的性质推出的结论来约简数据结构,有效地降低无效候选项集的生成规模。通过对已有算法的对比,验证该算法能有效地提高挖掘频繁项集的效

关键词:  , 数据挖掘, 关联规则, Apriori算法, 频繁项集, 矩阵约简

Abstract: During the search for frequent itemsets of the Apriori algorithm, the database is scanned repetitively and generates a large number of useless candidate sets. For this problem, a kind of improved Apriori algorithm based on the matrix reduction is put forward. The algorithm scans the database only once, converts the database information to Boolean matrix, and reduces the data structure according to the conclusion drawn from the nature of the frequent k-itemsets, which lowers the generation scale of the invalid candidate itemsets effectively. By comparing with the existing algorithms, it is validated that this algorithm can improve the efficiency of mining frequent itemsets effectively.

Key words: data mining, association rules, Apriori algorithm, frequent itemsets, matrix reduction