[1]WITTEN I H, FRANK E, HALL M A. Data Mining: Practical Machine Learning Tools and Techniques[M]. Beijing: China Machine Press, 2012. (in Chinese)
[2]AGRAWAL R, IMIELINSKI T, SWAMI A. Mining association rules between sets of items in large databases[C]// Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data. 1993:207-216.
[3]Apache Software Foundation. Apache Hadoop 2.7.2[DB/OL]. (2016-01-26)[2016-03-18]. http://hadoop.apache.org/docs/r2.7.2/.
[4]ZAHARIA M, CHOWDHURY M, FRANKLIN M J, et al. Spark: Cluster computing with working sets[C]// Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing. 2010: Article No. 10.
[5]QIU H J, GU R, YUAN C F, et al. YAFIM: A parallel frequent itemset mining algorithm with Spark[C]// Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium Workshops. 2014:1664-1671.
[6]RATHEE S, KAUL M, KASHYAP A. R-Apriori: An efficient Apriori based algorithm on Spark[C]// Proceedings of the 8th Workshop on Ph.D. Workshop in Information and Knowledge Management. 2015:27-34.
[7]CUI Y, BAO Z Q. A survey of association rule mining[J]. Application Research of Computers, 2016,33(2):330-334. (in Chinese)
[8]BLOOM B H. Space/time trade-offs in hash coding with allowable errors[J]. Communications of the ACM, 1970,13(7):422-426.
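[9]YANG L, HUANG J Z. Multi-way balanced matrix Bloom Filter[J]. Journal of Hunan University (Natural Sciences), 2018,45(2):133-140. (in Chinese)
[10]XIAO M Z, DAI Y F, LI X M. Split Bloom Filter[J]. Acta Electronica Sinica, 2004,32(2):241-245. (in Chinese)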
[11]DEAN J, GHEMAWAT S. MapReduce: Simplified data processing on large clusters[J]. Communications of the ACM, 2008,51(1):107-113.
[12]AKIL B, ZHOU Y, ROHM U. On the usability of Hadoop MapReduce, Apache Spark & Apache Flink for data science[C]// Proceedings of the 2017 IEEE International Conference on Big Data. 2017:303-310.
[13]WU X D, JI S W. Comparison of MapReduce and Spark for big data analytics[J]. Journal of Software, 2018,29(6):1770-1791. (in Chinese)
[14]SUMITHRA R, PAUL S, LATHA D P P. A hybrid algorithm combining weighted and hashT Apriori algorithms in Map Reduce model using Eucalyptus cloud platform[J]. WSEAS Transactions on Computers, 2015,14:382-388.
[15]DHANYA S, VYSAAKAN M, MAHESH A S. An enhancement of the MapReduce Apriori algorithm using vertical data layout and set theory concept of intersection[M]// Intelligent Systems Technologies and Applications. Springer, 2016,2:225-233.
[16]SINGH S, GARG R, MISHRA P K. Performance optimization of MapReduce-based Apriori algorithm on Hadoop cluster[J]. Computers & Electrical Engineering, 2018,67:348-364.
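[17]XIE Z M, WANG P. Parallel matrix Apriori algorithm based on the MapReduce architecture[J]. Application Research of Computers, 2017,34(2):401-404. (in Chinese)
[18]CHENG Y, ZHANG Y. Improvement and study of the Apriori algorithm based on MapReduce-HBase[J]. Journal of Nanjing University of Posts and Telecommunications (Natural Science Edition), 2018,38(5):91-99. (in Chinese)
[19]LIU L P, ZHANG X Y, NIU X L, et al. A survey of Spark-based parallel association rule mining algorithms[J/OL]. Computer Engineering and Applications, (2019-01-30)[2019-04-20]. http://kns.cnki.net/kcms/detail/11.2127.TP.20190128.1804.009.html. (in Chinese)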
[20]LUO Y H, YANG Z F, SHI H K, et al. A distributed frequent itemsets mining algorithm using sparse Boolean matrix on Spark[M]// Web Technologies and Applications. Springer, 2016:419-423.
[21]KARIM R, COCHEZ M, BEYAN O D, et al. Mining maximal frequent patterns in transactional databases and dynamic data streams: A Spark-based approach[J]. Information Sciences, 2018,432:278-300.
[22]FIMI Workshops. Frequent Itemset Mining Dataset Repository[EB/OL]. [2019-04-20]. http://fimi.uantwerpen.be/data/.
[23]SPMF. An Open-Source Data Mining Library[EB/OL]. [2019-04-20]. http://www.philippe-fournier-viger.com/spmf/index.php?link=datasets.php.