计算机与现代化 ›› 2009, Vol. 1 ›› Issue (12): 21-24.doi: 10.3969/j.issn.1006-2475.2009.12.006

• 人工智能 • 上一篇    下一篇

基于Apriori算法的频繁访问页面挖掘分析

王靓明1,杨文琳2,朱 敏1
  

  1. 1.南昌大学信息工程学院,江西 南昌 330031; 2.江西理工大学南昌校区,江西 南昌 330013
  • 收稿日期:2008-12-15 修回日期:1900-01-01 出版日期:2009-11-27 发布日期:2009-11-27

Analysis of Frequent Access Pages Mining Based on Apriori Algorithm

WANG Liang-ming1, YANG Wen-lin2, ZHU Min1
  

  1. 1. School of Information Engineering, Nanchang University, Nanchang 330031, China;2. Nanchang Campus, Jiangxi University of Science and Technology, Nanchang 330013, China
  • Received:2008-12-15 Revised:1900-01-01 Online:2009-11-27 Published:2009-11-27

摘要: 挖掘最大频繁项目集是数据挖掘应用中关联规则的关健问题。本文结合SQL语言的特点,对Apriori算法进行改进,由于改进后的Apriori算法不需要多次扫描数据库,实现过程不断采用临时表,不断缩小搜索空间,从而提高Apriori算法运行效率。将其应用至频繁访问页面挖掘中,从而找出Web访问信息的关联规则,更好地组织站点的Web空间,为用户带来方便。

关键词: SQL, 频繁访问页面, 关联规则, Apriori算法

Abstract: Mining maximum frequent item sets is the key issues in the application of data mining association rules. Combined with features of SQL language, improvement on Apriori algorithm can make efficiency improvements to a certain extent, because improved Apriori algorithm does not need to scan the database many times and narrows the search space constantly. The paper puts the application into frequently visiting in the page mining to find out the association rules of Web visited information, to better organize the Web site space for the convenience of users.

Key words: SQL, frequent access pages, association rule, Apriori algorithm

中图分类号: