Computer and Modernization ›› 2013, Vol. 218 ›› Issue (10): 1-5.doi: 10.3969/j.issn.1006-2475.2013.10.001

• 算法设计与分析 •     Next Articles

Apriori-based Web Traversal Pattern Mining Algorithm

LIU Mei-ling 1,2, SU Yi-juan 2,3   

  1. 1. College of Information Science and Engineering, Guangxi University for Nationalities, Nanning 530006, China;2. Key Laboratory of Science Computing and Intelligent Information Processing in Universities of Guangxi, Guangxi Teachers Education University, Nanning 530023, China;3. College of Computer and Information Engineering, Guangxi Teachers Education University, Nanning 530023, China
  • Received:2013-05-24 Revised:1900-01-01 Online:2013-10-26 Published:2013-10-26

Abstract: The Apriori algorithm and the directed graph representation method for Web traversal paths are briefly introduced, and an algorithm based on Apriori is proposed for generating frequent traversal patterns from Web log files. The proposed algorithm uses the orderliness of the traversal paths as pruning strategy of candidate set, thus it can decrease the scale of candidate sets and improve efficiency. Some experiments are conducted with real datasets and simulated datasets, and the experimental results show the effectiveness and good adaptability of the proposed algorithm.

Key words: WFTP algorithm, Web log file, data mining, frequently traversed path, sequential traversed path

CLC Number: