计算机与现代化

• 数据库与数据挖掘 • 上一篇    下一篇

改进FP-Growth算法在旅游线路规划中的应用研究

  

  1. (北京印刷学院信息工程学院,北京102600)
  • 收稿日期:2017-07-04 出版日期:2018-03-08 发布日期:2018-03-09
  • 作者简介:字云飞(1991-),男,云南大理人,北京印刷学院信息工程学院硕士研究生,研究方向:数据挖掘,可视化分析; 李业丽(1962-),女,北京人,教授,研究方向:大数据,数据挖掘; 孙华艳(1990-),女,硕士研究生,研究方向:数据挖掘,可视化分析; 张莉婧(1992-),女,硕士研究生,研究方向:数据挖掘,可视化分析。
  • 基金资助:
    北京市科技创新服务能力协调创新项目(PXM2016_014223_000025)

An Improved FP-Growth Algorithm in Planning of Leisure Travel

  1. (College of Information Engineering, Beijing Institute of Graphic Communication, Beijing 102600, China)
  • Received:2017-07-04 Online:2018-03-08 Published:2018-03-09

摘要: 针对关联规则挖掘的FP-Growth算法存在对海量数据存储时消耗极大内存开销的弊端,提出一种对FP-Growth加入兴趣度的改进算法,然后与Apriori,FP-Growth算法进行比较,改进后的算法极大减少了内存开销,同时提高了系统执行效率。并且提出改进算法与旅游线路规划挖掘结合的理念,以云南旅游业作为旅游规划对象,充分应用旅游网站的大数据,设计一种旅游线路规划的挖掘系统,为旅游企业找出游客最喜欢的旅游线路以及景区之间的关联规则。

关键词: 数据挖掘, FP-Growth算法, 兴趣度, 旅游规划, 索引

Abstract: According to the disadvantage of consuming great memory cost in storing massive data existing in the FP-Growth algorithm of mining association rules, the paper presents an improved algorithm which adds interestingness to FP-Growth, and then compares it with Apriori and FP-Growth algorithm, the improved algorithm greatly reduces the memory cost and improves the efficiency of system execution. Based on the idea of combining the improved algorithm with the tourism route planning, taking Yunnan tourism as the tourism planning object and fully applying the large data of the tourism website, the paper designs a mining system for tourism route planning, and finds out the association rules between tourist routes and scenic spots.

Key words: data mining, FP-Growth algorithm, interestingness, tourism plan, index

中图分类号: