计算机与现代化

• 应用与开发 • 上一篇    下一篇

基于关联规则的热点事件时序分析方法

  

  1. (中国石油大学(华东)计算机与通信工程学院,山东青岛266580)
  • 收稿日期:2018-01-26 出版日期:2018-09-11 发布日期:2018-09-11
  • 作者简介:王奕文(1992-),女,河北承德人,中国石油大学(华东)计算机与通信工程学院硕士研究生,研究方向:数据挖掘,舆情分析; 〖JP2〗刘昕(1974-),女,山东潍坊人,副教授,博士,研究方向:网络安全,社会计算; 曹帅(1995-),男,山东泰安人,硕士研究生,研究方向:数据挖掘,舆情分析,网络安全; 王丰(1992-),男,山东潍坊人,硕士研究生,研究方向:舆情分析,网络安全。
  • 基金资助:
    国家自然科学基金资助项目(61309124); 山东省重点研发计划项目(2017GGX10140)

Time Series Analysis of Hot Event Based on Association Rules

  1. (College of Computer and Communication Engineering, China University of Petroleum, Qingdao 266580, China)
  • Received:2018-01-26 Online:2018-09-11 Published:2018-09-11

摘要: 热点事件在发展过程中包括多个相关话题,分析多个话题在时序上的演化和传播路径,能够深层次把握热点事件产生、发展、消亡的具体细节。为此提出一种基于关联规则的热点事件时序分析方法。首先将关联规则算法并行实现获取多个时间片的频繁关键词集;然后筛选所有频繁关键词集的关联规则形成关联规则集,从而得到多个话题关键词集合;最后根据关键词集合分析热点事件多个话题的演化和传播路径。实验表明,该方法能够全面有效地跟踪热点事件的动态变化过程,为网络舆情监控和管理提供借鉴和支撑。

关键词: 关联规则, 话题演化, 话题传播路径, 时序分析

Abstract: As the hot event includes a number of related topics in the development process, analyzing the evolution and propagate path in time-sequence of topics can deeper grasp the specific details of the emergence, development and demise of hot event. In this situation, a method of time series analysis of hot event based on association rules is proposed. Firstly, the frequent keyword sets of multiple time slices are obtained by implementing association rules in parallel. Secondly, the association rule sets are obtained by selecting all of association rule of frequent keyword sets, so as to get keyword sets of multiple topics. Finally, the evolution and propagation path of multiple topics are analyzed according to the keyword sets. Experiments show that the method can track the dynamic change process of hot events comprehensively and effectively, which provides reference and support for network public opinion on the monitoring and management.

Key words: association rules, topic evolution, topic propagation path, time series analysis

中图分类号: