计算机与现代化

• 网络与通信 • 上一篇    下一篇

一种基于用户关联分析的热点话题识别算法

  

  1. 华北计算技术研究所总体部,北京100083
  • 收稿日期:2013-10-22 出版日期:2014-01-20 发布日期:2014-02-10
  • 作者简介: 张昭(1988-),男,湖北公安人,华北计算技术研究所总体部硕士研究生,研究方向:信息安全,数据挖掘; 艾中良(1971-),男,河北保定人,主任,正研级高级工程师,研究方向:信息应用,信息管理与共享。

 A Hot Topic Identification Algorithm Based on User Relevance Analysis

  1. General Department, North China Institute of Computing Technology, Beijing 100083
  • Received:2013-10-22 Online:2014-01-20 Published:2014-02-10

摘要: 为了提高从社交网络文本信息中发现热点话题的准确率,提出一种基于用户关联分析的热点话题识别算法。该算法综合考虑词频变化率和用户权威度,词频变化率通过EMA和MACD等指标来计算,用户权威度通过建立用户关联图的方式来计算。使用基于HITS算法的话题热度度量计算方法,将词频变化率数据和用户权威度数据结合在一起,得到话题的热度值。实验结果表明,使用基于用户关联分析的热点话题识别算法能够提高热点话题发现准确率。

关键词:  , 话题检测, 用户权威度度量, 特征变化率度量

Abstract: To improve the accuracy of detecting hot topic from social network text information, a hot topic identification algorithm based on user relevance analysis is raised. The algorithm considers both the frequency change rate of feature word and the authority of users. The frequency change rate of feature word is elevated using EMA and MACD indicators and the authority of users is calculated by creating user relevance graph. We use a method based on HITS algorithm to calculate the hot value of topic by combining feature frequency change rate data with user authority data together. According to the result of the experiment, the hot topic identification algorithm based on user relevance analysis can raise the accuracy of hot topic identification.

Key words:  hot topic detection, user authority calculation, feature change rate calculation