计算机与现代化

• 网络与通信 • 上一篇    下一篇

基于改进Kmeans的网络舆情热点事件发现技术

  

  1. 江苏科技大学经济管理学院,江苏镇江212003
  • 收稿日期:2013-12-17 出版日期:2014-04-17 发布日期:2014-04-23
  • 作者简介:作者简介:孙玲芳(1963),男,江苏镇江人,江苏科技大学经济管理学院教授,研究方向:电子商务与信息管理; 周加波(1989),男,硕士研究生,研究方向:网络群体事件,网络舆情。
  • 基金资助:
     
    基金项目:教育部人文社科基金资助项目(10YJAZH069); 江苏省“六大人才高峰”高层次人才项目(XXRJ013)

Clustering of Network Public Opinion Hot Issues Detection Based on Improved Kmeans

  1. School of Economics and Management, Jiangsu University of Science and Technology, Zhenjiang 212003, China
  • Received:2013-12-17 Online:2014-04-17 Published:2014-04-23

摘要:  

摘要: 基于网络舆情监控的需要,设计一个网络舆情热点事件自动发现模型,包括舆情信息采集、中文分词、特征选择、文本分词和聚类分析。对Kmeans算法进行改进,减少算法对孤立点的敏感性和降低算法的时间和空间复杂度。利用F1值对改进的Kmeans算法和传统Kmeans算法进行性能比较,证明了该模型的可行性与有效性。

关键词: 网络舆情, VSM, 改进Kmeans算法, 热点事件

Abstract:  

Abstract:  Based on the needs of the network public opinion monitoring, this paper designs a model for automatic discovering the network public opinion hot issues. The system includes public opinion information acquisition, Chinese word splitter, feature selection, text segmentation and clustering analysis. By improving the Kmeans algorithm, the sensitivity of the algorithm for outlier is reduced, and the time and space complexity of the algorithm is reduced also. This paper makes use of F1 value to compare the improved Kmeans algorithm with the traditional Kmeans algorithm, which obtains satisfactory results and proves the feasibility and effectiveness of this model.

Key words:

中图分类号: