计算机与现代化

• 应用与开发 • 上一篇    下一篇

基于最长公共子序列的微博谣言溯源研究

  

  1. 中国人民公安大学信息技术与网络安全学院,北京100038
  • 收稿日期:2017-05-12 出版日期:2018-01-23 发布日期:2018-01-24
  • 作者简介:李城(1994-),男,山西晋城人,中国人民公安大学信息技术与网络安全学院硕士研究生,研究方向:自然语言处理与信息溯源; 沙俊淞(1994-),男,江西赣州人,硕士研究生,研究方向:自然语言处理; 武文(1992-),男,山西人,硕士研究生,研究方向:云计算服务等级协议合规性与信息溯源。
  • 基金资助:
    国家“863”计划项目(2015AA016009)

Research on Origin of Micro-blog Rumors Based on Longest Common Subsequence

  1. College of Information Technology and Cybersecurity, People’s Public Security University of China, Beijing 100038, China
  • Received:2017-05-12 Online:2018-01-23 Published:2018-01-24

摘要: 手机等移动设备的普及,使得微博等社交网络成为信息发布和共享的重要渠道。但同时,大量的反动、虚假、色情信息充斥着整个网络,谣言的恶劣影响日益突出,一些谣言的出现已经严重影响了人们对网络信息的获取和正常使用。如何对网络中的各类谣言进行检测,挖掘出谣言的源头及传播方式成为当前公安网信部门亟需解决的问题。本文以微博谣言为例,根据现有的LCS最长公共子序列算法在构造序列表格时做了相应的改进,并根据改进的LCS算法比对分析微博谣言。初步实验表明,改进后的算法可以更高效地对微博谣言进行比对溯源,从而帮助公安机关发现微博谣言源头。

关键词: 微博谣言, 溯源, LCS算法, 动态规划

Abstract: Mobile phones and other mobile devices, make micro-blog and other social networks become an important channel for information dissemination and sharing. But at the same time, a large number of reactionary, false and pornographic information filled the entire network, and the rumors of the increasingly prominent adverse effects. Furthermore, the emergence of some rumors has seriously affected people’s access to network information and normal use. How to detect all kinds of rumors in the network, and to find out the source of the rumor and the way of communication has become an urgent problem for public security network department to be solved. Micro-blog rumors, for example, according to the existing longest common subsequence(LCS) algorithm, the corresponding improvement is made in the construction of sequence table. The preliminary experiments show that the improved algorithm can be more efficient to compare and originate micro-blog rumors, so as to help public security network department to find the source of micro-blog rumors.

Key words: micro-blog rumors, origin, LCS algorithm, dynamic programming

中图分类号: