Computer and Modernization ›› 2022, Vol. 0 ›› Issue (02): 33-37.

• Algorithm Design and Analysis •

Marine Group Target Mining Based on Improved FP-growth

  

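  1. (North China Institute of Computing Technology, Beijing 100083, China)
  • Publication date: 2022-03-31 Release date: 2022-03-31
  • About the authors: YUE Jiancheng (b. 1997), male, from Xinyang, Henan, master's student, research interests: big data, data mining, E-mail: jiancheng952@163.com; WANG Yumei (b. 1962), female, professor-level senior engineer, research interests: artificial intelligence, big data analysis, E-mail: wym_email@263.net; WU Yafei (b. 1977), male, from Xinxiang, Henan, senior engineer, bachelor's degree, research interests: software engineering, command automation system design, E-mail: yfnow@sina.com; ZANG Yihua (b. 1980), female, from Fushun, Liaoning, senior engineer, master's degree, research interests: software engineering, computer graphics application systems, E-mail: zangyihua@126.com.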

Marine Group Target Mining Based on Improved FP-growth

  1. (North China Institute of Computing Technology, Beijing 100083, China)
  • Online:2022-03-31 Published:2022-03-31

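Abstract: The state of maritime targets is complex and constantly changing, so group information about ships at sea must be mined quickly in order to grasp the maritime target situation. This paper applies an improved FP-growth algorithm to mine ship data, partitioning the target area with a spatio-temporal segmentation method and mining frequent itemsets. First, the raw data are cleaned to obtain valid data; second, ship trajectories are processed by linear interpolation to simplify subsequent computation; then the FP-growth algorithm is used to build the FP-tree; finally, frequent itemsets are obtained and ship group information is mined. To address the low lookup efficiency of association analysis based on itemset partitioning, the paper mines frequent itemsets by splitting the database with a Hash table and exchanging nodes, and compares the algorithms' efficiency in terms of memory usage and running time. Validation on an AIS dataset, mining target group information under given confidence and support thresholds, confirms the high efficiency of the improved algorithm.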

Key words: FP-growth algorithm; marine group targets; spatio-temporal data; Hash table; node exchange

Abstract: The state of marine targets is complex and changeable, so the group information of ships at sea must be mined quickly to provide data support for understanding the marine target situation. This paper uses an improved FP-growth algorithm to mine marine ship data, and uses a spatio-temporal segmentation method to divide the target area and mine frequent itemsets. First, the original data are cleaned to obtain valid data; second, linear interpolation is applied to the ship trajectories to simplify subsequent calculation; then, the FP-growth algorithm is used to build the FP-tree; finally, the frequent itemsets are obtained and the group information of marine ships is mined. To address the low search efficiency of association analysis based on itemset partitioning, this paper splits the database with a Hash table and uses node exchange to mine frequent itemsets, and compares the efficiency of the algorithms in terms of memory consumption and time consumption. The approach is tested on an AIS data set: target group information is mined under given confidence and support thresholds, which verifies the efficiency of the improved algorithm.

Key words: FP-growth algorithm, marine group targets, spatio-temporal data, Hash table, node exchange
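
The pipeline summarized in the abstract (linear interpolation of trajectories, spatio-temporal segmentation, FP-tree construction, frequent itemset mining) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no details of the Hash-table database split or the node exchange, so the code below uses the standard FP-growth algorithm instead, and it omits data cleaning. All function names, the `(t, lon, lat)` track layout, and the `step`, `cell`, and `min_count` parameters are assumptions made for illustration.

```python
from collections import defaultdict


def interpolate(track, step):
    """Linearly resample a track [(t, lon, lat), ...] every `step` time units.

    Assumes at least two points with strictly increasing timestamps.
    """
    out, i, t = [], 0, track[0][0]
    while t <= track[-1][0]:
        while track[i + 1][0] < t:          # advance to the segment containing t
            i += 1
        (t0, x0, y0), (t1, x1, y1) = track[i], track[i + 1]
        w = (t - t0) / (t1 - t0)
        out.append((t, x0 + w * (x1 - x0), y0 + w * (y1 - y0)))
        t += step
    return out


def build_transactions(tracks, step, cell):
    """One transaction per (time slot, grid cell): the set of ships present."""
    buckets = defaultdict(set)
    for ship, track in tracks.items():
        for t, lon, lat in interpolate(track, step):
            buckets[(t, int(lon // cell), int(lat // cell))].add(ship)
    return [ships for ships in buckets.values() if len(ships) > 1]


class Node:
    """FP-tree node; the root has item None."""
    def __init__(self, item, parent):
        self.item, self.parent, self.count, self.children = item, parent, 0, {}


def fp_growth(transactions, min_count, suffix=()):
    """Standard FP-growth. `transactions` is a list of (items, count) pairs.

    Yields (sorted itemset tuple, support count).
    """
    counts = defaultdict(int)
    for items, n in transactions:
        for item in set(items):
            counts[item] += n
    frequent = {i: c for i, c in counts.items() if c >= min_count}

    root, links = Node(None, None), defaultdict(list)
    for items, n in transactions:
        # Insert frequent items in descending support order (ties by name).
        ordered = sorted((i for i in set(items) if i in frequent),
                         key=lambda i: (-frequent[i], i))
        node = root
        for item in ordered:
            child = node.children.get(item)
            if child is None:
                child = node.children[item] = Node(item, node)
                links[item].append(child)
            child.count += n
            node = child

    for item, nodes in links.items():
        itemset = suffix + (item,)
        yield tuple(sorted(itemset)), frequent[item]
        base = []                            # conditional pattern base of `item`
        for node in nodes:
            path, p = [], node.parent
            while p.item is not None:
                path.append(p.item)
                p = p.parent
            if path:
                base.append((path, node.count))
        yield from fp_growth(base, min_count, itemset)


def mine_groups(tracks, step, cell, min_count):
    """Return {ship group: number of (time, cell) slots shared}."""
    transactions = [(sorted(s), 1) for s in build_transactions(tracks, step, cell)]
    return {g: n for g, n in fp_growth(transactions, min_count) if len(g) > 1}


# Hypothetical toy data: A and B sail together, D crosses them once, C is far away.
tracks = {
    "A": [(0, 10.2, 20.3), (20, 10.6, 20.3)],
    "B": [(0, 10.3, 20.4), (20, 10.7, 20.4)],
    "C": [(0, 30.5, 40.5), (20, 30.5, 40.9)],
    "D": [(0, 10.5, 20.5), (20, 12.5, 20.5)],
}
groups = mine_groups(tracks, step=10, cell=1.0, min_count=2)
```

In this toy run, ships sharing a grid cell in at least `min_count` time slots are reported as a group. A real AIS workload would need a cell size and time step chosen for the sea area in question, and ships that straddle a cell boundary would be missed by this simple fixed grid.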