基于k-means和决策树的混合入侵检测算法

doi:10.3969/j.issn.1006-2475.2017.12.003

计算机与现代化 ›› 2017, Vol. 0 ›› Issue (12): 12-16.doi: 10.3969/j.issn.1006-2475.2017.12.003

基于k-means和决策树的混合入侵检测算法

（南京国电南自电网自动化有限公司，江苏南京211100）

收稿日期:2017-07-13 出版日期:2017-12-25 发布日期:2017-12-26
作者简介:李鹏(1977-),男,山东烟台人,南京国电南自电网自动化有限公司工程师,硕士,研究方向：配电网，能源互联网，电力大数据处理；周文欢(1988-),男,江苏南京人，工程师,硕士,研究方向：数据库技术，数据处理。

Mixed Intrusion Detection Algorithm Based on k-means and Decision Tree

（Nanjing SAC Automation Co. Ltd., Nanjing 211100, China）

Received:2017-07-13 Online:2017-12-25 Published:2017-12-26

摘要/Abstract

摘要： 随着网络复杂度的增加，传统的入侵检测方法已经无法满足日益增长的安全需求。采用大数据的挖掘算法提高入侵检测的检测率是当前研究的热点。为此，本文提出一种基于k-means和决策树算法的混合入侵检测算法(KDI)。该算法首先对数据预处理的离散化方法进行改进，获取高质量样本数据，并根据现实中易出现类别信息增益比差异小的特点，利用k-means算法根据增益比差异将样本数据先分类再建立决策树，提升了算法的检测率。实验结果表明KDI算法能够有效地检测网络数据中隐含的已知和未知的入侵行为。

关键词: k-means, 决策树, 入侵检测, 数据离散化

Abstract: With the growth of the network complexity, the traditional intrusion detection methods have been unable to meet the high-level security requirements. How to use data mining algorithm to improve accuracy rate of intrusion detection is a hot spot in current research. For this purpose, a hybrid intrusion detection algorithm based on k-means and decision tree algorithm (KDI) is proposed. Firstly, an improvement on data discretization method is advanced, in order to obtain high quality sample data, and then the k-mean algorithm is utilized to classify the sample data based on the feature of slight difference between information divergence ratio in many real situations, subsequently, the decision trees is constructed, therefore, the detection rate is enhanced. The experimental results show that the KDI algorithm can effectively detect both known and unknown intrusion behaviors sealed in network data.

Key words: k-means, decision tree, intrusion detection, data discretization

中图分类号:

TP391.4

李鹏，周文欢. 基于k-means和决策树的混合入侵检测算法[J]. 计算机与现代化, 2017, 0(12): 12-16.

LI Peng， ZHOU Wen-huan. Mixed Intrusion Detection Algorithm Based on k-means and Decision Tree[J]. Computer and Modernization, 2017, 0(12): 12-16.

参考文献

［1］蒋建春,马恒太,任党恩,等. 网络安全入侵检测: 研究综述［J］. 软件学报, 2000,11(11):1460-1466. ［2］卿斯汉,蒋建春,马恒太,等. 入侵检测技术研究综述［J］. 通信学报, 2004,25(7):19-29. ［3］李云婷,夏仲平,熊婧. 入侵检测系统的多层次混合评价方法研究［J］. 计算机科学, 2015,42(s1)：425-428. ［4］华铭轩,张峰军. 大数据环境下的入侵检测系统框架［J］. 通信技术, 2015,48(11):1300-1304. ［5］夏秦,王志文,卢柯. 入侵检测系统利用信息熵检测网络攻击的方法［J］. 西安交通大学学报, 2013,47(2):14-19. ［6］龚良强,殷小虹. 基于协议分析的入侵检测系统的设计［J］. 信息通信, 2014(6):90. ［7］马志远,曹宝香. 改进的决策树算法在入侵检测中的应用［J］. 计算机技术与发展, 2014,24(1):151-154. ［8］膝少华，严远驰，刘冬宁，等. 基于FCM-C4.5的双过滤入侵检测机制［J］. 计算机应用与软件， 2016,33(1):307-311. ［9］Pfahringer B. Winning the KDD99 classification cup: Bagged boosting［J］. ACM SIGKDD Explorations Newsletter, 2000,1(2):65-66. ［10］占善华,张巍,滕少华. 基于核表示的协同入侵检测方法［J］. 计算机工程与设计, 2013,34(7):2310-2314. ［11］周静,赵鲁阳,罗炬锋. 基于时域特征提取的围栏入侵模式分类方法［J］. 计算机工程与应用, 2016(12):1-8. ［12］任晓芳,赵德群,秦健勇. 基于随机森林和加权K均值聚类的网络入侵检测系统［J］. 微型电脑应用, 2016,32(7):21-24. ［13］Zhang Jiong, Zulkernine M. Anomaly based network intrusion detection with unsupervised outlier detection［C］// 2006 IEEE International Conference on Communications. 2006:2388-2393. ［14］Pan Zhi-song, Chen Song-can, Hu Gen-bao, et al. Hybrid neural network and C4.5 for misuse detection［C］// Proceedings of the 2nd IEEE International Conference on Machine Learning and Cybernetics. 2003:2463-2467. ［15］Peddabachigari S, Abraham A, Grosan C, et al. Modeling intrusion detection system using hybrid intelligent systems［J］. Journal of Network and Computer Applications, 2007,30(1):114-132. ［16］Xiang C, Yong P C, Meng L S. Design of multiple-level hybrid classifier for intrusion detection system using Bayesian clustering and decision trees［J］. Pattern Recognition Letters, 2008,29(7):918-924. ［17］贺跃,郑建军,朱蕾. 一种基于熵的连续属性离散化算法［J］. 计算机应用, 2005,25(3):637-638. ［18］Fayyad U M, Irani K B. On the handling of continuous-valued attributes in decision tree generation［J］. Machine Learning, 1992,8(1):87-102. ［19］陈臣，周炎涛. 基于改进信息熵离散化算法的研究［DB/OL］. http://www.paper.edu.cn, 2011-06-11.

[1]	王涛1, 2, 黄丹1, 2, 刘禅奕1, 2, 朱桃1, 2. 基于YOLOv5s的无人机图像车辆检测[J]. 计算机与现代化, 2024, 0(08): 108-113.
[2]	秦阳, 詹勇, 明路遥, 杨舒淇, 蓝振祎. 基于改进K-means算法的通勤交通小区识别[J]. 计算机与现代化, 2024, 0(07): 63-68.
[3]	苏凯旋. 基于改进XGBoost模型的网络入侵检测研究[J]. 计算机与现代化, 2024, 0(06): 109-114.
[4]	孟雅蕾1, 师红宇1, 王予2. 一种无阻流量预测方法[J]. 计算机与现代化, 2024, 0(04): 33-37.
[5]	韩雪. 基于约束聚类和粒子群算法的多路径规划[J]. 计算机与现代化, 2023, 0(08): 7-11.
[6]	王艺成, 张国良, 张自杰, . 基于改进YOLOv5的小目标检测方法[J]. 计算机与现代化, 2023, 0(05): 100-105.
[7]	潘裕庆, 张苏宁, 冯仁君, 景栋盛. 结合粒子群优化和LightGBM的入侵检测方法[J]. 计算机与现代化, 2023, 0(04): 123-126.
[8]	彭露露, 朱媛媛, 金文倩, 王笑梅. 基于改进YOLOv4的汽车钢铁零件表面缺陷检测[J]. 计算机与现代化, 2022, 0(09): 32-39.
[9]	申智, 徐丽, 符祥远. 基于改进YOLO v4光线模糊场景下交通标志检测[J]. 计算机与现代化, 2022, 0(07): 27-32.
[10]	饶海兵, 朱苏磊, 杨春夏. 基于空时特征融合和注意力机制的网络入侵检测模型[J]. 计算机与现代化, 2022, 0(06): 116-121.
[11]	梁正友, 王璐, 李轩昂, 杨锋, . 基于K-means++的多视图点云配准技术[J]. 计算机与现代化, 2022, 0(02): 97-101.
[12]	肖宏宇, 曾文驱, 王淑营. 基于模型特征匹配的BIM模型混合推荐算法[J]. 计算机与现代化, 2022, 0(01): 28-32.
[13]	金鑫, 曾思轲, 刘阳, 武楚涵. 基于改进YOLOv4的口罩佩戴检测算法[J]. 计算机与现代化, 2022, 0(01): 85-90.
[14]	吴水明, 吉志远, 王震宇, 景栋盛. 基于Dueling-DDQN的电力信息网络入侵检测算法[J]. 计算机与现代化, 2021, 0(12): 43-47.
[15]	庄丽丽, 石鸿雁. 基于改进布谷鸟搜索的k-means算法的离群点检测[J]. 计算机与现代化, 2021, 0(10): 15-22.

基于k-means和决策树的混合入侵检测算法

Mixed Intrusion Detection Algorithm Based on k-means and Decision Tree

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价