计算机与现代化

• 算法设计与分析 • 上一篇    下一篇

基于k-means和决策树的混合入侵检测算法

  

  1. (南京国电南自电网自动化有限公司,江苏南京211100)
  • 收稿日期:2017-07-13 出版日期:2017-12-25 发布日期:2017-12-26
  • 作者简介:李鹏(1977-),男,山东烟台人,南京国电南自电网自动化有限公司工程师,硕士,研究方向:配电网,能源互联网,电力大数据处理; 周文欢(1988-),男,江苏南京人,工程师,硕士,研究方向:数据库技术,数据处理。

Mixed Intrusion Detection Algorithm Based on k-means and Decision Tree

  1. (Nanjing SAC Automation Co. Ltd., Nanjing 211100, China)
  • Received:2017-07-13 Online:2017-12-25 Published:2017-12-26

摘要: 随着网络复杂度的增加,传统的入侵检测方法已经无法满足日益增长的安全需求。采用大数据的挖掘算法提高入侵检测的检测率是当前研究的热点。为此,本文提出一种基于k-means和决策树算法的混合入侵检测算法(KDI)。该算法首先对数据预处理的离散化方法进行改进,获取高质量样本数据,并根据现实中易出现类别信息增益比差异小的特点,利用k-means算法根据增益比差异将样本数据先分类再建立决策树,提升了算法的检测率。实验结果表明KDI算法能够有效地检测网络数据中隐含的已知和未知的入侵行为。

关键词: k-means, 决策树, 入侵检测, 数据离散化

Abstract: With the growth of the network complexity, the traditional intrusion detection methods have been unable to meet the high-level security requirements. How to use data mining algorithm to improve accuracy rate of intrusion detection is a hot spot in current research. For this purpose, a hybrid intrusion detection algorithm based on k-means and decision tree algorithm (KDI) is proposed. Firstly, an improvement on data discretization method is advanced, in order to obtain high quality sample data, and then the k-mean algorithm is utilized to classify the sample data based on the feature of slight difference between information divergence ratio in many real situations, subsequently, the decision trees is constructed, therefore, the detection rate is enhanced. The experimental results show that the KDI algorithm can effectively detect both known and unknown intrusion behaviors sealed in network data.

Key words: k-means, decision tree, intrusion detection, data discretization

中图分类号: