Computer and Modernization

Previous Articles     Next Articles

Intrusion Detection Data Classification by Distributed Computing

  

  1. 1. School of Computer & Information Engineering, Changzhou Institute of Technology, Changzhou 213002, China;

    2. School of Economics and Management, Changzhou Institute of Technology, Changzhou 213002, China
  • Received:2015-08-05 Online:2015-12-23 Published:2015-12-30

Abstract: To handle huge amounts of network data effectively which is increasing rapidly, Naive Bayesian parallel algorithm and Logistic Regression parallel algorithm were used to analyze the intrusion detection big data based on Hadoop which is a cloud computing system. The intrusion detection data was computed in the model of pseudodistribution model and distribution model. The experimental results show that the classification accuracy of the two algorithms can exceed 90% and Logistic Regression algorithm spent more time than Naive Bayesian algorithm. Naive Bayesian algorithm can reduce run time effectively in Hadoop cluster. So Naive Bayesian algorithm is more effectively than Logistic Regression algorithm with the classification accuracy and the algorithm running time considered. Naive Bayesian algorithm can analyze the intrusion detection big data.

Key words: intrusion detection, Naive Bayesian, Logistic Regression

CLC Number: