Computer and Modernization

Previous Articles    

Simulation of Web Crawler Detection Algorithm Based on Hidden Markov Model

  

  1. College of Fenghuo, Wuhan Research Institute of Posts and Telecommunications, Wuhan 430074, China
  • Received:2016-08-30 Online:2017-04-20 Published:2017-05-08

Abstract: In the construction and maintenance process of the website, in order to improve server efficiency, strengthen security and confidentiality, developers need to distinguish between human users and Web crawlers. However, some inappropriate or malicious designs make it difficult to detect crawlers. These crawlers not only increase the burden on the site, but also endanger the security of network. In order to solve the problem that it is difficult to detect crawlers, a detection algorithm based on behavior pattern is proposed, which uses hidden Markov model to describe the behavior patterns of different clients and uses Matlab simulation to achieve a highly accurate detection result. The simulation results show that the detection technology of hidden Markov model can detect Web crawler with high accuracy and low error rate.

Key words:  Web crawler detection, behavior pattern, hidden Markov model, network security

CLC Number: