Computer and Modernization

Previous Articles     Next Articles

Optimizing Web Page Classification Algorithm by Using Hyperlinks

  

  1. College of Computer, Beijing University of Technology, Beijing 100124, China
  • Received:2014-03-28 Online:2014-05-28 Published:2014-05-30

Abstract: There is a problem in the Web page classification algorithm by using hyperlinks, the noise neighbors interfere with the results of the classification. To solve the problem an optimization method was presented, which utilizes the similarities between pages. If neighbors meet the thresholds, they are set different weights for different relationships. The results of classification by support vector machine are also used. Experiment shows that it increases in precision, recall and F1 value.

Key words:  Web page classification, neighboring page, similarity, support vector machine

CLC Number: