Computer and Modernization


Data Analysis Method of Campus Profile Based on University Website

  

  1. (College of Information, Mechanical and Electronic Engineering, Shanghai Normal University, Shanghai 200234, China)
  • Received: 2018-01-16 Online: 2018-09-11 Published: 2018-09-11

Abstract: This paper mines and analyzes data from official university websites and proposes a phrase similarity calculation method that combines phrase tree structure with the CilinSimHash algorithm. The method first converts each phrase into a tree with numbers as the root node and computes a structure-based similarity. It then combines Tongyici Cilin with the SimHash algorithm to compute a second similarity, referred to as the CilinSimHash similarity. Finally, the structure-based similarity and the CilinSimHash similarity are weighted to give the overall phrase similarity. The method is applied to the analysis of official university website data; cluster analysis of these data is then studied, and a relationship between the website data and college evaluation indices is established. Using the structured data extracted from the official websites of colleges and universities, a clustering algorithm is applied to the relevant index data, and the results show unbalanced development among higher education institutions at different educational levels.
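
The weighting step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the 64-bit MD5-based fingerprint, the dictionary `cilin` standing in for Tongyici Cilin category codes, the function names, and the weight `alpha` are all assumptions made for illustration, and the structure-based similarity is taken as a precomputed input because the abstract does not specify how the phrase tree is compared.

```python
import hashlib

BITS = 64  # fingerprint width (an assumption; the abstract gives no value)


def _hash64(token: str) -> int:
    """Map a token to a 64-bit integer using the first 8 bytes of its MD5 digest."""
    digest = hashlib.md5(token.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def simhash(features):
    """Standard SimHash: per-bit majority vote over the hashes of all features."""
    counts = [0] * BITS
    for feat in features:
        h = _hash64(feat)
        for i in range(BITS):
            counts[i] += 1 if (h >> i) & 1 else -1
    fingerprint = 0
    for i, c in enumerate(counts):
        if c > 0:
            fingerprint |= 1 << i
    return fingerprint


def simhash_similarity(features_a, features_b):
    """Similarity in [0, 1] derived from the Hamming distance of two fingerprints."""
    distance = bin(simhash(features_a) ^ simhash(features_b)).count("1")
    return 1.0 - distance / BITS


def to_codes(words, cilin):
    """Replace each word by its (hypothetical) Cilin code; unknown words stay as-is."""
    return [cilin.get(w, w) for w in words]


def combined_similarity(struct_sim, words_a, words_b, cilin, alpha=0.5):
    """Weighted sum of a given structure-based similarity and the Cilin+SimHash one."""
    cilin_sim = simhash_similarity(to_codes(words_a, cilin), to_codes(words_b, cilin))
    return alpha * struct_sim + (1 - alpha) * cilin_sim
```

In this sketch, words that share a code in `cilin` hash to the same feature, so synonymous phrases receive identical fingerprints; how the paper actually derives features from Tongyici Cilin and how it chooses the weight are not stated in the abstract.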

Key words: university official website, phrase similarity, SimHash, college evaluation index
