Computer and Modernization


Hail Information Extraction Based on Sina Weibo

  

  1. (School of Electrical Engineering and Automation, Tianjin University, Tianjin 300072, China)
  • Received: 2015-10-23   Online: 2016-03-17   Published: 2016-03-17

Abstract: To obtain accurate hail information more easily and quickly, a three-level identification scheme is designed. The first level uses Web crawler technology to identify microblogs containing “hail”; the second level uses classifiers to identify hail events; the third level uses rules to identify hail element information. To improve the identification performance for hail events, an assessment function for feature extraction is added, and a method that uses multiple assessment functions to determine the feature vectors is proposed. A scheme that combines three classifiers is then given. Test results show that the presented method achieves a hail event extraction rate of 89.5% with a mistaken identification rate below 13.4%, and a hail element information extraction rate above 96.0% with a mistaken identification rate below 8.6%.
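
The three-level flow described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the three classifiers are combined, which classifiers they are, or what the extraction rules look like, so everything below is an assumption made for illustration: majority voting stands in for the combination scheme, the toy classifiers are simple string checks, and `SIZE_RULE` is a hypothetical rule that extracts only a hailstone size. The crawler itself is omitted, and level 1 is reduced to a keyword check on posts that have already been fetched.

```python
import re
from typing import Callable, Dict, List, Optional

# A classifier here is any function mapping a post to True (hail event) or False.
Classifier = Callable[[str], bool]

# Hypothetical level-3 rule: a number followed by a length unit, e.g. "2 cm".
SIZE_RULE = re.compile(r"(\d+(?:\.\d+)?)\s*(mm|cm)")


def contains_keyword(post: str, keyword: str = "hail") -> bool:
    """Level 1: keep only posts that mention the keyword."""
    return keyword in post.lower()


def majority_vote(post: str, classifiers: List[Classifier]) -> bool:
    """Level 2 (assumed scheme): accept a post if most classifiers say yes."""
    votes = sum(1 for clf in classifiers if clf(post))
    return votes * 2 > len(classifiers)


def extract_elements(post: str) -> Dict[str, Optional[str]]:
    """Level 3: apply hand-written rules to pull out element information."""
    match = SIZE_RULE.search(post)
    return {"size": match.group(0) if match else None}


def pipeline(posts: List[str], classifiers: List[Classifier]) -> List[Dict[str, Optional[str]]]:
    """Run the three levels in order; a post must pass each level to reach the next."""
    results = []
    for post in posts:
        if not contains_keyword(post):
            continue
        if not majority_vote(post, classifiers):
            continue
        results.append(extract_elements(post))
    return results


# Toy stand-ins for trained classifiers (illustrative only).
toy_classifiers: List[Classifier] = [
    lambda p: "cm" in p or "mm" in p,
    lambda p: "now" in p.lower(),
    lambda p: "song" not in p.lower(),
]

posts = [
    "Hail the size of 2 cm is falling right now in Tianjin",
    "All hail the new song by my favourite band",
    "Sunny day, no clouds at all",
]

print(pipeline(posts, toy_classifiers))
```

In this sketch the second post passes the keyword check but is rejected by the vote, which is the kind of case the second identification level is meant to filter out. Real classifiers would operate on feature vectors selected by the assessment functions mentioned in the abstract, not on raw string checks.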

Key words: microblog, hail information, feature extraction, text classification, text elements recognition, Web crawler
