Computer and Modernization ›› 2017, Vol. 0 ›› Issue (12): 77-81+87. doi: 10.3969/j.issn.1006-2475.2017.12.015


Topic Event Extraction Technology Based on LDA Model and AP Clustering Method

  

  1. (North China Institute of Computing Technology, Beijing 100083, China)
  • Received: 2017-05-17  Online: 2017-12-25  Published: 2017-12-26

Abstract: Current event extraction techniques usually extract event information directly from the text, ignoring the text's information structure, so the results are sensitive to the distribution of words in the text. This paper analyzes the hierarchical concept structure of text and proposes a method for extracting topic event information from news, based on two-stage clustering and subdivision. The method extracts hierarchical topic-event information and, through two-stage extraction of information words, reduces the influence of information from related events, which improves extraction performance. Experiments show that the method extracts topic event information from text effectively.

Key words: topic event extraction, LDA topic model, AP clustering method, hierarchical information, two-stage extraction
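
The abstract does not spell out how the LDA and AP stages are combined, so the following is only a rough illustration of the AP (affinity propagation) clustering step named in the title, not the paper's implementation. It assumes each document is already represented as a small numeric vector (for instance, topic proportions from an LDA model). The function name `affinity_propagation`, the negative squared Euclidean similarity, the median preference, and the damping value are all assumptions made for this sketch.

```python
from statistics import median


def affinity_propagation(points, preference=None, damping=0.5, iterations=200):
    """Return, for each point, the index of the exemplar it is assigned to.

    Hypothetical helper for illustration; needs at least two points.
    """
    n = len(points)
    # Similarity: negative squared Euclidean distance (an assumed choice).
    S = [[-sum((a - b) ** 2 for a, b in zip(p, q)) for q in points]
         for p in points]
    if preference is None:
        # Common heuristic: median of the off-diagonal similarities.
        preference = median(S[i][k] for i in range(n)
                            for k in range(n) if i != k)
    for i in range(n):
        S[i][i] = preference

    R = [[0.0] * n for _ in range(n)]  # responsibilities r(i, k)
    A = [[0.0] * n for _ in range(n)]  # availabilities a(i, k)
    for _ in range(iterations):
        # r(i,k) = s(i,k) - max over k' != k of (a(i,k') + s(i,k'))
        for i in range(n):
            for k in range(n):
                best_other = max(A[i][j] + S[i][j]
                                 for j in range(n) if j != k)
                R[i][k] = (damping * R[i][k]
                           + (1 - damping) * (S[i][k] - best_other))
        # a(i,k) = min(0, r(k,k) + sum of positive r(i',k), i' not in {i,k})
        # a(k,k) = sum of positive r(i',k), i' != k
        for k in range(n):
            pos = [max(0.0, R[j][k]) for j in range(n)]
            total = sum(pos) - pos[k]
            for i in range(n):
                if i == k:
                    new = total
                else:
                    new = min(0.0, R[k][k] + total - pos[i])
                A[i][k] = damping * A[i][k] + (1 - damping) * new

    # A point is an exemplar when r(k,k) + a(k,k) is positive.
    exemplars = [k for k in range(n) if R[k][k] + A[k][k] > 0]
    if not exemplars:
        # No exemplar emerged: fall back to every point as its own cluster.
        return list(range(n))
    exemplar_set = set(exemplars)
    return [i if i in exemplar_set
            else max(exemplars, key=lambda k: S[i][k])
            for i in range(n)]


if __name__ == "__main__":
    # Toy vectors standing in for document representations (made-up data).
    docs = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    print(affinity_propagation(docs))
```

Unlike k-means, AP does not take the number of clusters as input; the `preference` value controls how readily points become exemplars, which is one plausible reason to pair it with a topic model when the number of events per topic is unknown.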
