Computer and Modernization

Previous Articles     Next Articles

Automatic Text Summarization Algorithm Based on Sentence Weight and Chapter Structure

  

  1. 1. Hunan Testing Institute of Product and Commodity Supervision, Changsha 410007, China; 
    2.College of Mathematics and Computer Science, Hunan Normal University, Changsha 410081, China; 
    3. Key Laboratory of High Performance Computing and Stochastic Information Processing, 
    Ministry of Education of China, Changsha 410081, China
  • Received:2015-10-15 Online:2015-12-23 Published:2015-12-30

Abstract: To improve the accuracy of automatic text summarization can help people to obtain the valuable information simpler and more efficient. According to the structural characteristics of government documents, this paper proposed an automatic summarization algorithm based on sentence weight and chapter structure. First, from the accurate statistics information of sentences and words in the document, the article content and a basic understanding of textual structure can be obtained. Then through the calculation of words’ weight and sentences’ weight, sentences can be sorted. According to the size of the summarization, the candidate summary sentences can be chosen. Finally, after doing some postprocessing, the final sentences of the text summarization can be output. The results of experiment show that, compared with the similar algorithm, the accuracy rate and the recall rate in our algorithm are improved a lot.

Key words:  , government documents; automatic text summarization; word weight; sentence weight; chapter structure

CLC Number: