Computer and Modernization ›› 2010, Vol. 1 ›› Issue (3): 141-3.doi: 10.3969/j.issn.1006-2475.2010.03.040

• 中文信息技术 • Previous Articles     Next Articles

Application Study of Automatic Text Classification Combined with Language Model

ZHAO Min-ya   

  1. Department of Computer Engineering, Suzhou Vocation University, Suzhou 215104, China
  • Received:2009-02-19 Revised:1900-01-01 Online:2010-03-20 Published:2010-03-20

Abstract:

This paper studies the application of bigram model from statistical language model in the automatic text classification. Referring to the shortcoming of the hypothesis that the terms are independent from each other in VSM (Vector Space Model), it puts forward a method to improve the result of text classification with mutual words’ information and sequence. The experiment shows that the method is feasible and efficient.

Key words: statistical language model, text classification, smoothing, bigram