Computer and Modernization ›› 2010, Vol. 1 ›› Issue (11): 162-164,.doi: 10.3969/j.issn.1006-2475.2010.11.046

• 应用与开发 • Previous Articles     Next Articles

A New Method for Chinese New Word Identification Based on Inner Pattern of Word

LIN Zi-fang, JIANG Xiu-feng   

  1. College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350108, China
  • Received:2010-07-01 Revised:1900-01-01 Online:2010-11-25 Published:2010-11-25

Abstract: As to new word identification problem, this paper proposes a new method for Chinese new word identification based on the inner pattern of word. After repeat finding based on suffix arrays and longest common preffix, it propses the weighting of the improved PWP and inside word probabilities in view of the inner pattern of word. At the meanwhile, the paper uses AV and MI statistics to identify Chinese new words. By comparison, find that this method is effective in recognition of Chinese new words.

Key words: inner pattern of word, new word identification, improved PWP, inside word probabilities

CLC Number: