计算机与现代化 ›› 2013, Vol. 1 ›› Issue (4): 15-17.doi: 10.3969/j.issn.1006-2475.2013.04.004

• 人工智能 • 上一篇    下一篇

基于多特征的中文关键词抽取方法

黄 轩1,2,李 伟1   

  1. 1.厦门大学智能科学与技术系,福建 厦门 361005;2.漳州职业技术学院经济管理系,福建 漳州 363000
  • 收稿日期:2012-12-04 修回日期:1900-01-01 出版日期:2013-04-17 发布日期:2013-04-17

Chinese Keyword Extraction Method Based on Multi-features

HUANG Xuan1,2, LI Wei1   

  1. 1. Department of Cognitive Science, Xiamen University, Xiamen 361005, China;2. Department of Economic Management, Zhangzhou Institute of Technology, Zhangzhou 363000, China
  • Received:2012-12-04 Revised:1900-01-01 Online:2013-04-17 Published:2013-04-17

摘要: 中文关键词提取是自然语言理解中一个非常值得研究的问题,通过采用词语的TFR*IDF词性,首现位置的特征作为抽取中文文献关键词的特征,其中特征的权重采用BP神经网络训练获得,最终获取中文关键词。通过仿真实验模拟,证明该方法是有效的。

关键词: 关键词抽取, TFR*IDF, 多特征, BP神经网络

Abstract: Chinese keyword extraction is a worthy problem for study in natural language understanding. By using the TFR*IDF, part of speech, the first of the current position of features as the extraction characteristics of the Chinese document keywords extraction, while features of weight trained by the BP neural network, eventually it can get Chinese keyword. By analog simulation, it is proved that the method is effective.

Key words: keyword extraction, TFR*IDF, multi-features, BP neural network

中图分类号: