计算机与现代化 ›› 2011, Vol. 1 ›› Issue (8): 9-12,1.doi: 10.3969/j.issn.1006-2475.2011.08.003

• 人工智能 • 上一篇    下一篇

面向问题分类的汉语框架网特征选择

王文晶1,宋小香2,李 茹2   

  1. 1.山西大学商务学院信息学院,山西 太原 030031; 2.山西大学计算机与信息技术学院,山西 太原 030006
  • 收稿日期:2011-05-09 修回日期:1900-01-01 出版日期:2011-08-10 发布日期:2011-08-10

Question Classification-oriented Chinese FrameNet Feature Selection

WANG Wen-jing1, SONG Xiao-xiang2, LI Ru2   

  1. 1. College of Information, Business College of Shanxi University, Taiyuan 030031, China;2. School of Computer & Information Technology, Shanxi University, Taiyuan 030006, China
  • Received:2011-05-09 Revised:1900-01-01 Online:2011-08-10 Published:2011-08-10

摘要: 特征选择是影响问答系统中问题分类的重要因素。本文充分利用汉语框架网在语义表达方面的特点,提出一种面向问题分类的强类别信息词(SCIW)特征选择方法。首先选择五种汉语框架网特征作为候选特征,然后采用SCIW特征选择方法,根据每一类别的分类精度对单个特征的分类能力进行排序,并通过特征组合实验,选出具有最好分类效果的组合特征,达到特征约简的效果。

关键词: 汉语框架网, 问题分类, 特征选择

Abstract: Feature selection is the important factor which affects the question classification of question answering system. By fully using the characteristics of Chinese FrameNet in terms of semantic expression, this paper presents a new question classification-oriented approach in feature selection called strong class information words (SCIW). Firstly, it selects five kinds of Chinese FrameNet features as candidate features, and then uses SCIW to select features. According to each category’s classification precision of features, it sorts the classification ability of each single feature. Through the experiment of combinations of features, it selects the combination of features, which has better classification results. So the feature reduction can be reached.

Key words: Chinese FrameNet, question classification, feature selection

中图分类号: