计算机与现代化

• 人工智能 • 上一篇    下一篇

基于兴趣模型的查询扩展

  

  1. (1.装甲兵工程学院信息系,北京 100072; 2.装甲兵工程学院科研部,北京 100072)
  • 收稿日期:2014-03-05 出版日期:2014-07-16 发布日期:2014-07-17
  • 作者简介:田永昌(1989-),男,湖北汉川人,装甲兵工程学院信息系硕士研究生,研究方向:自然语言处理; 李颖(1964-),男,江西吉安人,装甲兵工程学院科研部副教授,博士,研究方向:自然语言处理,人工智能。

Query Expansion Based on Interest Model

  1. (1. Department of Information Engineering, Academy of Armored Force Engineering, Beijing 100072, China;
    2. Department of Scientific Research, Academy of Armored Force Engineering, Beijing 100072, China)
  • Received:2014-03-05 Online:2014-07-16 Published:2014-07-17

摘要: 针对信息检索中查询与文档集之间可能存在的“词不匹配”问题,基于兴趣模型提出一种将概念化的兴趣知识与向量空间模型相结合的查询扩展方法。该方法能根据阈值来判断查询扩展是否可行。用户的兴趣偏好是通过Agent代理实时获取的,兴趣知识采用HNC(Hierarchical Network of Concepts, 概念层次网络)理论的概念符号体系表达,这样便于计算概念之间的相似度。实验结果表明,经过查询扩展后的结果相对于未加入查询扩展的结果在性能上提高了29.1%。

关键词: 信息检索, 查询扩展, 兴趣模型, 概念相似度, HNC

Abstract: To solve the problem that words mismatch between the retrieval and the documents in the query information, this paper proposes a query expansion method based on interest model, and it is combined with conceptual vector space model, and can judge whether it is feasible to query expansion according to the threshold value. In this method, user’s interest is mined by Agent in real time, and the intellectual interests are expressed by the HNC (hierarchical network of concepts). As a result, it is very convenient to calculate the similarity between concepts. Experimental result shows that the performance has 29.1% improvement by using the query expansion compared to the non-query expansion.

Key words: information retrieval, query expansion, interest model;concept similarity, HNC

中图分类号: