计算机与现代化

• 数据挖掘 • 上一篇    下一篇

情报分析中提取主题信息核心要素的模型及方法

  

  1. (韩城市司马迁图书馆,陕西韩城715400)
  • 收稿日期:2018-03-14 出版日期:2018-10-26 发布日期:2018-10-26
  • 作者简介:田丽(1975-),女,陕西合阳人,韩城市司马迁图书馆图书资料馆员,大专毕业,研究方向:情报分析,地方文献研究,图书管理。

Model and Approach for Extracting Key Elements #br# From Topic Information in Intelligence Analysis

  1. (Hancheng City Sima Qian Library, Hancheng 715400, China)
  • Received:2018-03-14 Online:2018-10-26 Published:2018-10-26

摘要: 在分析主题信息基本组成特征的基础上,根据统计学原理,结合向量空间与随机事件空间的构造理论,建立情报主题空间的数学模型;给出基于主题信息提取分量的检索策略以及最大近似主题空间的设计原则。提出一种提取主题信息关键词的具体方法。该方法由3个环节9个步骤组成,可确保抽样的广泛性和代表性,能提取主题信息中获得最大共识的核心要素。最后给出从科研选题的主题信息中挖掘选题关键要素的实例。

关键词: 数据挖掘, 情报分析, 主题信息, 数学模型, 关键词

Abstract: Based on analysis on the characteristics of the topic information and according to principles of statistics, the paper puts forwards a mathematical model for space of intelligence topic information by means of the constructive theories of vector space and random event space; strategy of retrieving components from topic information is proposed and design rule of constructing a maximum approximate space of topic information is given. The paper also raises a concrete method to extract the keywords from topic information. The method consists of 3 parts with 9 steps, can extract the keywords with maximal common views in topic information that is built from samples of with universality and representativeness. In the end, the paper presents an example that mines the key elements for a topic of scientific research from the topic information of choosing a scientific subject.

Key words: data mining, intelligence analysis, topic information, mathematical model, keywords

中图分类号: