计算机与现代化 (Computer and Modernization) ›› 2009, Vol. 8 ›› Issue (8): 4-6. doi: 10.3969/j.issn.1006-2475.2009.08.002

• Algorithm Analysis and Design •

Research on Text Categorization Based on Machine Learning

HE Guo-hui (何国辉)1, WU Li-fa (吴礼发)2

  1. Postgraduate Team 1, Institute of Command Automation, PLA University of Science and Technology (PLAUST), Nanjing 210007, Jiangsu, China; 2. Department of Computer Science, Institute of Command Automation, PLAUST, Nanjing 210007, Jiangsu, China
  • Received: 2008-08-18 Revised: 1900-01-01 Online: 2009-08-21 Published: 2009-08-21

Abstract: Text categorization based on machine learning has been an active research topic in information retrieval in recent years and has made considerable progress. This paper describes in detail the definition of text categorization and text representation, introduces SVM and a series of other machine learning methods for text categorization along with the methods used to evaluate classifier performance, and points out directions for further research.

Key words: text categorization, vector space model, feature extraction, machine learning
