计算机与现代化 ›› 2010, Vol. 1 ›› Issue (01): 26-28.doi: 10.3969/j.issn.1006-2475.2010.01.008

• 算法分析与设计 • 上一篇    下一篇

基于回归模型的数据挖掘研究

孟晓东,袁道华,施惠丰   

  1. 四川大学计算机学院,四川 成都 610065
  • 收稿日期:2009-11-10 修回日期:1900-01-01 出版日期:2010-01-15 发布日期:2010-01-15

Research on Regress-base System on Data Mining

MENG Xiao-dong,YUAN Dao-hua,SHI Hui-feng   

  1. College of Computer Science, Sichuan University, Chengdu 610065, China
  • Received:2009-11-10 Revised:1900-01-01 Online:2010-01-15 Published:2010-01-15

摘要: 回归分析是数据挖掘系统中的重要方法之一,本文主要研究如何利用回归模型来进行数据挖掘建模,介绍回归模型的3种类型,基于最小二乘法的参数估计和方程的显著性校验,并提出模型的优化方案,包括离群点的检验处理,模型形式的改进和回归自变量的选取。最后根据实例分析其在数据挖掘中的应用。

关键词: 数据挖掘, 回归分析, 模型优化

Abstract: Regression analysis is one of the important methods in the data mining system. This paper mainly researches how to use the regression models for modeling data mining, introduces the regression model followed by 3 types, the parameter estimation based on the least squares, the significant validation and presents three kinds of optimization algorithm that included the test and deal of disperse extremum, model improvement and the selection of independent variable. Finally, it explains the regression analysis in data mining application through real data.

Key words: data mining, regression analysis, model optimization

中图分类号: