Computer and Modernization, 2010, Vol. 1, Issue (11): 60-62. doi: 10.3969/j.issn.1006-2475.2010.11.017

• Management Information Systems •

Design and Research on Web Data Mining Model Based on XML

ZHOU Xin, DENG Rong   

  1. College of Software, Jiangxi Normal University, Nanchang 330022, China
  • Received: 2010-06-23 Revised: 1900-01-01 Online: 2010-11-25 Published: 2010-11-25

Abstract: Because the Internet holds a vast amount of information resources, Web mining has become a hot topic in data mining. This paper briefly introduces Web data mining technology and compares HTML with XML. Drawing on the advantages of XML, it then proposes an XML-based data mining model and describes the model's characteristics and uses in detail.

Key words: Web data mining, XML, information extraction, Web crawler
