基于XML的Web数据挖掘模型设计与研究

doi:10.3969/j.issn.1006-2475.2010.11.017

计算机与现代化 ›› 2010, Vol. 1 ›› Issue (11): 60-62.doi: 10.3969/j.issn.1006-2475.2010.11.017

基于XML的Web数据挖掘模型设计与研究

周炘，邓蓉

江西师范大学软件学院，江西南昌 330022

收稿日期:2010-06-23 修回日期:1900-01-01 出版日期:2010-11-25 发布日期:2010-11-25

Design and Research on Web Data Mining Model Based on XML

ZHOU Xin, DENG Rong

College of Software, Jiangxi Normal University, Nanchang 330022, China

Received:2010-06-23 Revised:1900-01-01 Online:2010-11-25 Published:2010-11-25

摘要/Abstract

摘要： 由于互联网上存在大量的信息资源，Web挖掘已成为数据挖掘的热点。本文介绍Web数据挖掘技术，比较HTML和XML的不同，充分利用XML的优越性，提出一种基于XML的数据挖掘模型，并详细论述该模型的特点及用途。

关键词: Web数据挖掘, XML, 信息提取, 网络爬虫

Abstract: Because there are lots of information resources in the Internet, Web mining has become a hotspot in data mining. This paper introduces the technology of Web data mining simply, and compares with HTML and XML. Finally, the paper makes full use of the superiority of XML to propose a data mining model based on XML, and describes the characteristic and purpose of the mining model in detail.

Key words: Web data mining, XML, information extraction, Web crawler

中图分类号:

TP391

周炘;邓蓉. 基于XML的Web数据挖掘模型设计与研究[J]. 计算机与现代化, 2010, 1(11): 60-62.

ZHOU Xin;DENG Rong. Design and Research on Web Data Mining Model Based on XML[J]. Computer and Modernization, 2010, 1(11): 60-62.

[1]	邱金水, 庄会富, 金涛. 面向海量植物图像的智能检索系统设计[J]. 计算机与现代化, 2022, 0(10): 62-67.
[2]	樊海玮, 秦佳杰, 孙欢, 张丽苗, 鲁芯丝雨. 基于BERT与BiGRU-CRF的交通事故文本信息提取模型[J]. 计算机与现代化, 2022, 0(05): 10-15.
[3]	魏东平，罗丹. 一种基于区间预留编码的XML关键字查询算法[J]. 计算机与现代化, 2019, 0(10): 17-.
[4]	李盼1，李宜广2，徐春1. 基于关键节点的网络热点信息抽取[J]. 计算机与现代化, 2019, 0(09): 60-.
[5]	王亮亮1,2，闫威3，张佳伟1. 新疆昆仑卫星数字专用频道的汉维哈XML解析器[J]. 计算机与现代化, 2017, 0(8): 42-.
[6]	琚兴空. 基于隐马尔科夫模型的网络爬虫检测算法仿真[J]. 计算机与现代化, 2017, 0(4): 122-126.
[7]	钱哨，陈丹. 基于JSON数据格式的飞机协同设计应用适配器[J]. 计算机与现代化, 2016, 0(8): 123-126.
[8]	陈冲,蒋夏军. 一种支持通配符查询的XML模式匹配算法[J]. 计算机与现代化, 2016, 0(4): 65-73.
[9]	王萍，王贺颖. 基于新浪微博的冰雹实况信息挖掘[J]. 计算机与现代化, 2016, 0(3): 24-29+34.
[10]	程光洋1，廉彬2. 基于AdaBoost算法的养老信息筛选及应用[J]. 计算机与现代化, 2016, 0(12): 102-106,110.
[11]	王成勇，杜庆伟，孙静，孙振. 用带权重的pq-gram算法计算XML文档相似度[J]. 计算机与现代化, 2015, 0(3): 20-25.
[12]	林波1，林伟佳2，郭靖羽1，丁东辉2，黄翰2. 基于双层语料过滤器的短语抽取方法[J]. 计算机与现代化, 2015, 0(12): 7-.
[13]	李宗花,张磊. 基于XML Schema的轻量级异构数据集成方法[J]. 计算机与现代化, 2015, 0(11): 93-98.
[14]	林郁峰，陈中育，骆正平，吴星同. 一种基于目标转换的用例建模方法[J]. 计算机与现代化, 2014, 0(7): 40-44.
[15]	杨运平，吴智俊. Apache Shiro安全框架在技术转移服务系统中的应用[J]. 计算机与现代化, 2014, 0(3): 158-160.

基于XML的Web数据挖掘模型设计与研究

Design and Research on Web Data Mining Model Based on XML

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价