Research on Web Biological Information Extraction Method Based on Ontology

doi:10.3969/j.issn.1006-2475.2013.05.041

Computer and Modernization, 2013, Vol. 1, Issue (5): 172-175.

HE Yuan

School of Information Science and Technology, Hunan Agricultural University, Changsha, Hunan 410128, China

Received: 2013-01-06  Online: 2013-05-28  Published: 2013-05-28

Abstract: To address the shortcomings of traditional keyword-based search and data retrieval, this paper proposes an ontology-based Web information extraction framework. The framework first fetches a Web page and converts it into a well-formed HTML document, then uses an HTML parser to turn that document into a DOM tree. XPath expressions locate the data blocks the user is interested in, and extraction rules are generated from those blocks. Finally, the OntPMatch algorithm extracts the data, which is stored in RDF format. Using cotton information as the case study, the paper implements a prototype system for extracting biological information from the Web, giving users an effective tool for discovering valuable Web biological information resources.

Key words: ontology, Web, information extraction
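The pipeline in the abstract (well-formed document, DOM tree, XPath-located data block, ontology-guided extraction, RDF output) can be illustrated with a minimal sketch. This is not the paper's implementation. The page fragment, the `http://example.org/cotton` IRIs, the `ONTOLOGY` mapping, and the function names are all hypothetical, and the plain label lookup only stands in for the OntPMatch algorithm, whose details the abstract does not give. The sketch assumes the page has already been cleaned into well-formed markup, and it uses the XPath subset supported by Python's standard `xml.etree.ElementTree`.

```python
import xml.etree.ElementTree as ET

# Hypothetical page fragment, assumed to be already converted to well-formed markup.
PAGE = """<html><body>
<table id="variety">
  <tr><th>Name</th><td>Cotton variety A</td></tr>
  <tr><th>Fiber length</th><td>29.5 mm</td></tr>
  <tr><th>Breeder</th><td>Example Institute</td></tr>
</table>
</body></html>"""

# Hypothetical ontology fragment: page labels mapped to property IRIs.
ONTOLOGY = {
    "Name": "http://example.org/cotton#name",
    "Fiber length": "http://example.org/cotton#fiberLength",
}

# Extraction rule: an XPath (ElementTree subset) selecting the rows of the data block.
ROW_XPATH = ".//table[@id='variety']/tr"

# Hypothetical subject IRI for the single record on this page.
SUBJECT = "http://example.org/cotton/item1"


def extract(page, row_xpath, ontology, subject=SUBJECT):
    """Parse the page into a tree and keep only rows whose label is in the ontology."""
    root = ET.fromstring(page)
    triples = []
    for row in root.findall(row_xpath):
        label = row.findtext("th", default="").strip()
        value = row.findtext("td", default="").strip()
        prop = ontology.get(label)  # simple lookup, a stand-in for OntPMatch
        if prop and value:
            triples.append((subject, prop, value))
    return triples


def to_ntriples(triples):
    """Serialize triples as N-Triples lines with plain literals (minimal escaping only)."""
    lines = []
    for s, p, o in triples:
        escaped = o.replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f'<{s}> <{p}> "{escaped}" .')
    return "\n".join(lines)


print(to_ntriples(extract(PAGE, ROW_XPATH, ONTOLOGY)))
```

In this sketch the "Breeder" row is skipped because the hypothetical ontology has no matching property, which shows how the ontology, rather than the page layout alone, decides what gets stored.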

CLC number: TP311

HE Yuan. Research on Web Biological Information Extraction Method Based on Ontology[J]. Computer and Modernization, 2013, 1(5): 172-175.
