Computer and Modernization ›› 2013, Vol. 1 ›› Issue (5): 172-175.doi: 10.3969/j.issn.1006-2475.2013.05.041

• 应用与开发 • Previous Articles     Next Articles

Research on Web Biological Information Extraction Method Based on Ontology

HE Yuan   

  1. School of Information Science and Technology, HNAU, Changsha 410128, China
  • Received:2013-01-06 Revised:1900-01-01 Online:2013-05-28 Published:2013-05-28

Abstract: Aiming at the malpractice in traditional search field based on keyword and data retrieval, this paper proposes a Web information extraction framework based on ontology. Firstly, the framework obtains the Web page which is converted into a wellformed HTML document, secondly, the document is turned into the DOM tree by making use of the HTML parser, then, the extraction rules is achieved on the basis of the users’ interest data block which is obtained according to the XPath expression. Finally, the data is extracted through the OntPMatch algorithm, and is stored in RDF data format. The paper makes the empirical study using the cotton information as research object, and realizes a prototype system of extracting biological information data. The paper provides a useful tool for users to obtain valuable biological information from Web.

Key words: ontology, Web, information extraction

CLC Number: