Computer and Modernization ›› 2011, Vol. 1 ›› Issue (8): 39-41.doi: 10.3969/j.issn.1006-2475.2011.08.011

• 软件工程 • Previous Articles     Next Articles

Research on Transforming HTML into XML

QIAN Cheng, YANG Xiao-lan   

  1. College of Information Engineering, Zhongnan Branch, Wuhan University of Science and Technology, Wuhan 430223, China
  • Received:2011-04-07 Revised:1900-01-01 Online:2011-08-10 Published:2011-08-10

Abstract: Most of the information on the network is programmed in HTML, but the HTML language itself has shortages, so it can not deal with many demands on the network. XML can make up for the lack of HTML, therefore, traditional data network applications and transform XML markup data is becoming increasingly important. In this paper, the conversion from HTML to XML technologies is researched, and the conversion system is implemented in Java language.

Key words: HTML, XML, parser, information extraction, JAXB

CLC Number: