Computer and Modernization

Previous Articles     Next Articles

Regular Expression Grouping Algorithm Based on Expansion Coefficient

  

  1. (1. School of Computer Science, Chongqing University, Chongqing 400030, China;
    2. Information and Campus Network Management Center, Chongqing University, Chongqing 400030, China;
    3. School of Software, Chongqing University, Chongqing 401331, China)
  • Received:2014-03-10 Online:2014-07-16 Published:2014-07-17

Abstract: Matching speed of det­erministic finite automata (DFA) is far faster than that of the non-deterministic finite state autom­ata (NFA). However, when a large number of regular expressions transform in­to a DFA, the D­FA may consume huge storage space because of state explosion. Firstly, to describe characteristics of the regular expressions, expansion coefficient was defined. Based on the concept of expansion coefficient, a m­ore efficient grou­p­­­ing algorithm which named IGA algorithm was prop­o­sed. IGA algorithm i­solated those regular expressions which easily lead to state explosion, thus saved st­orage space. Experimental results show that, compared with the original algorithm, when the number of groups is same to the original one, IGA algorithm could reduce 25 percents of the number of states averagely.

Key words: expansion coefficient, regular expression, grouping algorithm, DFA

CLC Number: