Faster Regular Expression Matching

作者: Philip Bille , Mikkel Thorup

DOI: 10.1007/978-3-642-02927-1_16

关键词:

摘要: Regular expression matching is a key task (and often the computational bottleneck) in variety of widely used software tools and applications, for instance, unix grep sed commands, scripting languages such as awk perl , programs analyzing massive data streams, etc. We show how to solve this ubiquitous linear space O (nm (loglogn )/(logn )3/2 + n m ) time where length string. This first improvement dominant /logn term Myers' (n )logn bound [JACM 1992]. also get improved bounds external memory.

参考文章(28)
Bjarne Stroustrup, The C++ programming language (2nd ed.) Addison-Wesley Longman Publishing Co., Inc.. ,(1991)
Ravi Sethi, Jeffrey D. Ullman, Alfred V. Aho, Compilers: Principles, Techniques, and Tools ,(1986)
Stephen Cole Kleene, Representation of Events in Nerve Nets and Finite Automata Princeton University Press. pp. 3- 42 ,(1951) , 10.1515/9781400882618-002
Matthew Hennessy, Robin Milner, On Observing Nondeterminism and Concurrency international colloquium on automata, languages and programming. pp. 299- 309 ,(1980) , 10.1007/3-540-10003-2_79
Zvi Galil, Open Problems in Stringology Springer, Berlin, Heidelberg. pp. 1- 8 ,(1985) , 10.1007/978-3-642-82456-2_1
Philip Bille, New Algorithms for Regular Expression Matching Automata, Languages and Programming. pp. 643- 654 ,(2006) , 10.1007/11786986_56
William J. Masek, Michael S. Paterson, A faster algorithm computing string edit distances Journal of Computer and System Sciences. ,vol. 20, pp. 18- 31 ,(1980) , 10.1016/0022-0000(80)90002-1
Fang Yu, Zhifeng Chen, Yanlei Diao, T. V. Lakshman, Randy H. Katz, Fast and memory-efficient regular expression matching for deep packet inspection Proceedings of the 2006 ACM/IEEE symposium on Architecture for networking and communications systems - ANCS '06. pp. 93- 102 ,(2006) , 10.1145/1185347.1185360
Gonzalo Navarro, NR-grep: a fast and flexible pattern-matching tool Software - Practice and Experience. ,vol. 31, pp. 1265- 1312 ,(2001) , 10.1002/SPE.411
Gonzalo Navarro, Mathieu Raffinot, Fast and simple character classes and bounded gaps pattern matching, with applications to protein searching. Journal of Computational Biology. ,vol. 10, pp. 903- 923 ,(2003) , 10.1089/106652703322756140