Proceedings of the 23rd International Conference on Parallel Architectures and Compilation 2014
DOI: 10.1145/2628071.2628079
|View full text |Cite
|
Sign up to set email alerts
|

Bitwise data parallelism in regular expression matching

Abstract: A new parallel algorithm for regular expression matching is developed and applied to the classical grep (global regular expression print) problem. Building on the bitwise data parallelism previously applied to the manual implementation of token scanning in the Parabix XML parser, the new algorithm represents a general solution to the problem of regular expression matching using parallel bit streams. On widelydeployed commodity hardware using 128-bit SSE2 SIMD technology, our algorithm implementations can subst… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…One study [40] presents an experimental evaluation of several FA models (NFAs, DFAs and A-DFAs) on CPUs with large pattern sets, reporting single-stream performance of 40 Mbps on a single-core Xeon E5620 processor. Others exploit SIMD execution on small pattern sets using efficient gather/scatter extension [69] and bit-parallel algorithm [70,71]. Recent work on small FSMs and inputs achieved 1.5 Gbps on a single CPU thread, but the challenges of large, complex FAs and data sets remain unaddressed [63].…”
Section: Related Work and Discussionmentioning
confidence: 99%
“…One study [40] presents an experimental evaluation of several FA models (NFAs, DFAs and A-DFAs) on CPUs with large pattern sets, reporting single-stream performance of 40 Mbps on a single-core Xeon E5620 processor. Others exploit SIMD execution on small pattern sets using efficient gather/scatter extension [69] and bit-parallel algorithm [70,71]. Recent work on small FSMs and inputs achieved 1.5 Gbps on a single CPU thread, but the challenges of large, complex FAs and data sets remain unaddressed [63].…”
Section: Related Work and Discussionmentioning
confidence: 99%
“…The result is returned to the processor. This feature occurs in multiple applications such as database management [17], DNA sequencing [18][19][20], and graph processing [21].…”
Section: B Potential Targeted Applicationsmentioning
confidence: 99%
“…Hardware-based automata can perform simultaneous and parallel exploration of all possible valid paths in an NFA, thereby achieving the processing complexity of a DFA without being subject to DFA state explosion. FPGAs offer the requisite parallelism and a number of relatively recent efforts have explored this direction [5,6,7,8,9]. However, the architectures based on NFA are limited by the capacity of FPGA chips since the transition table is mapped directly into the FPGA logic.…”
Section: Introductionmentioning
confidence: 99%