1992
DOI: 10.1002/spe.4380221006
|View full text |Cite
|
Sign up to set email alerts
|

Tuning the boyer‐moore‐horspool string searching algorithm

Abstract: Substring search is a common activity in computing. The fastest known search method is that of Boyer and Moore with the improvements introduced by Horspool. This paper presents a new implementation which takes advantage of the dependencies between the characters. The resulting code runs 25 per cent faster than the best currently‐known routine.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
33
0
1

Year Published

1998
1998
2022
2022

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 66 publications
(34 citation statements)
references
References 9 publications
0
33
0
1
Order By: Relevance
“…In fact, they proposed two algorithms, one for presentation and another which performs best in practical applications (Hume and Sunday, 1991). Since then, some improvements or modifications have been added and parts of the algorithm have been proposed as non beneficial in certain situation (Hume and Sunday, 1991, Horspool, 1980, Raita, 1992. In general, the algorithm and its variants build on the idea that comparing a pattern from its rightmost end to the text in question allows for larger shifts which can be precomputed from the pattern.…”
Section: Boyer-moore String Matching Algorithmmentioning
confidence: 99%
See 1 more Smart Citation
“…In fact, they proposed two algorithms, one for presentation and another which performs best in practical applications (Hume and Sunday, 1991). Since then, some improvements or modifications have been added and parts of the algorithm have been proposed as non beneficial in certain situation (Hume and Sunday, 1991, Horspool, 1980, Raita, 1992. In general, the algorithm and its variants build on the idea that comparing a pattern from its rightmost end to the text in question allows for larger shifts which can be precomputed from the pattern.…”
Section: Boyer-moore String Matching Algorithmmentioning
confidence: 99%
“…This is even more pronounced when a large number of matches are expected in the text or if the suffix of the pattern is abundant in the text. Raita created a variant of the BMH algorithm which introduced sentinels in order to speed up searches by first comparing the parts of the pattern with the weakest dependencies (Raita, 1992). He reported an improvement of approximately 25% over the BMH algorithm but it has been shown by Smith to be solely due to sentinel use, as opposed to character dependencies within the pattern, as Raita concluded (Smith, 1994).…”
Section: Introductionmentioning
confidence: 99%
“…The major advantage of this method is flexibility in adapting to different edit distance functions. The Raita algorithm [10] utilizes the same approach as Horspool algorithm [6] to obtaining the shift value after an attempt. Instead of comparing each character in the pattern with the sliding window from right to left, the order of comparison in Raita algorithm [10] is carried out by first comparing the rightmost and leftmost characters of the pattern with the sliding window.…”
Section: Cpc-character Per Comparison Ratiomentioning
confidence: 99%
“…String matching [2] algorithms are used in most of the real world applications where pattern extraction is required like as Intrusion Detection system [3][4], Plagiarism detection [5], Data Mining [6] and Bioinformatics [7]. Bit parallel algorithms are faster than the other benchmark character based algorithms like as KMP [4] [8], BM [9][10], BMH [11] [12], BMHS [13], BMHS2 [14], BMI [15], Improved BMHS [16], Cmmentz Walter [17][18], Wu Manber [19] [20] and Aho-Corasick [21] [22] etc. Bit Parallel algorithms [23] are based on the non deterministic automata but there is no such automata are present.…”
Section: Introductionmentioning
confidence: 99%