2002
DOI: 10.1109/tkde.2002.1000341
|View full text |Cite
|
Sign up to set email alerts
|

Mining sequential patterns with regular expression constraints

Abstract: AbstractÐDiscovering sequential patterns is an important problem in data mining with a host of application domains including medicine, telecommunications, and the World Wide Web. Conventional sequential pattern mining systems provide users with only a very restricted mechanism (based on minimum support) for specifying patterns of interest. As a consequence, the pattern mining process is typically characterized by lack of focus and users often end up paying inordinate computational costs just to be inundated wi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
290
0
2

Year Published

2003
2003
2013
2013

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 224 publications
(293 citation statements)
references
References 6 publications
1
290
0
2
Order By: Relevance
“…Harms et al [12], meanwhile, experiment with closed serial episodes. In another attempt to trim the output, Garofalakis et al [9] proposed a family of algorithms called Spirit which allow the user to define regular expressions that specify the language that the discovered patterns must belong to.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Harms et al [12], meanwhile, experiment with closed serial episodes. In another attempt to trim the output, Garofalakis et al [9] proposed a family of algorithms called Spirit which allow the user to define regular expressions that specify the language that the discovered patterns must belong to.…”
Section: Related Workmentioning
confidence: 99%
“…Consider sequence s = abxywzcbaxywzc and parallel episode X = {a, b, c}. s contains three minimal windows of this episode, namely s [1,7], s [7,9] and s [8,14]. Note that the second minimal window overlaps with both the first and the third, so the disjointwindow frequency of X would be equal to 2.…”
Section: Definition 14mentioning
confidence: 99%
See 1 more Smart Citation
“…indicating that it appears in sequence 3 and starts at timestamp 2. The interval [4,9] means that it represents several occurrences ending from 4 to 9. The gmax value of 7 notifies that occurrences ending from 4 to 7 satisfy the maxGap constraint, while for occurrences ending strictly after timestamp 7 only the prefix of the occurrence satisfies maxGap.…”
Section: Constrained Generalized Occurrencesmentioning
confidence: 99%
“…Independently, the use of user-defined constraints to reduce the search space during sequential pattern extraction has been developed (e.g., [11,9,4,2]). Indeed, it has also been integrated in the occurrence list approach in the cSpade algorithm [13], resulting in one of the most efficient algorithms proposed for constraint-based mining of sequential patterns.…”
Section: Introductionmentioning
confidence: 99%