Machine Learning and Data Mining in Pattern Recognition
DOI: 10.1007/3-540-45065-3_21
|View full text |Cite
|
Sign up to set email alerts
|

Generalization of Pattern-Growth Methods for Sequential Pattern Mining with Gap Constraints

Abstract: Abstract. The problem of sequential pattern mining is one of the several that has deserved particular attention on the general area of data mining. Despite the important developments in the last years, the best algorithm in the area (PrefixSpan) does not deal with gap constraints and consequently doesn't allow for the introduction of background knowledge into the process. In this paper we present the generalization of the PrefixSpan algorithm to deal with gap constraints, using a new method to generate project… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
40
0

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 41 publications
(41 citation statements)
references
References 5 publications
1
40
0
Order By: Relevance
“…Given two sequences α=<a 1 a 2 …a n > and β=<b 1 b 2 …b m > where α is called a subsequence of β, denoted as α⊆ β, if there exist integers 1≤j 1 <j 2 <…<j n ≤m such that a 1 ⊆b j1 , a 2 ⊆b j2 ,…,a n ⊆b jn. Here if α and β have the following sequences α=<(xy), t> and β=< (xyz), (zt)>, β is denoted as a super sequence of α [2,6]. In addition to the discovery of recurrent itemsets, sequential pattern mining requires the arrangement of those itemsets in a sequence.…”
Section: Literature Reviewmentioning
confidence: 99%
See 2 more Smart Citations
“…Given two sequences α=<a 1 a 2 …a n > and β=<b 1 b 2 …b m > where α is called a subsequence of β, denoted as α⊆ β, if there exist integers 1≤j 1 <j 2 <…<j n ≤m such that a 1 ⊆b j1 , a 2 ⊆b j2 ,…,a n ⊆b jn. Here if α and β have the following sequences α=<(xy), t> and β=< (xyz), (zt)>, β is denoted as a super sequence of α [2,6]. In addition to the discovery of recurrent itemsets, sequential pattern mining requires the arrangement of those itemsets in a sequence.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Let s represents minimum support threshold for mining the database and let m =|C|. Then aim of mining frequently occurring itemset is to discover recurrent itemsets among | I s | different possible itemsets as represented in equation (i) below [6]:…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…In [7], an algorithm was developed for a text collection, which is different from finding all the MFS into a single text. The algorithms for getting all MFS can be classified as Apriori-based (typical) and Pattern-growth methods [8].…”
Section: Related Workmentioning
confidence: 99%
“…Application of this approach to the treatment of sequential data results in one important special case. The process of finding all sub-sequences that occur often on a specified sequence database and have minimum support threshold is known as sequential pattern mining [1]. Data is normally assumed to be centralized, memory-resident, and static by conventional methods for sequential mining.…”
Section: Introductionmentioning
confidence: 99%