Proceedings of the 2003 SIAM International Conference on Data Mining 2003
DOI: 10.1137/1.9781611972733.36
|View full text |Cite
|
Sign up to set email alerts
|

ApproxMAP: Approximate Mining of Consensus Sequential Patterns

Abstract: Sequential pattern mining is an important data mining task with broad applications. However, conventional methods may meet inherent difficulties in mining databases with long sequences and noise. They may generate a huge number of short and trivial patterns but fail to find interesting patterns approximately shared by many sequences. To attack these problems, in this paper, we propose the theme of approximate sequential pattern mining roughly defined as identifying patterns approximately shared by many sequenc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
40
0

Year Published

2005
2005
2018
2018

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 73 publications
(40 citation statements)
references
References 26 publications
0
40
0
Order By: Relevance
“…Much of the data in these databases is in the form of sequences and we often seek sequences that occur frequently or are "centrally located" or form a "consensus pattern." An example arises in social welfare data [245,246], where an algorithm for finding approximate sequential consensus patterns, ones that appear frequently in a database, is discussed. A similar problem arises in molecular biology, when we seek to choose sequences that occur frequently or are "centrally located" in a database of molecular sequences.…”
Section: Large Databases and Inferencementioning
confidence: 99%
“…Much of the data in these databases is in the form of sequences and we often seek sequences that occur frequently or are "centrally located" or form a "consensus pattern." An example arises in social welfare data [245,246], where an algorithm for finding approximate sequential consensus patterns, ones that appear frequently in a database, is discussed. A similar problem arises in molecular biology, when we seek to choose sequences that occur frequently or are "centrally located" in a database of molecular sequences.…”
Section: Large Databases and Inferencementioning
confidence: 99%
“…This combination may generate large sequences. However, mining sequential patterns is inefficient with long sequences and often may not find exact matching of long patterns in the database (Kum et al 2003). Due to these shortcomings, an improved MDSPM technique is required to mine patterns without loosing important information carried by the dimensions.…”
Section: Introductionmentioning
confidence: 99%
“…It is one of the essential data mining tasks widely used in many applications, including customer purchase pattern analysis and biological data sequences [17][18][19][20][21][22], etc. Many research have been performed to efficient sequential pattern mining, such as [23][24][25], closed and maximal sequential pattern mining [26][27][28][29], constraint-based sequential pattern mining [30][31][32] approximate sequential pattern mining [33], sequential pattern mining in multiple data sources [34], sequential pattern mining in noisy data [35], incremental mining of sequential patterns [36], and time-interval weighted sequential pattern mining [37]. Two of the general sequential mining algorithms are SPADE [24] and PrefixSpan [23], which are more efficient than others in terms of processing time.…”
Section: Introductionmentioning
confidence: 99%