2015
DOI: 10.1007/978-3-319-18008-3_20
|View full text |Cite
|
Sign up to set email alerts
|

Constraint-Based Sequence Mining Using Constraint Programming

Abstract: Abstract. The goal of constraint-based sequence mining is to find sequences of symbols that are included in a large number of input sequences and that satisfy some constraints specified by the user. Many constraints have been proposed in the literature, but a general framework is still missing. We investigate the use of constraint programming as general framework for this task. We first identify four categories of constraints that are applicable to sequence mining. We then propose two constraint programming fo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
54
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 30 publications
(54 citation statements)
references
References 16 publications
0
54
0
Order By: Relevance
“…Research in CSPM has primarily focused on exploiting special properties of constraints, such as monotonicity or antimonotonicity, to guarantee the feasibility of pattern extensions in the mining algorithm (Garofalakis, Rastogi, and Shim 1999;Zaki 2000;Lin and Lee 2005;Bonchi and Lucchese 2005;Chen and Hu 2006;Pei, Han, and Wang 2007;Nijssen and Zimmermann 2014;Mallick, Garg, and Grover 2014;Aoga, Guns, and Schaus 2017). Constraint types that do not possess such properties remain a challenge for CSPM algorithms, although some of these have been successfully incorporated in more general item-set mining on databases where events have no specific order (Soulet and Crémilleux 2005;Bistarelli and Bonchi 2007; (C, 2, 1), (C, 5, 2), (A, 8, 3) 2007; Le Bras, Lenca, and Lallich 2009;Leung et al 2012), as well as in CSPM when items and attributes are interchangeable (Pei, Han, and Wang 2007).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Research in CSPM has primarily focused on exploiting special properties of constraints, such as monotonicity or antimonotonicity, to guarantee the feasibility of pattern extensions in the mining algorithm (Garofalakis, Rastogi, and Shim 1999;Zaki 2000;Lin and Lee 2005;Bonchi and Lucchese 2005;Chen and Hu 2006;Pei, Han, and Wang 2007;Nijssen and Zimmermann 2014;Mallick, Garg, and Grover 2014;Aoga, Guns, and Schaus 2017). Constraint types that do not possess such properties remain a challenge for CSPM algorithms, although some of these have been successfully incorporated in more general item-set mining on databases where events have no specific order (Soulet and Crémilleux 2005;Bistarelli and Bonchi 2007; (C, 2, 1), (C, 5, 2), (A, 8, 3) 2007; Le Bras, Lenca, and Lallich 2009;Leung et al 2012), as well as in CSPM when items and attributes are interchangeable (Pei, Han, and Wang 2007).…”
Section: Related Workmentioning
confidence: 99%
“…in terms of memory requirements and computational time, in particular when the resulting subset of constrained patterns is small in comparison to the full unconstrained set. Constraint-based sequential pattern mining (CSPM) aims at providing more efficient methods by embedding constraint reasoning within existing mining algorithms (Pei, Han, and Wang 2007;Negrevergne and Guns 2015). Nonetheless, while certain constraint types are relatively easy to incorporate in a mining algorithm, others of practical use are still challenging to handle in a general and effective way.…”
Section: Introductionmentioning
confidence: 99%
“…In [Negrevergne and Guns, 2015], the authors organize the constraints on sequential patterns in three categories: 1) constraints on patterns, 2) constraints on patterns embeddings, 3) constraints on pattern sets. These constraints are provided by the user and capture his background knowledge.…”
Section: Alternative Sequential Pattern Mining Tasksmentioning
confidence: 99%
“…In these experiments, we analyse the proposed encodings on processing real datasets. We use the same real datasets as selected in [Negrevergne and Guns, 2015] to have a representative panel of application domains:…”
Section: Real Dataset Analysismentioning
confidence: 99%
See 1 more Smart Citation