Proceedings of the 22nd ACM International Conference on Information &Amp; Knowledge Management 2013
DOI: 10.1145/2505515.2505577
|View full text |Cite
|
Sign up to set email alerts
|

Boolean satisfiability for sequence mining

Abstract: In this paper, we propose a SAT-based encoding for the problem of discovering frequent, closed and maximal patterns in a sequence of items and a sequence of itemsets. Our encoding can be seen as an improvement of the approach proposed in [8] for the sequences of items. In this case, we show experimentally on real world data that our encoding is significantly better. Then we introduce a new extension of the problem to enumerate patterns in a sequence of itemsets. Thanks to the flexibility and to the declarative… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(12 citation statements)
references
References 27 publications
0
12
0
Order By: Relevance
“…This would require adding rewrite rules for specific encodings of the constraints into SAT/ASP. Other pattern mining settings that have been studied in a CP framework such as sequences with wildcards [47,48,49] and sequential patterns [50] can be expressed in MiningZinc too. The main difference is in the definition of the cover relation, and one could add such relations to the MiningZinc library as reusable functions.…”
Section: Related Workmentioning
confidence: 99%
“…This would require adding rewrite rules for specific encodings of the constraints into SAT/ASP. Other pattern mining settings that have been studied in a CP framework such as sequences with wildcards [47,48,49] and sequential patterns [50] can be expressed in MiningZinc too. The main difference is in the definition of the cover relation, and one could add such relations to the MiningZinc library as reusable functions.…”
Section: Related Workmentioning
confidence: 99%
“…More recently, [9], [24] have proposed a SAT-Based approach for discovering frequent, closed and maximal sequential patterns with wildcards in only a single sequence of items or itemsets. However, unlike [9], [24], our approach considers a sequence database of items. Moreover, our approach allows to consider a broader set of constraints that are not handled in [9] (e.g.…”
Section: Dynamic Csp For Mining Richer Sequence Patternsmentioning
confidence: 99%
“…However, unlike [9], [24], our approach considers a sequence database of items. Moreover, our approach allows to consider a broader set of constraints that are not handled in [9] (e.g. gap, regular expression, top-k and relevant subgroup constraints).…”
Section: Dynamic Csp For Mining Richer Sequence Patternsmentioning
confidence: 99%
See 1 more Smart Citation
“…Examples include unbounded model checking [9], reachability analysis [5], prime implicant generation [11], and so on. In recent years, a data mining framework based on the reduction to an ALL-SAT problem has been proposed [7]. Most ALL-SAT solvers are implemented on top of a SAT solver that searches one solution.…”
Section: Introductionmentioning
confidence: 99%