2006
DOI: 10.1007/11871637_19
|View full text |Cite
|
Sign up to set email alerts
|

Closed Sets for Labeled Data

Abstract: Closed sets are being successfully applied in the context of compacted data representation for association rule learning. However, their use is mainly descriptive. This paper shows that, when considering labeled data, closed sets can be adapted for prediction and discrimination purposes by conveniently contrasting covering properties on positive and negative examples. We formally justify that these sets characterize the space of relevant combinations of features for discriminating the target class. In practice… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
87
0

Year Published

2008
2008
2020
2020

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 42 publications
(88 citation statements)
references
References 11 publications
1
87
0
Order By: Relevance
“…In, e.g., [1,5], the authors mine free itemsets and closed itemsets (i.e., CECs) once the class attribute has been removed from the entire database. Other proposals, e.g., [3,4], consider (closed) itemset mining from samples of each class separately.…”
Section: Freeness or Closedness?mentioning
confidence: 99%
See 3 more Smart Citations
“…In, e.g., [1,5], the authors mine free itemsets and closed itemsets (i.e., CECs) once the class attribute has been removed from the entire database. Other proposals, e.g., [3,4], consider (closed) itemset mining from samples of each class separately.…”
Section: Freeness or Closedness?mentioning
confidence: 99%
“…Indeed, when considering contingency tables (See Tab. 2), for all the studied approaches, f * 1 and f * 0 are known (class distribution). However, if we consider the proposals from [3,4] based on frequent closed itemsets mined per class, we get directly the value f 11 (i.e., f req(X ∪ c, r)) and the value for f 01 can be inferred. Closure equivalence classes in [5] only inform us on f 1 * (i.e., f req(X, r)) and f 0 * .…”
Section: Information and Equivalence Classesmentioning
confidence: 99%
See 2 more Smart Citations
“…Although other interestingness measures can be adapted accordingly, we focus on these two families of generalization-aware measures in this paper, since they are the only ones, which have been described in previous literature and applied in practical applications. We will also not argue about advantages of these functions in comparison to traditional measures or other methods that avoid redundant output, such as closed pattern [12], but focus on efficient mining for these generalization-aware measures by introducing novel, difference-based optimistic estimates.…”
Section: Introductionmentioning
confidence: 99%