11th International Database Engineering and Applications Symposium (IDEAS 2007) 2007
DOI: 10.1109/ideas.2007.4318095
|View full text |Cite
|
Sign up to set email alerts
|

Contrasting the Contrast Sets: An Alternative Approach

Abstract: The need to identify significant differences between contrasting groups or classes is ubiquitous and thus was the focus of many statisticians and data miners. Contrast sets, conjunctions of attribute-value pairs significantly more frequent in one group than another, were proposed to describe such differences, which lead to the introduction of a new data mining techniquecontrast-set mining. A number of attempts have been made in this regard by various authors; however, no clear picture seems to have emerged. In… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2008
2008
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 7 publications
0
5
0
Order By: Relevance
“…Formally, a PCS between two groups is one that satisfies the condition: max(support(cset,G 0 ), support(cset,G 1 )) ≥ minSup (adapted from Satsangi and Zaiane [35]). A significant contrast set (SCS) is a PCS that also meets the significance condition.…”
Section: Contrast Setsmentioning
confidence: 99%
See 1 more Smart Citation
“…Formally, a PCS between two groups is one that satisfies the condition: max(support(cset,G 0 ), support(cset,G 1 )) ≥ minSup (adapted from Satsangi and Zaiane [35]). A significant contrast set (SCS) is a PCS that also meets the significance condition.…”
Section: Contrast Setsmentioning
confidence: 99%
“…2 When the itemset was missing from one of the datasets (i.e., it was not frequent), then the count of support and non-support was unknown. In such a case the support frequency for the contingency table (S Gij or S Nij ) was calculated [35] according to the supCount formula: Following Bay and Pazzani [5], if a PCS was found to be significant at α=0.05, then the patch it represents was deemed valuable (see online appendix for a sensitivity analysis). A valuable patch which was predominately visited by users from the goal group was known as a goal patch and placed in the set P Gi , whereas visitation mostly from the non-goal group resulted in a patch being labeled as a non-goal patch and being placed in the set P Ni .…”
Section: Determining Patch Valuementioning
confidence: 99%
“…Zaiane and his collegues Satsangi and Zaiane 2007) have worked on discovering so called contrast sequences. This is similar to a single stage of the interactive model building process described here, except that two datasets are compared, while in our approach one of the datasets is replaced by an HMM.…”
Section: Related Researchmentioning
confidence: 99%
“…Kralj et al (2007a, b) performed contrast set mining through subgroup discovery (Lavrac et al 2004). Satsangi and Zaiane (2007) use the association rule-based approach to find contrast sets. At the same time, STUCCO has been used to address real-world problems in many fields, such as education (Minaei-Bidgoli et al 2004), medical research (Kralj et al 2007a, b;Siu et al 2005), and the aviation industry (Nazeri et al 2008).…”
Section: Related Workmentioning
confidence: 99%