Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '03 2003
DOI: 10.1145/956755.956781

On detecting differences between groups

Abstract: Understanding the differences between contrasting groups is a fundamental task in data analysis. This realization has led to the development of a new special-purpose data mining technique, contrast-set mining. We undertook a study with a retail collaborator to compare contrast-set mining with existing rule-discovery techniques. To our surprise we observed that straightforward application of an existing commercial rule-discovery system, Magnum Opus, could successfully perform the contrast-set-mining task. This …

Cited by 32 publications (38 citation statements)
References 3 publications
“…Hence, completeness can only be ensured by a post-pruning process. The same problem was identified in Webb et al (2003) using Magnum Opus. CAREN implements two versions of this pruning process.…”
Section: Definition (mentioning)
confidence: 60%
“…Work in (Bay & Pazzani, 1999, 2001; Webb et al, 2003) focus on mining contrast sets: conjunctions of attributes and values that differ meaningfully in their distribution across groups. Those allow us to answer queries of the form, "How are History and Computer Science students different?"…”
Section: Related Work (mentioning)
confidence: 99%
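The definition quoted above (conjunctions of attribute-value pairs whose distribution differs across groups) can be illustrated with a small sketch. This is not the STUCCO or Magnum Opus algorithm: it only enumerates small conjunctions and keeps those whose support differs between groups by at least a threshold, with none of the significance testing or pruning that real contrast-set miners add. The function names, the `min_diff` threshold, and the toy student records are all hypothetical.

```python
from itertools import combinations


def support(rows, itemset):
    """Fraction of rows that contain every (attribute, value) pair in itemset."""
    if not rows:
        return 0.0
    hits = sum(1 for row in rows if all(row.get(a) == v for a, v in itemset))
    return hits / len(rows)


def contrast_sets(groups, max_size=2, min_diff=0.3):
    """Return (itemset, supports, diff) triples whose support gap is >= min_diff.

    Illustrative brute-force search only; min_diff is an assumed threshold.
    """
    # Every (attribute, value) pair seen in any group is a candidate item.
    items = sorted({(a, v) for rows in groups.values()
                    for row in rows for a, v in row.items()})
    found = []
    for size in range(1, max_size + 1):
        for itemset in combinations(items, size):
            # Skip conjunctions that use the same attribute twice.
            if len({a for a, _ in itemset}) < size:
                continue
            supports = {g: support(rows, itemset) for g, rows in groups.items()}
            diff = max(supports.values()) - min(supports.values())
            if diff >= min_diff:
                found.append((itemset, supports, diff))
    # Largest support difference first.
    return sorted(found, key=lambda t: -t[2])


# Hypothetical toy data: two groups of four student records each.
groups = {
    "History": [
        {"laptop": "no", "club": "debate"},
        {"laptop": "no", "club": "debate"},
        {"laptop": "yes", "club": "debate"},
        {"laptop": "no", "club": "chess"},
    ],
    "Computer Science": [
        {"laptop": "yes", "club": "chess"},
        {"laptop": "yes", "club": "chess"},
        {"laptop": "yes", "club": "debate"},
        {"laptop": "no", "club": "chess"},
    ],
}

for itemset, supports, diff in contrast_sets(groups):
    print(itemset, supports, diff)
```

Each reported conjunction answers a small part of the "how are these groups different?" query; a conjunction with equal support in both groups is not reported.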
“…Furthermore, software companies can devise well-performed antispam email systems based on these differences. Therefore, there are some researches reported on mining group differences between contrast groups from observational multivariate data (Bay & Pazzani, 1999, 2001; Webb, Butler, & Newlands, 2003).…”
Section: Introduction (mentioning)
confidence: 99%
“…Some particular data mining techniques, known as contrast-set mining (Bay and Pazzani, 2001; Dong and Li, 1999; Webb et al, 2003), have been designed specifically to identify differences between databases to be contrasted.…”
Section: Related Work (mentioning)
confidence: 99%
“…A similar strategy is also used in STUCCO (Bay and Pazzani, 2001) to obtain characteristic itemsets in one database based on the χ² test. In addition, Magnum Opus (Webb et al, 2003) examines relations between itemsets and a database from several databases. On the other hand, this paper seeks paired itemsets whose correlations radically increase in one database.…”
Section: Related Work (mentioning)
confidence: 99%
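A chi-squared test of independence, of the kind the statement above attributes to STUCCO, can be sketched on a contingency table whose rows record whether an itemset is present and whose columns are the groups being contrasted. The sketch below computes only the Pearson statistic and compares it with the standard 5% critical value for one degree of freedom (3.841). It is not STUCCO's actual procedure, which the statement does not describe in detail. The function name and the counts are hypothetical.

```python
def chi_squared(table):
    """Pearson chi-squared statistic for a table of observed counts.

    table[i][j] is the count for row category i and column category j.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    if n == 0:
        return 0.0
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if itemset presence were independent of group.
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                stat += (observed - expected) ** 2 / expected
    return stat


# Hypothetical counts: rows = itemset present / absent, columns = two groups.
table = [
    [1, 3],
    [3, 1],
]

CRITICAL_5PCT_DF1 = 3.841  # 5% critical value, one degree of freedom

stat = chi_squared(table)
print(stat, stat > CRITICAL_5PCT_DF1)
```

With counts this small the statistic stays below the critical value, so the difference would not be called significant at the 5% level; larger samples with the same proportions would give a larger statistic.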