2007
DOI: 10.1007/978-3-540-73599-1_12
|View full text |Cite
|
Sign up to set email alerts
|

Contrast Set Mining for Distinguishing Between Similar Diseases

Abstract: Abstract. The task addressed and the method proposed in this paper aim at improved understanding of differences between similar diseases. In particular we address the problem of distinguishing between thrombolic brain stroke and embolic brain stroke as an application of our approach of contrast set mining through subgroup discovery. We describe methodological lessons learned in the analysis of brain ischaemia data and a practical implementation of the approach within an open source data mining toolbox.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2010
2010
2024
2024

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(9 citation statements)
references
References 8 publications
0
9
0
Order By: Relevance
“…Mining EPs has been well studied since 1999 , and the implementations of EPs in many areas are found in the literature as well. For example, implementations in the medical area can distinguish similar diseases , classify cancer diagnosis data , and profile leukemia patients . In the networking environment, EPs are implemented to find significant differences in network data streams , or to detect masqueraders .…”
Section: Related Workmentioning
confidence: 99%
“…Mining EPs has been well studied since 1999 , and the implementations of EPs in many areas are found in the literature as well. For example, implementations in the medical area can distinguish similar diseases , classify cancer diagnosis data , and profile leukemia patients . In the networking environment, EPs are implemented to find significant differences in network data streams , or to detect masqueraders .…”
Section: Related Workmentioning
confidence: 99%
“…However, these methods focus only on maximizing classification accuracy and aim to achieve interpretability just by building the model from rules. Similarly, algorithms for problems such as subgroup discovery [24, 33, 40, 46], contrast set learning [3, 4, 30], and emerging pattern mining [17, 19] identify sets of rules to describe the relationships among variables and discover interesting patterns in the data. In contrast, our work explicitly defines an objective function that scores interpretability and accuracy, and by optimizing it, we find a globally near-optimal model.…”
Section: Related Workmentioning
confidence: 99%
“…This is referred to as the one versus all approach. An alternative approach, called round robin, which uses a set of 2 × 2 contingency tables representing all possible pairs of groups, has been used previously for contrast set mining [3], however subsequent research that experimented with both the round robin and one versus all approaches found that the round robin approach was not appropriate when looking for differences between two similar groups [16]. Formally, with the one versus all approach, for a contrast set X, where ∃iP (X|G i ), we determine…”
Section: Comparison Of Contrasting Groupsmentioning
confidence: 99%