Proceedings of the 2014 SIAM International Conference on Data Mining 2014
DOI: 10.1137/1.9781611973440.81
|View full text |Cite
|
Sign up to set email alerts
|

ROCsearch — An ROC-guided Search Strategy for Subgroup Discovery

Abstract: Subgroup Discovery (SD) aims to find coherent, easy-to-interpret subsets of the dataset at hand, where something exceptional is going on. Since the resulting subgroups are defined in terms of conditions on attributes of the dataset, this data mining task is ideally suited to be used by non-expert analysts. The typical SD approach uses a heuristic beam search, involving parameters that strongly influence the outcome. Unfortunately, these parameters are often hard to set properly for someone who is not a data mi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(17 citation statements)
references
References 16 publications
0
17
0
Order By: Relevance
“…This beam only keeps the most promising subgroups to extend at each level [ [36,44,53]]. The redundancy issue due to the beam search is tackled with the pattern skyline paradigm by [54], and with a ROC-based beam search variant for SD by [42]. Another family of SD algorithms relies on evolutionary approaches.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…This beam only keeps the most promising subgroups to extend at each level [ [36,44,53]]. The redundancy issue due to the beam search is tackled with the pattern skyline paradigm by [54], and with a ROC-based beam search variant for SD by [42]. Another family of SD algorithms relies on evolutionary approaches.…”
Section: Related Workmentioning
confidence: 99%
“…Several solutions have been proposed to filter out redundant subgroups, e.g. as did [12,53,42,10]. Basically, a neighboring function enables to keep only local optima.…”
Section: Introductionmentioning
confidence: 99%
“…From a tool perspective, there exist several software packages for subgroup discovery, e.g., Refs . As open source options, there are, for example, subgroup discovery modules for the data mining systems Orange and RapidMiner , the Cortana system for discovering local patterns in data, as well as the specialized subgroup discovery and analytics system VIKAMINE . Using the latter a number of successful real‐world subgroup discovery applications have been implemented.…”
Section: Tools and Applicationsmentioning
confidence: 99%
“…In addition, the integration of (rich) background knowledge in a knowledge‐intensive approach, e.g., Refs , is a prerequisite for the analysis of large datasets for which relations and prior information needs to be utilized. This also tackles the area of automatic subgroup discovery, recent search strategies, e.g., Ref , and the applied significance filtering in many methods and tools . For a full‐scale approach, these issues need to be addressed such that suitable methods can be integrated comprehensively, from automatic to interactive approaches, e.g., Refs , which can also be applied for generating appropriate explanations .…”
Section: Future Directions and Challengesmentioning
confidence: 99%
“…When the features serve as onedimensional subgroups on the first level of the search lattice, the entire realizations of A are used. For one-dimensional refinements, only those realizations covered by the subgroup in consideration are used [13,20]. For readability, we keep our discussion to the first case.…”
Section: Mining Binary Featuresmentioning
confidence: 99%