2017
DOI: 10.1016/j.eswa.2016.10.012
|View full text |Cite
|
Sign up to set email alerts
|

A framework for redescription set construction

Abstract: Redescription mining is a field of knowledge discovery that aims at finding different descriptions of similar subsets of instances in the data. These descriptions are represented as rules inferred from one or more disjoint sets of attributes, called views. As such, they support knowledge discovery process and help domain experts in formulating new hypotheses or constructing new knowledge bases and decision support systems. In contrast to previous approaches that typically create one smaller set of redescriptio… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 15 publications
(19 citation statements)
references
References 37 publications
(93 reference statements)
0
19
0
Order By: Relevance
“…In future work, we intend to extend the applicability of the tool for exploration of other kinds of rule sets, such as association rules and subgroups. The tool will also be extended to work with different redescription set construction techniques (Kalofolias et al 2016;Mihelčić et al 2017a), and enable the use of statistical tests within different aspects to facilitate the discovery process. One potentially interesting direction of future research is to explore the potential of developed algorithms and adapted visualizations in an interactive redescription mining setting.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…In future work, we intend to extend the applicability of the tool for exploration of other kinds of rule sets, such as association rules and subgroups. The tool will also be extended to work with different redescription set construction techniques (Kalofolias et al 2016;Mihelčić et al 2017a), and enable the use of statistical tests within different aspects to facilitate the discovery process. One potentially interesting direction of future research is to explore the potential of developed algorithms and adapted visualizations in an interactive redescription mining setting.…”
Section: Discussionmentioning
confidence: 99%
“…Our approach provides a framework that allows incorporating ideas from various pattern set mining and pattern selection approaches, designed to create smaller, compressed or userpreferred set of patterns, into redescription set mining and exploration process. Pattern set mining approaches, such as Xin et al 2005 2017can be used to construct sets of new redescriptions describing the selected subsets of entities/attributes of interest, whereas pattern selection methods (Xin et al 2005;Knobbe and Ho 2006;Pei et al 2007;Ouali et al 2017;Kalofolias et al 2016;Mihelčić et al 2017a) can be used to perform automated and fine-grained selection of redescriptions from a selected subset of redescriptions. Techniques for performing redescription mining with entity/attribute constraints already exist (Zaki and Ramakrishnan 2005;Mihelčić et al 2017b).…”
Section: Specifics and Motivation For Developing Intersetmentioning
confidence: 99%
See 1 more Smart Citation
“…Choice of redescription accuracy measure. Since the data contains missing values, we used the query non-missing Jaccard index, introduced in [30], and further explained in [45] to evaluate redescriptions. The query non missing Jaccard index is defined as:…”
Section: Choice Of Methodology Redescription Accuracy Measure and A mentioning
confidence: 99%
“…As a part of the related work, we present a slightly modified generalized version of the CLUS-RM algorithm [27] which we call the GCLUS-RM. The generalised CLUS-RM algorithm (GCLUS-RM), presented in Algorithm 1, contains memory constraints on the maximal size of the redescription set and allows using an arbitrary, rule-transformable, model M obtained using some multi-target regression (multi-label classification), machine learning algorithm Alg (lines 1 to 10).…”
Section: A the Gclus-rm Algorithmmentioning
confidence: 99%