2011
DOI: 10.1007/978-3-642-23291-6_6
|View full text |Cite
|
Sign up to set email alerts
|

On Dataset Complexity for Case Base Maintenance

Abstract: Abstract.We present what is, to the best of our knowledge, the first analysis that uses dataset complexity measures to evaluate case base editing algorithms. We select three different complexity measures and use them to evaluate eight case base editing algorithms. While we might expect the complexity of a case base to decrease, or stay the same, and the classification accuracy to increase, or stay the same, after maintenance, we find many counter-examples. In particular, we find that the RENN noise reduction a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 11 publications
0
5
0
Order By: Relevance
“…F1 uses the largest discriminating ratio among all the dimensions provided by the encoding method, indicating if the problem classes can be separable using this high-discriminant feature. F2 is related to the overlapping intervals between the problem classes [10]. The Average Number of Principal Component Analysis (PCA) dimensions compared to the original dimensions (T4) can be used for evaluating dimensionality [20].…”
Section: Introductionmentioning
confidence: 99%
“…F1 uses the largest discriminating ratio among all the dimensions provided by the encoding method, indicating if the problem classes can be separable using this high-discriminant feature. F2 is related to the overlapping intervals between the problem classes [10]. The Average Number of Principal Component Analysis (PCA) dimensions compared to the original dimensions (T4) can be used for evaluating dimensionality [20].…”
Section: Introductionmentioning
confidence: 99%
“…Before the instance level was addressed, complexity measures were applied in tasks like IS [18,11] or to assess whether it is worthwhile to apply a noise filter to the instances or not [26]. However, the instance perspective of data complexity has fostered their use in tasks related to IS like noise filter or data sampling.…”
Section: State-of-the-artmentioning
confidence: 99%
“…(3) Reachability(a ∈ A) = {a ′ ∈ A : solves(a ′ , a)}, with the unedited original training data [15]. CNN is further extended as reduced edited nearest neighbor method in [8]. The noisy cases are removed, which belong to a class different than the majority of their NN's [8].…”
Section: Related Workmentioning
confidence: 99%
“…CNN is further extended as reduced edited nearest neighbor method in [8]. The noisy cases are removed, which belong to a class different than the majority of their NN's [8].…”
Section: Related Workmentioning
confidence: 99%