2003
DOI: 10.1007/978-1-4613-0231-5_1
|View full text |Cite
|
Sign up to set email alerts
|

Correcting the Training Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2003
2003
2013
2013

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(11 citation statements)
references
References 20 publications
0
11
0
Order By: Relevance
“…In a previous work [2], a methodology for correcting a TS while employing nonparametric classifiers has been presented. The Decontamination procedure can be regarded as a cleaning process removing some elements of the TS and correcting the label of several others while retaining them.…”
Section: Proposed Strategiesmentioning
confidence: 99%
See 1 more Smart Citation
“…In a previous work [2], a methodology for correcting a TS while employing nonparametric classifiers has been presented. The Decontamination procedure can be regarded as a cleaning process removing some elements of the TS and correcting the label of several others while retaining them.…”
Section: Proposed Strategiesmentioning
confidence: 99%
“…In the present paper, we introduce a new proposal for balancing the TS through reduction of the majority class size and, at the same time, an increase in the amount of prototypes in the minority class. To this aim, we employ a modification of the Decontamination methodology [2] that will be referred to as Restricted Decontamination. We also explore the convenience of using this technique in combination with a weighted distance measure aimed at biasing the classification procedure.…”
Section: Introductionmentioning
confidence: 99%
“…The Generalized Editing Algorithm consists of removing some suspicious prototypes and changing the class labels of some other instances. Accordingly, it can be regarded as a technique for modifying the structure of the training sample (through re-labeling of some training instances) and not only for eliminating atypical instances [20]. The second algorithm is proposed taking into account these ideas.…”
Section: Two Methods For Editing Training Set Based On Rough Setsmentioning
confidence: 99%
“…This technique consists in applying the k-NN (k > 1) classifier to estimate the class label of every prototype in the training set and discard those instances whose class label does not agree with the class associated to the majority of the k neighbors. The benefitsimprovements of the generalization accuracy-of Wilson's algorithm have been supported by theoretical and empirical evaluations [20]. The Repeated ENN (RENN) applies the ENN algorithm repeatedly until all instances remaining have a majority of their neighbors with the same class, which continues to widen the gap between classes and smoothes the decision boundary.…”
Section: About Editing Training Setsmentioning
confidence: 99%
See 1 more Smart Citation