2009
DOI: 10.2174/138920209789208228
|View full text |Cite
|
Sign up to set email alerts
|

Classification and Error Estimation for Discrete Data

Abstract: Discrete classification is common in Genomic Signal Processing applications, in particular in classification of discretized gene expression data, and in discrete gene expression prediction and the inference of boolean genomic regulatory networks. Once a discrete classifier is obtained from sample data, its performance must be evaluated through its classification error. In practice, error estimation methods must then be employed to obtain reliable estimates of the classification error based on the available dat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0
4

Year Published

2010
2010
2017
2017

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 50 publications
0
7
0
4
Order By: Relevance
“…This method provides a measure of the network's learning ability; yet it is not preferable for performance evaluation tasks as it is known to be optimistically biased. However, as shown in [ 52 , 53 ], in discrete classification problems with large-sample categorical datasets, like the classification problem of this study, resubstitution can be significantly accurate relative to more complex error estimation schemes, since the optimistic bias and the variance of the method tend to be vanished as the sample size increases, provided that classifier complexity is not too high. For this reason we decided to take into consideration the performance of the final models when for testing the training and the validation sets are used.…”
Section: Methodsmentioning
confidence: 90%
“…This method provides a measure of the network's learning ability; yet it is not preferable for performance evaluation tasks as it is known to be optimistically biased. However, as shown in [ 52 , 53 ], in discrete classification problems with large-sample categorical datasets, like the classification problem of this study, resubstitution can be significantly accurate relative to more complex error estimation schemes, since the optimistic bias and the variance of the method tend to be vanished as the sample size increases, provided that classifier complexity is not too high. For this reason we decided to take into consideration the performance of the final models when for testing the training and the validation sets are used.…”
Section: Methodsmentioning
confidence: 90%
“…In analogous fashion to the performance metrics of prediction error estimators [8], the key performance metrics for an CoD estimator CoD are its bias…”
Section: Performance Metrics Of Cod Estimatorsmentioning
confidence: 99%
“…The error of the best predictor corresponds to the optimal prediction error, also known as Bayes error, which depends only on the underlying probability model [7]. However, in practical real-world problems, the underlying probability model is unknown, and thus we arrive at the fundamental issue of how to find a good prediction error estimator in small-sample settings [8,9]. An error estimator may be a deterministic function of the sample data, in which case it is called a nonrandomized error estimator; such popular error estimators as resubstitution and leave-oneout are examples.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, classifier design and error estimation of the designed classifiers become key factors to perform such a process successfully. From these two tasks, error estimation is perhaps the most critical task since the error estimator needs to preserve the true ordering [11][12][13][14].…”
Section: Introductionmentioning
confidence: 99%