2020
DOI: 10.1007/s11222-020-09971-5
|View full text |Cite
|
Sign up to set email alerts
|

An apparent paradox: a classifier based on a partially classified sample may have smaller expected error rate than that if the sample were completely classified

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
18
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(18 citation statements)
references
References 18 publications
0
18
0
Order By: Relevance
“…Following on from the exploratory examination by Ahfock and McLachlan (2019) of partially classified data sets, Ahfock and McLachlan (2020) proposed to treat the labels of the unclassified features as missing data and to introduce a framework for their missingness as in the pioneering work of Rubin (1976) for missingness in incomplete-data analysis. Within this framework, they postulated the dependence of the conditional probability that a label is missing given the data by the logistic model with covariate equal to an entropy-based measure.…”
Section: Modelling Missingness For Unobserved Class Labelsmentioning
confidence: 99%
See 4 more Smart Citations
“…Following on from the exploratory examination by Ahfock and McLachlan (2019) of partially classified data sets, Ahfock and McLachlan (2020) proposed to treat the labels of the unclassified features as missing data and to introduce a framework for their missingness as in the pioneering work of Rubin (1976) for missingness in incomplete-data analysis. Within this framework, they postulated the dependence of the conditional probability that a label is missing given the data by the logistic model with covariate equal to an entropy-based measure.…”
Section: Modelling Missingness For Unobserved Class Labelsmentioning
confidence: 99%
“…This is not surprising as entities with features in such regions would tend to be representative of entities that would be difficult to classify correctly, as illustrated for three datasets in the previous section. Ahfock and McLachlan (2020) showed how this dependency on the missingness pattern can be leveraged to provide additional information about the parameters in the optimal classifier specified by the Bayes' rule.…”
Section: Modelling Missingness For Unobserved Class Labelsmentioning
confidence: 99%
See 3 more Smart Citations