1972
DOI: 10.1080/01621459.1972.10482414
|View full text |Cite
|
Sign up to set email alerts
|

The Treatment of Missing Values in Discriminant Analysis—I. The Sampling Experiment

Abstract: 2.1) (2.3) Probabilitie. of correct cla..ificafion under several commonly u.ed method. of handling mi.sing value. are .tudied by Monte Carlo method•. The method. include u'e of only complete observation vectors; u'e of all observation. with no replacement; .ub.titution of mean. for missing ob.ervations; Buck's regression method; and, Dear'. principal component method. Discriminant functions were formed u.ing independent random sample. from two multivariate normal distribution. with equal covariance matrices. M… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
33
0

Year Published

1995
1995
2017
2017

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 34 publications
(34 citation statements)
references
References 24 publications
1
33
0
Order By: Relevance
“…Pairwise deletion is typically more accurate (Gleason & Staelin, 1975;Kim & Curry, 1977) and regression imputation is most accurate (Beale & Little, 1975;Buck, 1960;Chan & Dunn, 1972;Gleason & Staelin, 1975;Raymond & Roberts, 1987). Mean substitution is problematic since inserting means in place of missing data tends to reduce variance in a given variable and hence reduce covariance with another variable.…”
Section: Missing Data Techniquesmentioning
confidence: 97%
“…Pairwise deletion is typically more accurate (Gleason & Staelin, 1975;Kim & Curry, 1977) and regression imputation is most accurate (Beale & Little, 1975;Buck, 1960;Chan & Dunn, 1972;Gleason & Staelin, 1975;Raymond & Roberts, 1987). Mean substitution is problematic since inserting means in place of missing data tends to reduce variance in a given variable and hence reduce covariance with another variable.…”
Section: Missing Data Techniquesmentioning
confidence: 97%
“…Although these sophisticated methods exist for handling missing data, the effects of using some of the imputation methods on the performance of statistical discrimination algorithms remain unknown. Previously, Chan and Dunn [2] and Chan et al [3] have compared several ad hoc techniques to replace missing data, while Twedt and Gill [4] have compared algorithms such as principal component projections and the EM algorithm in discriminant analysis. More recently, Jurkowski and co-workers [5] compared different approaches to missing data in discrimination applied to medical problems based on Gower's distance.…”
Section: Introductionmentioning
confidence: 99%
“…All the missing data are replacing with the value of mean available neighboring data [27]. Furthermore, other studies stated that mean substitution method is better and more accurate rather than eliminating the missing value with list wise and pairwise deletion method [28,29,30]. …”
Section: Data Cleaningmentioning
confidence: 99%