2016
DOI: 10.1186/s12911-016-0318-z
|View full text |Cite
|
Sign up to set email alerts
|

Nearest neighbor imputation algorithms: a critical evaluation

Abstract: BackgroundNearest neighbor (NN) imputation algorithms are efficient methods to fill in missing data where each missing value on some records is replaced by a value obtained from related cases in the whole set of records. Besides the capability to substitute the missing data with plausible values that are as close as possible to the true value, imputation algorithms should preserve the original data structure and avoid to distort the distribution of the imputed variable. Despite the efficiency of NN algorithms … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

2
326
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 457 publications
(329 citation statements)
references
References 19 publications
2
326
0
1
Order By: Relevance
“…From their medical history, we knew they did not suffer from the entities e‐m. In accordance with our previous study, we used nearest neighbour hot deck‐imputation to classify this small group of patients (6.9%) into the subgroups a‐d (GD/MNTG/mixed‐type/STA: n=10/9/8/1). Nearest neighbour hot deck‐imputation did not alter any of the results of the present study.…”
Section: Methodsmentioning
confidence: 96%
“…From their medical history, we knew they did not suffer from the entities e‐m. In accordance with our previous study, we used nearest neighbour hot deck‐imputation to classify this small group of patients (6.9%) into the subgroups a‐d (GD/MNTG/mixed‐type/STA: n=10/9/8/1). Nearest neighbour hot deck‐imputation did not alter any of the results of the present study.…”
Section: Methodsmentioning
confidence: 96%
“…In this selected cohort, some variables presented with missing values (see Figure S2 for distribution of missing values across the studied variables). The k ‐nearest neighbour ( k NN) method was used to impute the missing values with 5 neighbours (ie, k = 5) . The k NN method is an efficient imputation technique by which each missing value on some data fields is replaced with a value calculated from related neighbouring cases across the whole records of a dataset.…”
Section: Methodsmentioning
confidence: 99%
“…The k NN method is an efficient imputation technique by which each missing value on some data fields is replaced with a value calculated from related neighbouring cases across the whole records of a dataset. It has demonstrated good capacity for preserving the original data structure . For each continuous univariable variable, a two‐sample t test was used in between two subgroups at 5% significance level, and a Pearson chi‐squared test was used in categorical variables with 5% significance level.…”
Section: Methodsmentioning
confidence: 99%
“…This algorithm is an efficient method to fill in missing data. Each missing value on a record is replaced by a value from related cases in the whole set of records that depends on the type of variable used: categorical missing values are replaced by the mode and quantitative ones are replaced by the mean . The number of neighbors was fixed to 10 before conducting experiments.…”
Section: Methodsmentioning
confidence: 99%