2021
DOI: 10.1063/5.0053286
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of five imputation methods in handling missing data in a continuous frequency table

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 27 publications
0
0
0
Order By: Relevance
“…For missing values, we employed a variety of imputation methods, including mean, median, mode, and K-nearest neighbors (KNN) imputation, selecting the most appropriate technique based on the nature of the data [26]. To identify and correct outliers, we consulted with pharmacy experts to discern whether an anomaly was a true outlier or a data entry error.…”
Section: Data Cleaning and Transformationmentioning
confidence: 99%
“…For missing values, we employed a variety of imputation methods, including mean, median, mode, and K-nearest neighbors (KNN) imputation, selecting the most appropriate technique based on the nature of the data [26]. To identify and correct outliers, we consulted with pharmacy experts to discern whether an anomaly was a true outlier or a data entry error.…”
Section: Data Cleaning and Transformationmentioning
confidence: 99%
“…Let 𝑦 1 , 𝑦 2 ,…, 𝑦 𝑛0 be the observed values and 𝑦 1 π‘šπ‘–π‘  , 𝑦 2 π‘šπ‘–π‘  ,…, 𝑦 𝑛 𝑝 π‘šπ‘–π‘  be the missing observations of a dataset containing n=𝑛 0 +𝑛 𝑝 observations; the random sample imputation replaces the missing observations with a sample of 𝑛 𝑝 from 𝑛 0 observed values. The random sample estimation has the benefit of working with both continuous and discrete variables (Mohammed et al, 2021).…”
Section: Sample Imputationmentioning
confidence: 99%
“…In the unimodal datasets, participants and features with more than 55% missing values were excluded (deviating from our pre-registered 50% as this would have reduced our sample sizes too much). Remaining missing values were imputed using the k-nearest neighbors 80 method, a relatively simple non-parametric multivariate imputation technique more timeefficient than more complex imputation techniques 81 . Following common practice, we chose k by taking the square root of the number of observations in each of the respective datasets.…”
Section: Missing Value Imputationmentioning
confidence: 99%