2020
DOI: 10.1504/ijbdm.2020.106883
|View full text |Cite
|
Sign up to set email alerts
|

Missing data imputation by the aid of features similarities

Abstract: The missing data is likely to occur in statistical analyses. The quality of the data is affected by the used imputation method. In this paper, a method is proposed to impute the missing data on variables of interest (i.e., recipient) using observed values from other variables (i.e., donors). Some existing methods rely upon only the recipient (e.g., unconditional means), others rely on the recipient and one donor (i.e., interpolation). The proposed method depends on the similarities of the values in the donor t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 24 publications
0
6
0
Order By: Relevance
“…This included addressing missing data points, which can significantly impact model performance. Notably, all missing values represented by "NaN" (Not a Number) were imputed using a suitable method [16,17]. • Data splitting: Following this cleaning process, the dataset was strategically divided into two distinct sets: a training set and a testing set.…”
Section: Methodsmentioning
confidence: 99%
“…This included addressing missing data points, which can significantly impact model performance. Notably, all missing values represented by "NaN" (Not a Number) were imputed using a suitable method [16,17]. • Data splitting: Following this cleaning process, the dataset was strategically divided into two distinct sets: a training set and a testing set.…”
Section: Methodsmentioning
confidence: 99%
“…Detecting the missingness mechanism is considered an important step for manipulating MVs. This paper considers and deals with the three kinds of missingness mechanisms [5][6][7][8].…”
Section: Missingness Mechanismsmentioning
confidence: 99%
“…• Missing completely at random (MCAR) [12]: Assume that the missing value indicator matrix M = M ij and the complete data Y = y ij . The missing data mechanism is described by the conditional distribution of M given Y, say f (M Y, ∅) where ∅ represents the unknown parameter.…”
Section: Missingness Mechanismsmentioning
confidence: 99%
“…• Missing at random (MAR) [12]: Let Y mis and Y obs denote missing data and observed data, respectively. If the Missingness do not depend on the data that are missing, but depends only on Y obs of Y, then, f (M|Y, ∅) = f (M|Y obs , ∅) f or all Y mis , ∅…”
Section: Missingness Mechanismsmentioning
confidence: 99%
See 1 more Smart Citation