2021
DOI: 10.3389/fgene.2021.691274
|View full text |Cite
|
Sign up to set email alerts
|

ImputEHR: A Visualization Tool of Imputation for the Prediction of Biomedical Data

Abstract: Electronic health records (EHRs) have been widely adopted in recent years, but often include a high proportion of missing data, which can create difficulties in implementing machine learning and other tools of personalized medicine. Completed datasets are preferred for a number of analysis methods, and successful imputation of missing EHR data can improve interpretation and increase our power to predict health outcomes. However, use of the most popular imputation methods mainly require scripting skills, and ar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 30 publications
0
1
0
Order By: Relevance
“…Feature pre-processing included clipping the outlier values to the 5th and 95th percentile values and scaling between [0,1] using the Minmax Scalar package from sklearn (version 1.5.0) [34]. Since each of the selected features are ordinal variables and had a very low missing rate of <5%, we imputed the missing values for each feature column using the median value of that feature across all visits of all patients, following previous work [35] (Fig 1).…”
Section: Feature Pre-processingmentioning
confidence: 99%
“…Feature pre-processing included clipping the outlier values to the 5th and 95th percentile values and scaling between [0,1] using the Minmax Scalar package from sklearn (version 1.5.0) [34]. Since each of the selected features are ordinal variables and had a very low missing rate of <5%, we imputed the missing values for each feature column using the median value of that feature across all visits of all patients, following previous work [35] (Fig 1).…”
Section: Feature Pre-processingmentioning
confidence: 99%