2014
DOI: 10.1515/ijb-2013-0038
|View full text |Cite
|
Sign up to set email alerts
|

Estimation of a Predictor’s Importance by Random Forests When There Is Missing Data: RISK Prediction in Liver Surgery using Laboratory Data

Abstract: In the last few decades, new developments in liver surgery have led to an expanded applicability and an improved safety. However, liver surgery is still associated with postoperative morbidity and mortality, especially in extended resections. We analyzed a large liver surgery database to investigate whether laboratory parameters like haemoglobin, leucocytes, bilirubin, haematocrit and lactate might be relevant preoperative predictors. It is not uncommon to observe missing values in such data. This also holds f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 49 publications
0
9
0
Order By: Relevance
“…For example, RF has been used to improve the prediction of missing values using laboratory generated medical data [16] and also to classify salt marsh vegetation [34]. In general, RF is a decision tree based method for classification and regression.…”
Section: Data Recovery Using Machine Learning Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, RF has been used to improve the prediction of missing values using laboratory generated medical data [16] and also to classify salt marsh vegetation [34]. In general, RF is a decision tree based method for classification and regression.…”
Section: Data Recovery Using Machine Learning Methodsmentioning
confidence: 99%
“…The importance of data recovery in numerous fields such as medical [15,16], neuro-computation [17], and climate science [18,19] has long been realized. Several approaches have been applied to recover the value of missing data.…”
Section: Past Research On Data Recoverymentioning
confidence: 99%
“…In this model, the importance of the variables entered into the model is measured based on two criteria, mean decrease in accuracy and mean decrease in Gini [114]. This feature of the model has led to its use in many studies to measure the importance of variables [115][116][117][118][119]. There are other advantages that have led to the increase in the usefulness of this model.…”
Section: Importance Of Influential Variables In the Modelingmentioning
confidence: 99%
“…Measuring variable importance becomes increasingly complicated when data are missing. Hapfelmeier et al (2012Hapfelmeier et al ( , 2014a introduce an approach for estimating variable importance in such settings. This approach is intended to capture the value of the information provided by the data present, and variables with large amounts of missing data are often ranked as less important than they would be if complete data were available.…”
Section: Variable Importancementioning
confidence: 99%
“…Manual-setting up, and understanding random forests V4.0. Hapfelmeier, A., Hothorn, T., Riediger, C., and Ulm, K. (2014a). Estimation of a predictor's importance by random forests when there is missing data: risk prediction in liver surgery using laboratory data.…”
Section: Future Research Focused On the Development Of Variable Impormentioning
confidence: 99%