2021
DOI: 10.3389/fpubh.2021.668351
|View full text |Cite
|
Sign up to set email alerts
|

Fatty Liver Disease Prediction Model Based on Big Data of Electronic Physical Examination Records

Abstract: Fatty liver disease (FLD) is a common liver disease, which poses a great threat to people's health, but there is still no optimal method that can be used on a large-scale screening. This research is based on machine learning algorithms, using electronic physical examination records in the health database as data support, to a predictive model for FLD. The model has shown good predictive ability on the test set, with its AUC reaching 0.89. Since there are a large number of electronic physical examination record… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…The use of big data may avoid overfitting the trained model. Despite the utilization of a large dataset for model training in a previous study [ 33 ], there was no comparison of the model’s performance with that of an existing validated index as in the present study. The use of the developed model may require validation before its application in clinical practice.…”
Section: Discussionmentioning
confidence: 91%
See 1 more Smart Citation
“…The use of big data may avoid overfitting the trained model. Despite the utilization of a large dataset for model training in a previous study [ 33 ], there was no comparison of the model’s performance with that of an existing validated index as in the present study. The use of the developed model may require validation before its application in clinical practice.…”
Section: Discussionmentioning
confidence: 91%
“…This model adds a regular term to the cost function to control the complexity of the model, simplifying it and preventing overfitting with improved training speed. In addition, xgBoost is a model based on the decision tree model, and it is more explanatory than neural networks and other algorithms are [ 33 ].…”
Section: Discussionmentioning
confidence: 99%
“…Participants were randomly assigned to the training and internal validation sets in a 7:3 ratio [ 29 , 30 ]. In addition, to further validate the performance of the prediction model, we used the follow-up population from 2015 to 2020 as an external validation set.…”
Section: Methodsmentioning
confidence: 99%