2021
DOI: 10.2196/27344
|View full text |Cite
|
Sign up to set email alerts
|

Discovery of Depression-Associated Factors From a Nationwide Population-Based Survey: Epidemiological Study Using Machine Learning and Network Analysis

Abstract: Background In epidemiological studies, finding the best subset of factors is challenging when the number of explanatory variables is large. Objective Our study had two aims. First, we aimed to identify essential depression-associated factors using the extreme gradient boosting (XGBoost) machine learning algorithm from big survey data (the Korea National Health and Nutrition Examination Survey, 2012-2016). Second, we aimed to achieve a comprehensive unde… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
7
0

Year Published

2021
2021
2025
2025

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 16 publications
(7 citation statements)
references
References 36 publications
0
7
0
Order By: Relevance
“…XGBoost is an ensemble learning classifier that utilizes a Gradient Boosting framework to solve supervised learning problems. XGBoost has been widely used in medical studies to predict or screen prognosis [ 33 , 34 , 35 ]. XGBoost also uses a trained prediction model to provide a regularized gradient enhancement and feature importance score, which can be applied to feature selection.…”
Section: Methodsmentioning
confidence: 99%
“…XGBoost is an ensemble learning classifier that utilizes a Gradient Boosting framework to solve supervised learning problems. XGBoost has been widely used in medical studies to predict or screen prognosis [ 33 , 34 , 35 ]. XGBoost also uses a trained prediction model to provide a regularized gradient enhancement and feature importance score, which can be applied to feature selection.…”
Section: Methodsmentioning
confidence: 99%
“…Extreme gradient boosting (XGBoost) has been widely applied in various aspects, such as credit scoring, transportation or even medicine (for instance, for cancer morbidity prediction). Several recent studies show that GBT is the best performing algorithm in the prediction of various aspects, such as the prediction of suicide, psychological health, stress, mental health problems or depression [68][69][70][71][72][73][74][75][76][77][78][79][80][81]. Sanderson et al [68] claim that "the gradient boosted trees model class appeared to be the most promising model class for future research".…”
Section: Methodsmentioning
confidence: 99%
“…Likewise, the polygenic risk score of CRP (but not IL-6 and other proinflammatory proteins) was most potently linked to fatigue and decreased anhedonia (Kappelmann et al, 2021). In addition, levels of triglyceride, total cholesterol, and insulin resistance, but not HDL, displayed the most substantial concurrent relations with higher depression severity in Korean adults (Nam et al, 2021).…”
mentioning
confidence: 90%