2021
DOI: 10.7717/peerj.10884
|View full text |Cite
|
Sign up to set email alerts
|

Predicting lung adenocarcinoma disease progression using methylation-correlated blocks and ensemble machine learning classifiers

Abstract: Applying the knowledge that methyltransferases and demethylases can modify adjacent cytosine-phosphorothioate-guanine (CpG) sites in the same DNA strand, we found that combining multiple CpGs into a single block may improve cancer diagnosis. However, survival prediction remains a challenge. In this study, we developed a pipeline named “stacked ensemble of machine learning models for methylation-correlated blocks” (EnMCB) that combined Cox regression, support vector regression (SVR), and elastic-net models to c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 67 publications
0
7
0
Order By: Relevance
“…Similar to clustering-based workflows, an initial screening was generally the first step for the 30 studies using supervised feature selection-based workflows (Figure 4.2) (6-9, 24-28, 30, 31, 36, 40-42, 48, 51, 59, 60, 63, 66-68, 71-74, 81, 84, 85). In the next step, nearly all studies used one to three types of supervised feature selection techniques to pre-select prognostically relevant DNA methylation biomarkers prior to training one or more prognostic models.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…Similar to clustering-based workflows, an initial screening was generally the first step for the 30 studies using supervised feature selection-based workflows (Figure 4.2) (6-9, 24-28, 30, 31, 36, 40-42, 48, 51, 59, 60, 63, 66-68, 71-74, 81, 84, 85). In the next step, nearly all studies used one to three types of supervised feature selection techniques to pre-select prognostically relevant DNA methylation biomarkers prior to training one or more prognostic models.…”
Section: Resultsmentioning
confidence: 99%
“…In the stage of model training, further feature selection was performed by more than half of the 30 studies (8, 9, 24, 25, 28, 31, 36, 40-42, 48, 51, 66, 68, 72-74, 81, 84, 85). Sample-splitting was the most commonly used internal model validation method ( N = 12), and external validation was performed in 15 studies (7, 8, 24-26, 31, 42, 48, 59, 66-68, 72, 74, 81).…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations