2020
DOI: 10.1016/j.compbiomed.2020.103761
|View full text |Cite
|
Sign up to set email alerts
|

Diagnostic classification of cancers using extreme gradient boosting algorithm and multi-omics data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
82
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 151 publications
(82 citation statements)
references
References 46 publications
0
82
0
Order By: Relevance
“…Random forest is a decision tree algorithm that has shown to be one of the most effective methods for classification of microbiome data, particularly 16S rRNA sequencing data ( Statnikov et al, 2013 ). XGBoost, also a tree-based algorithm, has been recently shown to outperform other machine learning algorithms on a variety of biological datasets ( Dimitrakopoulos et al, 2018 ; Ma et al, 2020 ). Further, we included ridge regression, another widely used algorithm that differs from these tree-based models in that it is a logistic regression algorithm with L2 regularization that still enables us to compare its feature ranking to other algorithms.…”
Section: Introductionmentioning
confidence: 99%
“…Random forest is a decision tree algorithm that has shown to be one of the most effective methods for classification of microbiome data, particularly 16S rRNA sequencing data ( Statnikov et al, 2013 ). XGBoost, also a tree-based algorithm, has been recently shown to outperform other machine learning algorithms on a variety of biological datasets ( Dimitrakopoulos et al, 2018 ; Ma et al, 2020 ). Further, we included ridge regression, another widely used algorithm that differs from these tree-based models in that it is a logistic regression algorithm with L2 regularization that still enables us to compare its feature ranking to other algorithms.…”
Section: Introductionmentioning
confidence: 99%
“…Unclear subtype refers to patient samples that lack information on each of the three IHC markers. With the explosive growth of massive biological data, the transformation of traditional biological statistical methods to computer-aided methods makes machine learning become an important part of predicting cancer prognosis [14]. If all the features in these samples are used to classify and regress, it will lead to overfitting.…”
Section: Introductionmentioning
confidence: 99%
“…The main objective of this algorithm to achieved high accuracy or increase retrained the weak learners for particular data into strong learners (Table 5). Ma B. et al, (2020), introduced about Meta classifier ensemble method in data science and convert organized weak learners into strong learners. Gradient boosting works as adaptive boosting algorithms but differ in optimize problems.…”
Section: Adaboostm1 Ensemble Methodsmentioning
confidence: 99%