2022
DOI: 10.3389/fgene.2021.793629
|View full text |Cite
|
Sign up to set email alerts
|

Feature Selection of OMIC Data by Ensemble Swarm Intelligence Based Approaches

Abstract: OMIC datasets have high dimensions, and the connection among OMIC features is very complicated. It is difficult to establish linkages among these features and certain biological traits of significance. The proposed ensemble swarm intelligence-based approaches can identify key biomarkers and reduce feature dimension efficiently. It is an end-to-end method that only relies on the rules of the algorithm itself, without presets such as the number of filtering features. Additionally, this method achieves good class… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
7
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 54 publications
0
7
0
Order By: Relevance
“…Thereafter, we designated TCGA as our training matrix, while GSE103091, GSE17907, and GSE26304 served as our external validation cohorts. Multiple machine learning paradigms, encompassing Cox boost, Lasso, 21 Elastic Net (Enet), Ridge, survival‐support vector machine (survival‐svm), 22 generalized augmented regression modeling (GBM), Cox's partial least squares Regression (plsRcox), Stepwise Cox, Random Forest (RSF), and Supervised Principal Component (SuperPC), were employed. From these, we forged 101 algorithmic amalgamations.…”
Section: Methodsmentioning
confidence: 99%
“…Thereafter, we designated TCGA as our training matrix, while GSE103091, GSE17907, and GSE26304 served as our external validation cohorts. Multiple machine learning paradigms, encompassing Cox boost, Lasso, 21 Elastic Net (Enet), Ridge, survival‐support vector machine (survival‐svm), 22 generalized augmented regression modeling (GBM), Cox's partial least squares Regression (plsRcox), Stepwise Cox, Random Forest (RSF), and Supervised Principal Component (SuperPC), were employed. From these, we forged 101 algorithmic amalgamations.…”
Section: Methodsmentioning
confidence: 99%
“…These integrative methods included Lasso, elastic network (Enet), Ridge, stepwise Cox, CoxBoost, partial least squares regression for Cox (plsRcox), RSF, SuperPC, generalized boosted regression modeling (GBM), and survival support vector machine (survival-SVM). The process for creating signatures was as follows: the 173 key genes regulating GM activity were used to fit prediction models based on the leave-one-out cross-validation (LOOCV) framework in the TCGA-LUAD cohort using 117 algorithm combinations ( 23 ). All models were evaluated in eight validation datasets (GSE13213, GSE26939, GSE29016, GSE30219, GSE31210, GSE37745, GSE42127, GSE68465).…”
Section: Methodsmentioning
confidence: 99%
“…To differentiate GBM patients with low ERG scores from those with high scores based on their immune cell features, we employed the single-sample gene set enrichment analysis (ssGSEA) technique. We also estimated the immune and stromal scores of each glioma sample using the R program “ESTIMATE,” which provides an estimate of the quantities of immune and stromal components present in vivo [ 40 ]. To comprehensively investigate the tumor microenvironment’s heterogeneity in different datasets and cell types, we utilized the Tumor Immune Single-Cell Hub (TISCH) database.…”
Section: Methodsmentioning
confidence: 99%