2020
DOI: 10.1016/j.ijcce.2020.11.001
|View full text |Cite
|
Sign up to set email alerts
|

Microarray cancer feature selection: Review, challenges and research directions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
48
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 57 publications
(48 citation statements)
references
References 153 publications
0
48
0
Order By: Relevance
“…Indeed, in the genomic domain (Figure 2), the hybrid learning approach gives results that are always superior to feature selection alone, as well as superior or comparable to data balancing or cost-sensitive learning alone, but with significant advantages in terms of computational cost and domain understanding, as only the most predictive features are used for prediction. The importance of devising learning strategies that use a reduced number of genes for cancer diagnosis, while ensuring at the same time a good predictive performance, has been widely highlighted in this domain [41,[48][49][50], and the hybrid approaches here discussed seem to provide a viable solution in this respect.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Indeed, in the genomic domain (Figure 2), the hybrid learning approach gives results that are always superior to feature selection alone, as well as superior or comparable to data balancing or cost-sensitive learning alone, but with significant advantages in terms of computational cost and domain understanding, as only the most predictive features are used for prediction. The importance of devising learning strategies that use a reduced number of genes for cancer diagnosis, while ensuring at the same time a good predictive performance, has been widely highlighted in this domain [41,[48][49][50], and the hybrid approaches here discussed seem to provide a viable solution in this respect.…”
Section: Resultsmentioning
confidence: 99%
“…For the genomic domain, we considered two highly imbalanced benchmarks from the GEMLeR collection [40]: the task is to discriminate uterus or omentum cancer from other cancer types, based on the expression level of over ten thousand genes. Since the available instances (i.e., the biological samples) are far fewer than the features (i.e., the genes), this kind of classification task turns out to be especially challenging, as recognized by a vast literature in the field [21,41].…”
Section: Datasets and Settingsmentioning
confidence: 99%
“…Cancer is one of the most fatal diseases around the globe ( Tabares-Soto et al, 2020 ; Hambali, Oladele & Adewole, 2020 ). According to the World Health Organization report, Cancer is marked as the second most deadly disease and an estimated 9.7 million deaths around the world in 2018 have occurred due to this signature disease ( Hambali, Oladele & Adewole, 2020 ). Generally, one in every six deaths all over the world, occurs due to cancer.…”
Section: Introductionmentioning
confidence: 99%
“…Analyzing gene expression data is crucial to find out harmful mutations and to avoid further consequences. [2] In recent years enormous research has been carried out in detecting type of cancer using genomic profiles. Still to achieve satisfactory cancer classification accuracy with the complete set of genes remains a great challenge, due to the high dimensions, small sample size, and presence of noise in gene expression data.…”
Section: Introductionmentioning
confidence: 99%
“…As per the survey done in [2] To solve the curse of dimensionality statistical methods like regression & non-parametric regression methods are used on Leukemia Microarray dataset resulted in better accuracy. [12] Grouping Genetic Algorithm (GGA) implemented on RNA-seq data for five types of cancers gave average accuracy of 98.81 & standard deviation of 0.0174.…”
Section: Introductionmentioning
confidence: 99%