2013
DOI: 10.5120/9983-4814
|View full text |Cite
|
Sign up to set email alerts
|

Review on Feature Selection Techniques of DNA Microarray Data

Abstract: Feature selection from DNA microarray data is one of the most important procedures in bioinformatics. The huge dimensionality of the DNA microarray data becomes a problem when it is used for cancer classification. This problem can be alleviated by employing feature selection as a preprocessing step in classification.This paper reviews some of the major feature selection techniques employed in microarray data and points out the merits and demerits of various approaches.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
0
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 9 publications
(2 citation statements)
references
References 18 publications
0
0
0
Order By: Relevance
“…𝑘∈𝑁 𝑖 (9) where 𝑁 𝑖 denotes the set of neighbors of 𝑥 𝑖 . The purpose of NCA is to learn a linear transform 𝐴 that maximizes log probability by selecting each data point as neighbors with the same labels as itself after the transformation [46].…”
Section: ) Neighborhood Component Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…𝑘∈𝑁 𝑖 (9) where 𝑁 𝑖 denotes the set of neighbors of 𝑥 𝑖 . The purpose of NCA is to learn a linear transform 𝐴 that maximizes log probability by selecting each data point as neighbors with the same labels as itself after the transformation [46].…”
Section: ) Neighborhood Component Analysismentioning
confidence: 99%
“…One of the important problems with these methods is to select multiple features carrying the same information when strong correlations exist between features unnecessarily. For example, high-dimensional and strongly correlated data such as reflectance, image, text or DNA microarray data problem arises that hindering the learning process, especially in classification [6,[8][9][10][11]. The focus of this study is to determine the variables that will give the highest classification accuracy in classification of high dimensional data.…”
Section: Introductionmentioning
confidence: 99%