2014
DOI: 10.2478/amcs-2014-0048
|View full text |Cite
|
Sign up to set email alerts
|

Data mining methods for gene selection on the basis of gene expression arrays

Abstract: The paper presents data mining methods applied to gene selection for recognition of a particular type of prostate cancer on the basis of gene expression arrays. Several chosen methods of gene selection, including the Fisher method, correlation of gene with a class, application of the support vector machine and statistical hypotheses, are compared on the basis of clustering measures. The results of applying these individual selection methods are combined together to identify the most often selected genes formin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 24 publications
0
12
0
Order By: Relevance
“…The second applied filter method was feature correlation with class (COR), a univariate filter feature selection method that can be used as a pre-selection step in microarray gene selection [ 54 , 55 ]. The value of feature discrimination, S(f) , is expressed by where c is the mean value for the gene among both classes, c k is the mean value for the k th class gene, σ 2 ( f ) is the gene variance, and P k is the probability of appearance of the k th class in the dataset.…”
Section: Methodsmentioning
confidence: 99%
“…The second applied filter method was feature correlation with class (COR), a univariate filter feature selection method that can be used as a pre-selection step in microarray gene selection [ 54 , 55 ]. The value of feature discrimination, S(f) , is expressed by where c is the mean value for the gene among both classes, c k is the mean value for the k th class gene, σ 2 ( f ) is the gene variance, and P k is the probability of appearance of the k th class in the dataset.…”
Section: Methodsmentioning
confidence: 99%
“…First, the procedure Generate creates a new concept and adds the new concept to concept lattice (lines 1-2). According to Proposition 9, we test every candidate in c.Children to find real children of newConcept (lines [4][5][6][7][8][9][10][11][12][13][14][15][16][17][18][19][20][21][22]. Note that the concept c.children.indicator points to has already been obtained after executing the Preprocessprocedure.…”
Section: Generation and Removal Of Conceptsmentioning
confidence: 99%
“…Algorithm 1 reveals that c can be marked directly if c is a merged concept. When c is a deleted or modified concept, it requires comparisons between c andc.Parentsfor finding a parent with c.Intent = parent.Intent (lines [5][6][7][8][9][10][11][12][13]. This operation requires only one comparison in the best case or |G| comparisons at worst case between sets (intents) which takes at most O(|G||M | 2 ) time.…”
Section: Complexity Issuesmentioning
confidence: 99%
“…Gene selection, according to biologists, results in more compact gene sets, which lowers diagnostics costs and makes it easier to comprehend the roles of linked genes [2]. In the high-dimensional space of a small number of observations, comparing gene expression profiles and picking those that are best related with the examined forms of data is a difficult issue in pattern recognition, which can be tackled utilizing specialized data mining approaches [3]. Despite the rapid advancements in this subject, there is always a need for further understanding and research development.…”
Section: Introductionmentioning
confidence: 99%