2001
DOI: 10.1006/mgme.2001.3193
|View full text |Cite
|
Sign up to set email alerts
|

Feature (Gene) Selection in Gene Expression-Based Tumor Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
59
0
1

Year Published

2002
2002
2009
2009

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 123 publications
(61 citation statements)
references
References 15 publications
1
59
0
1
Order By: Relevance
“…A s explained by Xiong et al (1), there is increasing interest in changing the emphasis of tumor classification from morphologic to molecular. In this context, the problem is to construct a classifier or a prediction (discriminant) rule R that can accurately predict the class of origin of a tumor tissue with feature vector x, which is unclassified with respect to a known number g (Ն2) of distinct tissue types.…”
mentioning
confidence: 99%
“…A s explained by Xiong et al (1), there is increasing interest in changing the emphasis of tumor classification from morphologic to molecular. In this context, the problem is to construct a classifier or a prediction (discriminant) rule R that can accurately predict the class of origin of a tumor tissue with feature vector x, which is unclassified with respect to a known number g (Ն2) of distinct tissue types.…”
mentioning
confidence: 99%
“…In a two group setting the BW-ratio reduces to the same statistic as the t-test. The prediction strength (PS) (Xiong et al 2001) of a certain gene is defined as the ratio of the difference in mean log expression level between the two groups and the sum of the variances of the two classes. The between-class scatter score (BC-score) belongs to the class of correlation scores (Chai and Domenicioni 2004).…”
Section: Other Test Statisticsmentioning
confidence: 99%
“…Recent researches have shown that a small number of genes is sufficient for accurate diagnosis of most cancers, even though the number of genes vary greatly between different diseases [15]. Indeed, a large set of gene expression features will not only significantly bring higher computational cost and slow down the learning process, but also decrease the classification accuracy due to the phenomenon known as curse of dimensionality, in which the risk of over-fitting increases as the number of selected genes grows [15].…”
Section: Gene Selectionmentioning
confidence: 99%
“…Indeed, a large set of gene expression features will not only significantly bring higher computational cost and slow down the learning process, but also decrease the classification accuracy due to the phenomenon known as curse of dimensionality, in which the risk of over-fitting increases as the number of selected genes grows [15]. More importantly, by using a small subset of genes, we can not only get a better diagnostic accuracy, but also get an opportunity to further analyse the nature of the disease and the genetic mechanisms responsible for it.…”
Section: Gene Selectionmentioning
confidence: 99%