2013
DOI: 10.5120/14198-2392
|View full text |Cite
|
Sign up to set email alerts
|

A Comparative Study of Microarray Data Analysis for Cancer Classification

Abstract: Cancer is most deadly human disease. According to WHO 7.6 million deaths (around 13% of all deaths) in 2008 were caused by cancer. A Cancer diagnosis can be achieved with gene expression microarray data. Microarray allows monitoring of thousands of genes of a sample simultaneously. But all the genes in gene expression data are not informative. The relevant gene selection/extraction is the main challenge in microarray data analysis. Microarray data classification is two stage process i.e. features selection and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 9 publications
0
3
0
Order By: Relevance
“…The Weka machine learning environment is employed in this research https://ai.waikato.ac.nz/weka/ , as the Weka resource provides a number of techniques that can be used for data validation. Two such techniques are ‘leave-one-out cross-validation’, or LOOCV, and k -fold cross-validation, both of which randomly classify items of data as being part of either ‘training’ or a ‘testing’ set [ 81 , 82 ]. The LOOCV approach involves a ‘classifier’ being learned for all bar one of a sample and tested on that one data point [ 83 ].…”
Section: Dataset Tools and Techniques Appliedmentioning
confidence: 99%
“…The Weka machine learning environment is employed in this research https://ai.waikato.ac.nz/weka/ , as the Weka resource provides a number of techniques that can be used for data validation. Two such techniques are ‘leave-one-out cross-validation’, or LOOCV, and k -fold cross-validation, both of which randomly classify items of data as being part of either ‘training’ or a ‘testing’ set [ 81 , 82 ]. The LOOCV approach involves a ‘classifier’ being learned for all bar one of a sample and tested on that one data point [ 83 ].…”
Section: Dataset Tools and Techniques Appliedmentioning
confidence: 99%
“…However, there are plenty of public available gene expression datasets commonly used by researchers in the field of cancer selection and classification experiments. Lists of the most publicly available colon cancer datasets can be found in [6,8,26,27,28].…”
Section: Background Of Datasetsmentioning
confidence: 99%
“…A set of algorithms that have previously demonstrated effectiveness in solving classification problems applied in machine learning studies has been adopted for the current study [19]. Chitode and Nogari defined classification as the process of discovering a prototype that designates and discriminates among different data classes (types) [8]. Classification accuracy is measured in terms of the proportion of expected samples to the overall number of samples, as represented in Equation 1 below [31]:…”
Section: Background Of Datasetsmentioning
confidence: 99%