2004
DOI: 10.1016/j.artmed.2003.06.001
|View full text |Cite
|
Sign up to set email alerts
|

A constrained-syntax genetic programming system for discovering classification rules: application to medical data sets

Abstract: This paper proposes a new constrained-syntax genetic programming (GP) algorithm for discovering classification rules in medical data sets. The proposed GP contains several syntactic constraints to be enforced by the system using a disjunctive normal form representation, so that individuals represent valid rule sets that are easy to interpret. The GP is compared with C4.5, a very well-known decision-tree-building algorithm, and with another GP that uses boolean inputs (BGP), in five medical data sets: Chest pai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
58
0
3

Year Published

2006
2006
2020
2020

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 107 publications
(61 citation statements)
references
References 18 publications
(25 reference statements)
0
58
0
3
Order By: Relevance
“…GEPCLASS used the following parameters: population size: 30 individuals; stopping criterion: 50 generations; number of genes per chromosome: 2; linking function: logical and; head size range: 6-10; selection method: stochastic tournament; genetic operators: all those defined by the original GEP [3] with the same probabilities. All results reported in this work were obtained by performing a 5-fold cross-validation procedure [5], and using exactly the same data partitions as in [2]. Table 1 shows the accuracy rate for the four data sets, using C4.5, CSGP, BGP, and GEPCLASS.…”
Section: Computational Experiments and Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…GEPCLASS used the following parameters: population size: 30 individuals; stopping criterion: 50 generations; number of genes per chromosome: 2; linking function: logical and; head size range: 6-10; selection method: stochastic tournament; genetic operators: all those defined by the original GEP [3] with the same probabilities. All results reported in this work were obtained by performing a 5-fold cross-validation procedure [5], and using exactly the same data partitions as in [2]. Table 1 shows the accuracy rate for the four data sets, using C4.5, CSGP, BGP, and GEPCLASS.…”
Section: Computational Experiments and Resultsmentioning
confidence: 99%
“…Therefore, we compared GEPCLASS with a constrainedsyntax genetic programming (CSGP) system proposed by [2], over four realworld data sets. In that work there is also a comparison with a "Booleanized" version of genetic programming (BGP) [1] that we reproduced here.…”
Section: Computational Experiments and Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In (Bojarczuk 2004), a number of datasets were analyzed using genetic programming, including two breast cancer datasets and an adrenocortical cancer dataset. The first breast cancer dataset is the Wisconsin breast cancer dataset described in the UC Irvine Machine Learning Data Repository (MLDR).…”
Section: Other Cancer Related Data Analysesmentioning
confidence: 99%