2019
DOI: 10.5815/ijisa.2019.12.03
|View full text |Cite
|
Sign up to set email alerts
|

A New Hybrid Genetic and Information Gain Algorithm for Imputing Missing Values in Cancer Genes Datasets

Abstract: A DNA microarray can represent thousands of genes for studying tumor and genetic diseases in humans. Datasets of DNA microarray normally have missing values, which requires an undeniably crucial process for handling missing values. This paper presents a new algorithm, named EMII, for imputing missing values in medical datasets. EMII algorithm evolutionarily combines Information Gain (IG) and Genetic Algorithm (GA) to mutually generate imputable values. EMII algorithm is column-oriented not instance oriented th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 22 publications
0
6
0
Order By: Relevance
“…Table 2 shows the classification confusion matrix, including true positives, false positives, false negatives, and true negatives separately. As a result, a two-by-two confusion matrix (sometimes also referred to as a confusion matrix) was created [ 45 ]. By using the accuracy and the f1-measure, the classification performance could be studied in greater detail, as shown in Table 3 .…”
Section: 3 Active Learning Selection Strategiesmentioning
confidence: 99%
“…Table 2 shows the classification confusion matrix, including true positives, false positives, false negatives, and true negatives separately. As a result, a two-by-two confusion matrix (sometimes also referred to as a confusion matrix) was created [ 45 ]. By using the accuracy and the f1-measure, the classification performance could be studied in greater detail, as shown in Table 3 .…”
Section: 3 Active Learning Selection Strategiesmentioning
confidence: 99%
“…To address this sample imbalance problem, dynamic weighting coefficients of each category are incorporated into the DWSA to conduct the classification task. We use accuracy and F1-score [41] to evaluate the model. The dynamic sample weighting algorithm adjusts the weight coefficients of each category.…”
Section: Dwsa Frameworkmentioning
confidence: 99%
“…The concept is based on the survival of the fittest strategy during sexual reproduction as proposed by Charles Darwin [14]. When dealing with finding optimal solutions and stochastic search [15], then genetic algorithm is the best bet. These algorithms relies on the combination of chromosome populations, selection, crossover, and mutation [16] to produce new offspring.…”
Section: Genetic Algorithmmentioning
confidence: 99%