2010
DOI: 10.5120/1537-140
|View full text |Cite
|
Sign up to set email alerts
|

Multiple Imputation of Missing Data with Genetic Algorithm based Techniques

Abstract: ABSRACTMissing data is one of the major issues in data mining and pattern recognition. The knowledge contains in attributes with missing data values are important in improving decisionmaking process of an organization. The learning process on each instance is necessary as it may contain some exceptional knowledge. There are various methods to handle missing data in decision tree learning. The proposed imputation algorithm is based on the genetic algorithm that uses domain values for that attribute as pool of s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
4
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(6 citation statements)
references
References 18 publications
0
4
0
Order By: Relevance
“…For this reason, a brief test is to compare pieces of data -one with missing observations and the other without missing observations. On a t-test, if there is no mean difference between the two data units, we will expect that the data are MCAR [5]. Anything that is missing and sometimes because this form of missing facts is not often observed and the best manner to ignore these instances, for example: Water damage to paper forms due to flooding before it enters [1], [2] or in a survey, if we get 5% responses missing randomly, it is MCAR [6], [7].…”
Section: Missing Completely At Random (Mcar)mentioning
confidence: 99%
“…For this reason, a brief test is to compare pieces of data -one with missing observations and the other without missing observations. On a t-test, if there is no mean difference between the two data units, we will expect that the data are MCAR [5]. Anything that is missing and sometimes because this form of missing facts is not often observed and the best manner to ignore these instances, for example: Water damage to paper forms due to flooding before it enters [1], [2] or in a survey, if we get 5% responses missing randomly, it is MCAR [6], [7].…”
Section: Missing Completely At Random (Mcar)mentioning
confidence: 99%
“…In [30], a GA is used to impute missing values in discrete features. In [27,22], GA-based imputation is also used for classification with missing values.…”
Section: Evolutionary Computation-based Imputationmentioning
confidence: 99%
“…However, due to mechanical faults and changes in the system behavior, the collected traffic data often have corrupted or missing data points, bringing some error to analysis result [8,9] . The quality of traffic data not only deeply affects the analysis results of traffic flow operation, but also affects the efficiency of the traffic system operation [10][11][12]. For these reasons, increasingly more methods have been developed to measure and improve the traffic data quality in the past.…”
Section: Introductionmentioning
confidence: 99%