2006
DOI: 10.3844/jcssp.2006.740.745
|View full text |Cite
|
Sign up to set email alerts
|

A framework to Deal with Missing Data in Data Sets

Abstract: Most information systems usually have some missing values due to unavailable data. Missing values minimizing the quality of classification rules generated by a data mining system. Missing vales also affecting the quantity of classification rules achieved by the data mining system. Missing values could influence the coverage percentage and number of reducts generated. Missing values lead to the difficulty of extracting useful information from that data set. Solving the problem of missing data is of a high prior… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2007
2007
2021
2021

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 19 publications
0
6
0
Order By: Relevance
“…Removing such noise will improve the accuracy of the dataset. Al-Shalabi et al (2006) highlighted some reasons for missing data including the followings: the value is not relevant to a particular case, not recorded, or ignored because of privacy concerns. One of the solutions for missing data is to delete all records that have missing data (Dempster et al, 1977).…”
Section: Data Preprocessingmentioning
confidence: 99%
“…Removing such noise will improve the accuracy of the dataset. Al-Shalabi et al (2006) highlighted some reasons for missing data including the followings: the value is not relevant to a particular case, not recorded, or ignored because of privacy concerns. One of the solutions for missing data is to delete all records that have missing data (Dempster et al, 1977).…”
Section: Data Preprocessingmentioning
confidence: 99%
“…The proportion of missing data in some samples or features is relatively large. They are meaningless for research because they contained little useful information [ 20 ]. So, this study chooses to delete them directly.…”
Section: Methodsmentioning
confidence: 99%
“…Zou [121] presents a framework based on the number of records, number of attributes, number of symbolic attributes, database entropy, number of classes, and missing data rate. Shalabi et al [98] propose a framework that implements four techniques for treating missing data, and the choice of the technique to be used is based on the rough sets theory.…”
Section: Procedures Based On Direct Manipulation Of Missing Datamentioning
confidence: 99%