2005
DOI: 10.1007/11574798_3
|View full text |Cite
|
Sign up to set email alerts
|

Dealing with Missing Data: Algorithms Based on Fuzzy Set and Rough Set Theories

Abstract: Abstract. Missing data, commonly encountered in many fields of study, introduce inaccuracy in the analysis and evaluation. Previous methods used for handling missing data (e.g., deleting cases with incomplete information, or substituting the missing values with estimated mean scores), though simple to implement, are problematic because these methods may result in biased data models. Fortunately, recent advances in theoretical and computational statistics have led to more flexible techniques to deal with the mi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0
1

Year Published

2009
2009
2021
2021

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(4 citation statements)
references
References 24 publications
0
3
0
1
Order By: Relevance
“…Once all descriptors and fingerprints were calculated for each SMILES record in the dataset, preprocessing was performed on the dataset. Proper preprocessing results in clean, correct, and complete data while improper preprocessing may lead to poor results and affect the accuracy and efficiency of the algorithm ( Li et al, 2005a ).…”
Section: Methodsmentioning
confidence: 99%
“…Once all descriptors and fingerprints were calculated for each SMILES record in the dataset, preprocessing was performed on the dataset. Proper preprocessing results in clean, correct, and complete data while improper preprocessing may lead to poor results and affect the accuracy and efficiency of the algorithm ( Li et al, 2005a ).…”
Section: Methodsmentioning
confidence: 99%
“…Rough set theory was proposed by Pawlak as a mathematical framework to conceptualize inaccurate, vague, or uncertain data 4,2 . Various applications have been developed in Artificial Intelligence using rough sets, like clustering and classification tools 1,9,17 .…”
Section: Rough Setsmentioning
confidence: 99%
“…An important source of uncertainty to which much attention has not been given in WLA models is uncertainty due to partial ignorance due to missing data or inadequate data. Some previous methods of handling missing data such as deleting cases with incomplete information, or substituting the missing values with estimated mean scores may result in biased data models (Li et al, 2005). Chen and Ma (2007) have quantified the influence of uncertainty of the design flow on WLA by considering it as an uncertain input parameter by computing 95% confidence intervals for the mean values of stream flow.…”
Section: Introductionmentioning
confidence: 99%