2021
DOI: 10.1111/exsy.12785
|View full text |Cite
|
Sign up to set email alerts
|

Hybrid diabetes disease prediction framework based on data imputation and outlier detection techniques

Abstract: In the field of medical science, accurate prediction is a difficult and challenging task. But, the presence of missing values and outliers can make the prediction task more complicated. Many researchers address the issue of missing value in medical data, either detect the missing value and delete the respective data instances from the dataset or adopt some default methods such as mean, median, neighbour etc., for filling the missing value. However, both methods are lacking to produce optimal results. Furthermo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 81 publications
0
3
0
Order By: Relevance
“…The first one is to remove those rows with the outliers altogether because the outlier's percentage is less. The second method is data imputation by random sampling from a normal distribution with the standard deviation from the mean (Srivastava et al, 2022).…”
Section: Data Cleaningmentioning
confidence: 99%
“…The first one is to remove those rows with the outliers altogether because the outlier's percentage is less. The second method is data imputation by random sampling from a normal distribution with the standard deviation from the mean (Srivastava et al, 2022).…”
Section: Data Cleaningmentioning
confidence: 99%
“…1 illustrates the few applications of IoT devices. It is analyzed that collected data are in huge amount and existing tools are not capable to process and interpret such massive data (Srivastava, Kumar and Singh, 2022). IoT with data analytics, edge computing and fog computing are capable to provide state of art solutions for many healthcare applications such as health monitoring, diagnosis and prediction of diseases, emergence services, resource allocation, and elderly care.…”
Section: Introductionmentioning
confidence: 99%
“…[2] However, not always these data resources yield complete datasets. Even though the majority of missing values are caused by manual data entry, missing data might have occurred due to several factors such as human error, noise generation during transformation, equipment and measurement error, lack of response [3,4,5,6]. The first approach is eliminating missing values and using the rest of the data as a complete dataset [7].…”
Section: Introductionmentioning
confidence: 99%