2018
DOI: 10.1007/978-3-030-03146-6_37
|View full text |Cite
|
Sign up to set email alerts
|

Big Data Preprocessing for Modern World: Opportunities and Challenges

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
3
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(6 citation statements)
references
References 13 publications
0
3
0
Order By: Relevance
“…Consequently, for cleaning the gathered data from anomalies and noises, a preprocessing step should be performed. Also, to cope with the big data problems [60,61], the proper feature selection processes should be applied to reduce the dimensions for simplifying the process of classification. Therefore, addressing the related issues to collected data has a significant impact on effectiveness of classification methods.…”
Section: Data Acquiringmentioning
confidence: 99%
“…Consequently, for cleaning the gathered data from anomalies and noises, a preprocessing step should be performed. Also, to cope with the big data problems [60,61], the proper feature selection processes should be applied to reduce the dimensions for simplifying the process of classification. Therefore, addressing the related issues to collected data has a significant impact on effectiveness of classification methods.…”
Section: Data Acquiringmentioning
confidence: 99%
“…Therefore, a preprocessing process is required to be performed to clean the collected data from noises and detected anomalies. In addition, to address the involved big data challenges, the feature selection techniques are used for the aim of dimensions reduction to store medical data in the cloud storage to simplify the classification phase for disease diagnosis and prediction process. Moreover, since the outcomes of data mining process and derived analytics are extremely related to the collected data, thus, besides the IoTD, the vital information including patients' habitual data and also medical history records are collected and applied to attain more accurate predictions and precise disease diagnosis.…”
Section: Resultsmentioning
confidence: 99%
“…Dealing with missing data is a complex task and while there is no perfect solution, several strategies are available (Farhangfar et al, 2007): Although removing instances with missing values is the simplest method, it can lead to biased results or a loss of information (Alexandropoulos et al, 2019;Little and Rubin, 2002). Therefore, imputation is often used to establish a statistical relationship between the missing data and the other instances (tuples) in the dataset (Prakash et al, 2019).…”
Section: Handle the Missing Datamentioning
confidence: 99%
“…A complete collection of data preprocessing techniques was provided by García et al (2015), highlighting the gaps in real data caused by various factors, along with the most relevant proposed solutions. In addition, García et al (2016); Prakash et al (2019) introduced detailed data preprocessing methods for data mining in the context of big data. The selection methodology of techniques has been extensively discussed by Han et al (2012); Subasi (2020) to help researchers choose the appropriate techniques for data analysis.…”
Section: Introductionmentioning
confidence: 99%