2019
DOI: 10.2478/rput-2019-0029
|View full text |Cite
|
Sign up to set email alerts
|

Impact of Data Normalization on Classification Model Accuracy

Abstract: In this paper, we present the impact of the data normalization on the classification model performance. In first part of this paper, we present the structure of our dataset, where we discuss the features of the data set and basic statistical analysis of the data. In this research, we worked with the medical data about the patients with the Parkinson disease. In second part of this paper, we present the process of data normalization and the impact of scaling data on the classification model performance. In this… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
17
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 42 publications
(19 citation statements)
references
References 6 publications
1
17
0
1
Order By: Relevance
“…Patient allergic history is commonly associated cause of adverse effects to many types of drugs and vaccines 13 , and in the case of COVID-19 this has also been reported 16,24 ; allergic-related reactions are found to a significant degree in every data group used in our study. Patient age is another important aspect, where the mortality rate in advanced old-age persons is comparatively higher than in other cases, and previous findings also supported this evidence 15 .…”
Section: Discussionsupporting
confidence: 72%
See 1 more Smart Citation
“…Patient allergic history is commonly associated cause of adverse effects to many types of drugs and vaccines 13 , and in the case of COVID-19 this has also been reported 16,24 ; allergic-related reactions are found to a significant degree in every data group used in our study. Patient age is another important aspect, where the mortality rate in advanced old-age persons is comparatively higher than in other cases, and previous findings also supported this evidence 15 .…”
Section: Discussionsupporting
confidence: 72%
“…Firstly, we calculated the feature importance scores for each distinct feature, for individual machine learning classifiers, and then we normalized the values to render the data with the same scale, i.e. between 0 and 1, by using the min-max normalization technique 13 . This was followed by the mean aggregation of those values as shown in Figure 6.…”
Section: Feature Importance Analysis For Finding Significant Features Using Machine Learning Classifiersmentioning
confidence: 99%
“…The present study focused on the importance of other highly influential factors along with student academic results, such as the students per teacher ratio, the number of schools in a region, whether schools were located in rural or urban areas, the availability or lack of classrooms, electrical facilities in schools, availability or lack of furniture for students, open-air classes, computer lab facilities, science labs, and playgrounds in schools. Previous research [44][45][46][47][48] suggested that data pre-processing (normalisation, discretisation) techniques enhanced classifier performance, as these techniques reduce the biases among features. Furthermore, related studies showed that the min-max normalisation method performed better than other data normalisation methods [49][50][51].…”
Section: Discussionmentioning
confidence: 99%
“…Python software is used to generate the targets, namely Fc, e c , and T The full factorial design for the four factors and 8 levels for each factor gives 4096 training data. The normalization is used to make the data with the same scale, the training data are normalized in the interval [0,1] using equations (3), this improves the regression and classification model's performance and training stability [14], [15].…”
Section: A Data Preparationmentioning
confidence: 99%