2020
DOI: 10.1155/2020/8824625
|View full text |Cite|
|
Sign up to set email alerts
|

The Use of Hellinger Distance Undersampling Model to Improve the Classification of Disease Class in Imbalanced Medical Datasets

Abstract: Imbalanced class distribution in the medical dataset is a challenging task that hinders classifying disease correctly. It emerges when the number of healthy class instances being much larger than the disease class instances. To solve this problem, we proposed undersampling the healthy class instances to improve disease class classification. This model is named Hellinger Distance Undersampling (HDUS). It employs the Hellinger Distance to measure the resemblance between majority class instance and its neighbouri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 36 publications
0
4
0
Order By: Relevance
“…It is required to balance the data by increasing the minority class or decreasing the majority class (undersampling). The distribution can vary from a slight bias to a severe imbalance [ 15 18 ].…”
Section: Introductionmentioning
confidence: 99%
“…It is required to balance the data by increasing the minority class or decreasing the majority class (undersampling). The distribution can vary from a slight bias to a severe imbalance [ 15 18 ].…”
Section: Introductionmentioning
confidence: 99%
“…Then the space between the closest train data points increased to find the boundaries of the classes which represents the optimal hyperplane. Which eliminates some of the inconsequential data from the training dataset to minimize the classification error [56]. However, when dataset is organically non-linear the SVM uses a kernel function to transform the data to the high-dimensional to provide a separating hyperplane.…”
Section: Support Vector Machine (Svm)mentioning
confidence: 99%
“…This article has been retracted by Hindawi following an investigation undertaken by the publisher [ 1 ]. This investigation has uncovered evidence of one or more of the following indicators of systematic manipulation of the publication process: Discrepancies in scope Discrepancies in the description of the research reported Discrepancies between the availability of data and the research described Inappropriate citations Incoherent, meaningless and/or irrelevant content included in the article Peer-review manipulation …”
mentioning
confidence: 99%
“…This article has been retracted by Hindawi following an investigation undertaken by the publisher [1]. This investigation has uncovered evidence of one or more of the following indicators of systematic manipulation of the publication process:…”
mentioning
confidence: 99%