2016
DOI: 10.1007/978-3-319-46349-0_34
|View full text |Cite
|
Sign up to set email alerts
|

An Optimized k-NN Approach for Classification on Imbalanced Datasets with Missing Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 3 publications
0
7
0
Order By: Relevance
“…Condek et al [12] applied median imputation address missing data and concluded that RF can be used as a cost function providing better results than the naïve approaches of checking every truck or no truck until failure. Ozan et al [13] introduced an optimized k-NN approach to handle missing values and created a tailored k-NN model using a specified HEIM distance. Biteus and Lindgren [14] removed attributes with more than 10% missing values and applied mean imputation to the remaining attributes.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Condek et al [12] applied median imputation address missing data and concluded that RF can be used as a cost function providing better results than the naïve approaches of checking every truck or no truck until failure. Ozan et al [13] introduced an optimized k-NN approach to handle missing values and created a tailored k-NN model using a specified HEIM distance. Biteus and Lindgren [14] removed attributes with more than 10% missing values and applied mean imputation to the remaining attributes.…”
Section: Discussionmentioning
confidence: 99%
“…In terms of publication time, it can be divided into two scopes. Previous studies from 2016 to 2019 March [10][11][12][13][14][15][16] have used many dedicated methods of data imputation and machine learning algorithms to solve the high-class imbalance and missing data problems [17][18][19][20][21][22][23][24]. However, these studies have focused only on the training set.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…This data is heavily imbalanced (74 observations for the "diseased trees" class vs 4,265 observations for "other land cover" class). APS Failure data set was collected from heavy scania trucks in everyday usage [15]. The task is to diagnose whether a truck failure is caused by a component of the air pressure system (APS) from 170 anonymous factors.…”
Section: Methodsmentioning
confidence: 99%
“…Ozan E.C et al [4] discussed an optimized k-NN classifier to estimate missing data on this imbalanced dataset. They handled the missing value problem by using KNN imputation.…”
Section: Related Workmentioning
confidence: 99%