2009 Ninth IEEE International Conference on Data Mining 2009
DOI: 10.1109/icdm.2009.59
|View full text |Cite
|
Sign up to set email alerts
|

Improving SVM Classification on Imbalanced Data Sets in Distance Spaces

Abstract: Abstract-Imbalanced data sets present a particular challenge to the data mining community. Often, it is the rare event that is of interest and the cost of misclassifying the rare event is higher than misclassifying the usual event. When the data is highly skewed toward the usual, it can be very difficult for a learning system to accurately detect the rare event. There have been many approaches in recent years for handling imbalanced data sets, from under-sampling the majority class to adding synthetic points t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
20
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 40 publications
(20 citation statements)
references
References 23 publications
0
20
0
Order By: Relevance
“…This is due to the imbalanced numbers of instances. On the contrary, G-means and F-measure have been adopted as the experimental metric for class imbalance (Koknar-Tezel Latecki, 2009;Imam, et al,2006). Moreover, which metric should be applied to the experiment depends also on the classification purpose and the domain of the benchmark datasets.…”
Section: Performance Measurementsmentioning
confidence: 99%
See 4 more Smart Citations
“…This is due to the imbalanced numbers of instances. On the contrary, G-means and F-measure have been adopted as the experimental metric for class imbalance (Koknar-Tezel Latecki, 2009;Imam, et al,2006). Moreover, which metric should be applied to the experiment depends also on the classification purpose and the domain of the benchmark datasets.…”
Section: Performance Measurementsmentioning
confidence: 99%
“…1; the minority class to build a less imbalanced data (Akbani, et al, 2004;Haibo Garcia, 2009;Koknar-Tezel Latecki, 2009). In addition, sampling methods have been proved to improve the classifier performance (Haibo Garcia, 2009;Koknar-Tezel Latecki, 2009). For example, in (Japkowicz, 2000), the author examined the dual techniques of under-sampling and over-sampling and the empirical analysis demonstrated the efficiency of both methods.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations