Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2007
DOI: 10.1145/1277741.1277927
|View full text |Cite
|
Sign up to set email alerts
|

Active learning for class imbalance problem

Abstract: The class imbalance problem has been known to hinder the learning performance of classification algorithms. Various real-world classification tasks such as text categorization suffer from this phenomenon. We demonstrate that active learning is capable of solving the problem.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
79
0

Year Published

2008
2008
2021
2021

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 140 publications
(80 citation statements)
references
References 3 publications
1
79
0
Order By: Relevance
“…Typically, the class imbalance ratio of examples that are close to the decision boundary is lower than the imbalance ratio in the complete data set. Therefore, the active learning approach provides more balanced training examples because it selects examples that lie closest to the separating hyperplane using the support vector machine algorithm [11], [12]. However, the method is designed based on the characteristics of the support vector machine algorithm; thus, the method cannot be applied to other classification algorithms.…”
Section: Related Workmentioning
confidence: 99%
“…Typically, the class imbalance ratio of examples that are close to the decision boundary is lower than the imbalance ratio in the complete data set. Therefore, the active learning approach provides more balanced training examples because it selects examples that lie closest to the separating hyperplane using the support vector machine algorithm [11], [12]. However, the method is designed based on the characteristics of the support vector machine algorithm; thus, the method cannot be applied to other classification algorithms.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, various approaches on active learning from imbalanced data sets have been proposed in literature [1] [22] [23] [24]. In particular, an active learning method based on support vector machines (SVM) was proposed in [23] [24]. Instead of searching the entire training data space, this method can effectively select informative instances from a random set of training populations, therefore significantly reducing the computational cost when dealing with large imbalanced data sets.…”
Section: Introductionmentioning
confidence: 99%
“…Seyda Ertekin et.al. developed an efficient method [30] for active selection of informative instances from a randomly picked small pool of samples. In the proposed method they have maintained the same or achieving even higher g-means values by using less number of training instances in the SVM model.…”
Section: Introductionmentioning
confidence: 99%