2013
DOI: 10.7763/ijmlc.2013.v3.305

Constrained Classification of Large Imbalanced Data by Logistic Regression and Genetic Algorithm

Abstract: Imbalance in data classification is a frequently discussed problem that is not well handled by classical classification techniques. The problem we tackled was to learn a binary classification model from large data with an accuracy constraint for the minority class. We propose a new meta-learning method that creates initial models using cost-sensitive learning by logistic regression and uses these models as initial chromosomes for a genetic algorithm. The method has been successfully tested on a large real-world…
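The abstract gives only the outline of the method: cost-sensitive logistic regression models seed the initial population of a genetic algorithm that optimizes accuracy under a minority-class accuracy constraint. The sketch below illustrates that outline rather than the authors' implementation; the chromosome encoding as raw coefficient vectors, the penalty-based fitness, and all GA operators and hyperparameters are assumptions.

```python
# Hypothetical sketch of the abstract's idea: cost-sensitive logistic regression
# models seed a GA that maximizes accuracy subject to a minority-class recall
# constraint. Encoding, fitness, and GA settings are assumptions, not paper details.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def seed_population(X, y, weights=(1, 5, 10, 20)):
    """Fit one cost-sensitive logistic regression per misclassification-cost
    setting and return the stacked [intercept, coefficients] chromosomes."""
    population = []
    for w in weights:
        clf = LogisticRegression(class_weight={0: 1, 1: w}, max_iter=1000)
        clf.fit(X, y)
        population.append(np.r_[clf.intercept_, clf.coef_.ravel()])
    return np.array(population)

def predict(chromosome, X):
    # linear decision rule encoded by the chromosome
    return (chromosome[0] + X @ chromosome[1:] > 0).astype(int)

def fitness(chromosome, X, y, min_recall=0.8):
    """Overall accuracy, penalized when minority-class recall falls below
    the constraint (the penalty form is an assumption)."""
    pred = predict(chromosome, X)
    acc = (pred == y).mean()
    recall = pred[y == 1].mean() if (y == 1).any() else 0.0
    return acc - max(0.0, min_recall - recall) * 10.0

def evolve(X, y, generations=50, pop_size=20, sigma=0.1):
    pop = seed_population(X, y)
    # pad the seeded population with mutated copies up to pop_size
    while len(pop) < pop_size:
        pop = np.vstack([pop, pop[rng.integers(len(pop))]
                         + rng.normal(0, sigma, pop.shape[1])])
    for _ in range(generations):
        scores = np.array([fitness(c, X, y) for c in pop])
        parents = pop[np.argsort(scores)[-pop_size // 2:]]       # truncation selection
        cuts = rng.integers(1, pop.shape[1], size=len(parents))  # one-point crossover
        children = np.array([np.r_[a[:k], b[k:]] for a, b, k in
                             zip(parents, np.roll(parents, 1, axis=0), cuts)])
        children += rng.normal(0, sigma, children.shape)         # Gaussian mutation
        pop = np.vstack([parents, children])
    scores = np.array([fitness(c, X, y) for c in pop])
    return pop[scores.argmax()]
```

With labels in {0, 1} and class 1 as the minority, evolve(X, y) returns the best coefficient vector found, and predict(best, X) applies it as a linear classifier.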

Cited by 7 publications (4 citation statements)
References 7 publications
“…This involves either oversampling instances of the minority class or undersampling instances of the majority class. Oversampling involves the random duplication of instances from minority classes [15][16][17]. Undersampling involves the random removal of instances from majority classes.…”
Section: Data-based Methods
confidence: 99%
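As a minimal illustration of the two data-based strategies quoted above (not code from the cited works), the NumPy helpers below duplicate minority instances at random or drop majority instances at random; rebalancing to an exact 1:1 ratio is an assumption.

```python
# Random oversampling duplicates minority rows; random undersampling drops
# majority rows. Labels are assumed to be {0, 1} with the minority as 1.
import numpy as np

def random_oversample(X, y, minority=1, seed=0):
    rng = np.random.default_rng(seed)
    min_idx, maj_idx = np.where(y == minority)[0], np.where(y != minority)[0]
    extra = rng.choice(min_idx, size=len(maj_idx) - len(min_idx), replace=True)
    keep = np.r_[maj_idx, min_idx, extra]          # duplicate minority rows
    return X[keep], y[keep]

def random_undersample(X, y, minority=1, seed=0):
    rng = np.random.default_rng(seed)
    min_idx, maj_idx = np.where(y == minority)[0], np.where(y != minority)[0]
    keep = np.r_[min_idx, rng.choice(maj_idx, size=len(min_idx), replace=False)]
    return X[keep], y[keep]                        # drop majority rows at random
```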
“…Another strategy is the threshold-moving technique in which the decision threshold is shifted in a manner that reduces bias towards the negative class [15][16][17][26]. It applies to classifiers that, given an input tuple, return a continuous output value.…”
Section: Algorithm-based Methods
confidence: 99%
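A small sketch of the threshold-moving idea quoted above, assuming a probabilistic classifier such as logistic regression: the model stays unchanged and only the decision threshold on its continuous output is shifted away from the default 0.5. Selecting the threshold by maximizing F1 on a held-out validation set is an assumption, not a detail from the cited works.

```python
# Threshold-moving sketch: tune the probability cutoff on validation data
# instead of using the default 0.5, to reduce bias toward the majority class.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score

def pick_threshold(clf, X_val, y_val):
    probs = clf.predict_proba(X_val)[:, 1]
    candidates = np.linspace(0.05, 0.95, 19)
    scores = [f1_score(y_val, (probs >= t).astype(int)) for t in candidates]
    return candidates[int(np.argmax(scores))]

# usage sketch (X_train, y_train, X_val, y_val, X_test are placeholders):
# clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
# t = pick_threshold(clf, X_val, y_val)
# y_pred = (clf.predict_proba(X_test)[:, 1] >= t).astype(int)
```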
“…This often results in poorly estimated independent variable coefficients. One way to compensate is to undersample the majority class to rebalance the overall sample (Hlosta et al., 2013). To maximise the number of observed data points used in the logistic regressions, we used all data points in the 'teams' category and randomly selected an equal number of data points in the 'no teams' category.…”
Section: Methods
confidence: 99%
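A hypothetical sketch of the rebalancing step described in this statement: keep every minority observation, draw an equal-sized random sample from the majority class, and fit the logistic regression on the balanced subset. The function and label names below are placeholders, not taken from the citing paper.

```python
# Fit a logistic regression on a 1:1 balanced subsample: all minority rows
# plus an equal-sized random draw of majority rows.
import numpy as np
from sklearn.linear_model import LogisticRegression

def fit_on_balanced_subsample(X, y, minority_label=1, seed=0):
    rng = np.random.default_rng(seed)
    min_idx = np.where(y == minority_label)[0]
    maj_idx = rng.choice(np.where(y != minority_label)[0],
                         size=len(min_idx), replace=False)
    keep = np.r_[min_idx, maj_idx]
    return LogisticRegression(max_iter=1000).fit(X[keep], y[keep])
```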
“…Many methods have been presented to deal with the class imbalanced problem using various techniques [10], [11]. The idea of developing the algorithm to build the decision tree classifier that is suitable for classifying an imbalanced dataset is one of the methods that have received wide attention.…”
Section: Introduction
confidence: 99%