Real-world datasets used for classification often face many challenges when they are imbalanced in nature which is unavoidable and need to be handled by analysts. Many researchers have proposed methods for handling imbalanced datasets and they mostly concentrated on handling binary classification with only two class labels. Only very few research works have been carried out for treating highly imbalanced datasets and fail to handle multiclass datasets. To address imbalance problem in multiclass datasets, this paper proposes a Variable Population sized Particle Swarm Optimization (VPPSO) which is a modified version of Particle Swarm Optimization (PSO) which works based on clustering. PSO usually has fixed population size and has high computational complexity. In order to reduce this, the population size is varied over generations and the particles are loaded into the population iteratively by retaining the balance nature of solutions. PSO optimizes the selection of training and testing samples from each class label in imbalanced datasets for improved classification results. From the implementation results, it is evident that using VPPSO, highly imbalanced datasets with multiclass attributes are classified more efficiently
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.