In classification, the central task of most algorithms is to find an accurate decision boundary. However, many decision boundaries are too complex to be discovered directly. In this paper, we propose an Incremental Maximum Gaussian Mixture Partition (IMGMP) algorithm for classification, aimed at problems with complex decision boundaries. As a self-adaptive algorithm, it uses a divide-and-conquer strategy to construct a reasonable decision boundary step by step. An improved K-means clustering and a Maximum Gaussian Mixture model are used in the classifier. The algorithm has been tested on artificial and real-life datasets to evaluate its flexibility and robustness.
Introduction

In the field of machine learning, Artificial Neural Networks, Support Vector Machines (SVM) [1], and many other algorithms are used to solve classification problems [2]. A common idea behind these algorithms is to find a decision boundary between two classes [3]. In practice, however, discovering the decision boundary is always a tricky problem [4,5]. First, most datasets are not linearly separable and their decision boundaries are complex [6,7]. If a dataset has more than two dimensions, its decision boundary is in fact a decision surface and becomes even more complex [6,8]. In addition, the boundaries of different classes may intersect in some cases. Although many algorithms can discover complex decision boundaries, they still cannot satisfy all requirements. For instance, SVM [1,9] can tune its kernel function to approximate the real decision boundary, but a small set of kernel functions cannot fit a wide variety of tasks [10].

To represent the decision boundary more flexibly, researchers combine a set of simple boundaries to simulate the actual boundary. For instance, Jinghao and Binge use a set of hyper-planes [11] or hyper-spheres [12,13]: GA or PSO is used to dynamically search the whole data space to find the clusters of each class, and a group of hyper-spheres then represents the clusters as classifiers. This approach has also been described as an expert system: each cluster is assigned to an expert that knows the boundary of the corresponding cluster, and the set of experts together implies the complete decision surface. An expert indicates a range of a class's patterns, and any pattern falling within this range is regarded as belonging to that class.

The above methods work well in practice, but they still have some limitations. First, the shapes of clusters can be complex, so hyper-planes and hyper-spheres alone are not sufficient to represent them. Second, the boundaries of the experts are absolute; once the boundaries of two classes overlap, problems arise [15]. Finally, there are gap areas outside the experts, and it is difficult to decide how to classify patterns that fall into these gaps.

To solve the problems above, the IMGMP algorithm in this paper uses the idea of fuzzy learning [16,17] in the expert system. IMGMP will employ Gaussian models [18] as classifiers in t...
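To make the expert-system idea concrete, the following is a minimal sketch (in Python) of per-class Gaussian "experts" with soft membership: one Gaussian mixture is fit to each class, and a pattern is assigned to the class whose expert gives it the highest likelihood, so overlap and gap regions receive graded scores rather than an absolute in/out decision. This is only an illustration of the general technique under stated assumptions, not the authors' IMGMP; the toy dataset, the number of mixture components, and the use of scikit-learn's GaussianMixture are assumptions introduced here.

# Sketch: per-class Gaussian mixture "experts" with soft (likelihood-based) membership.
# Illustrative only; not the IMGMP algorithm described in this paper.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)

# Toy two-class data with non-linear structure: class 0 is two separated blobs,
# class 1 is a single elongated blob between them (assumed data, for illustration).
X0 = np.vstack([rng.normal([-3, 0], 0.5, (100, 2)),
                rng.normal([3, 0], 0.5, (100, 2))])
X1 = rng.normal([0, 0], [0.5, 2.0], (200, 2))
X = np.vstack([X0, X1])
y = np.array([0] * 200 + [1] * 200)

def fit_class_experts(X, y, n_components=2):
    # Fit one Gaussian mixture ("expert") per class.
    experts = {}
    for label in np.unique(y):
        gm = GaussianMixture(n_components=n_components,
                             covariance_type='full',
                             random_state=0)
        gm.fit(X[y == label])
        experts[label] = gm
    return experts

def predict(experts, X_new):
    # Assign each pattern to the class whose expert gives the highest log-likelihood.
    labels = sorted(experts)
    log_lik = np.column_stack([experts[c].score_samples(X_new) for c in labels])
    return np.array(labels)[np.argmax(log_lik, axis=1)]

experts = fit_class_experts(X, y)
print("training accuracy:", np.mean(predict(experts, X) == y))

Because membership is a likelihood rather than a hard geometric boundary, a pattern lying between two experts is simply assigned to the more plausible one, which is the behavior the fuzzy-learning formulation above is meant to provide.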