2021
DOI: 10.6339/jds.201301_11(1).0005
|View full text |Cite
|
Sign up to set email alerts
|

A New Procedure of Clustering Based on Multivariate Outlier Detection

Abstract: Clustering is an extremely important task in a wide variety of application domains especially in management and social science research. In this paper, an iterative procedure of clustering method based on multivariate outlier detection was proposed by using the famous Mahalanobis distance. At first, Mahalanobis distance should be calculated for the entire sample, then using T 2-statistic fix a UCL. Above the UCL are treated as outliers which are grouped as outlier cluster and repeat the same procedure for the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 18 publications
0
5
0
Order By: Relevance
“…On the other hand, as mentioned in many studies, K-means is sensitive to noise and outliers, and may not give accurate results ( Hodge & Austin, 2004 ; Gan & Ng, 2017 ). Alternatively, K-medoids or partition around medoids (PAM) is less sensitive to local minima problem and, therefore, some studies targeted to use these hard clustering algorithms in outlier detection ( Jayakumar & Thomas, 2013 ; Kumar, Kumar & Singh, 2013 ). However, the hard clustering algorithms such as K-means and PAM force each data point to belong to the nearest cluster.…”
Section: Related Workmentioning
confidence: 99%
“…On the other hand, as mentioned in many studies, K-means is sensitive to noise and outliers, and may not give accurate results ( Hodge & Austin, 2004 ; Gan & Ng, 2017 ). Alternatively, K-medoids or partition around medoids (PAM) is less sensitive to local minima problem and, therefore, some studies targeted to use these hard clustering algorithms in outlier detection ( Jayakumar & Thomas, 2013 ; Kumar, Kumar & Singh, 2013 ). However, the hard clustering algorithms such as K-means and PAM force each data point to belong to the nearest cluster.…”
Section: Related Workmentioning
confidence: 99%
“…Clustering is one of the informal ways to identifying outliers (Jayakumar & Thomas, 2013;Johnson & Wichern, 2002). The aim of clustering is to group a set of observations into clusters based on similarities or distances (dissimilarities) (Irani et al, 2016;Johnson & Wichern, 2002).…”
Section: Cluster Analysismentioning
confidence: 99%
“…The most popular and easiest way to compute similarity is Euclidean distance but it does not take into account the covariance structure and is not appropriate for multivariate data (Almeida et al, 2007). Studies such as Hardin and Rocke (2004) and Jayakumar and Thomas (2013) used Mahalanobis distance as a similarity measure. In clustering, outliers are defined as observations that is far from any clusters or have large distance from the centre of each cluster (Hardin & Rocke, 2004;Zhang, 2013).…”
Section: Cluster Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…Anomaly detection is an unsupervised target detection technique where no prior knowledge about the target or the background is available, focusing on distinguishing unusual material from a typical background (Shaw and Manolakis, 2002). Mahalanobis Distance (MD) is based on correlations between variables and can be used to identify and analyze different patterns (Jayakumar and Thomas, 2013). MD has been used for many different purposes, including detection of outliers (De Maesschalck, Jouan-Rimbaud and Massart, 2000).…”
Section: Introductionmentioning
confidence: 99%