The K-means algorithm is extended to allow for partitioning of skewed groups. Our algorithm is called TiK-means and contributes a K-means-type algorithm that assigns observations to groups while estimating their skewness-transformation parameters. The resulting groups and transformation reveal general-structured clusters that can be explained by inverting the estimated transformation. Further, a modification of the jump statistic chooses the number of groups. Our algorithm is evaluated on simulated and real-life data sets and then applied to a long-standing astronomical dispute regarding the distinct kinds of gamma ray bursts.
K E Y W O R D SBATSE, gamma ray bursts, inverse hyperbolic sine transformation, jump selection plot, K-means
INTRODUCTIONClustering observations into distinct groups of homogeneous observations [19,24,36,37,49,56,61,64,67] is important for many applications, for example, taxonomical classification [53], market segmentation [28], color quantization of images [10,46], and software management [43]. The task is generally challenging with many heuristic [19,20,33,34,36,42] or more formal statistical model-based [24,50,51,65] approaches. However, the K-means algorithm [27,41,42] that finds partitions locally minimizing the within-sums-of-squares (WSS) is the most commonly used method. This algorithm depends on good initialization [10,44] and ideally suited to find homogeneous spherically dispersed groups. Further, K-means itself does not provide the number of groups K, for which many methods [26,36,39,46,54,59,62] exist. Nevertheless, it is used (in fact, more commonly abused) extensively because of its computational speed and simplicity. Many adaptations of K-means exist. The algorithm was recently modified for partially observed data sets [16,40]. The K-medoids algorithm [35,58] is a robust alternative that decrees each cluster center to be a medoid (or exemplar) that is an observation in the data set. The K-median Stat Anal Data Min: The ASA Data Sci Journal. 2019;12:223-233.wileyonlinelibrary.com/sam