“…Meanwhile, there also has been a lot of work on the parallel K-means algorithm and implementation, such as parallel implementation based on SIMD hypercube network [25,29], master/slave message passing architecture [11,13,21,22,35], shared memory multi-core processor [30,3], GPU [12] and MapReduce programming model [36,2]. Based on the probability-based seeding approach K-Means++ [6], Bahmani et al proposed a parallel seeding approach that can find a good initial set of centers rapidly [8].…”