2015
DOI: 10.1109/tkde.2015.2391123
|View full text |Cite
|
Sign up to set email alerts
|

GDCluster: A General Decentralized Clustering Algorithm

Abstract: Abstract-In many popular applications like peer-to-peer systems, large amounts of data are distributed among multiple sources. Analysis of this data and identifying clusters is challenging due to processing, storage, and transmission costs. In this paper, we propose GDCluster, a general fully decentralized clustering method, which is capable of clustering dynamic and distributed data sets. Nodes continuously cooperate through decentralized gossip-based communication to maintain summarized views of the data set… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 24 publications
(7 citation statements)
references
References 28 publications
0
7
0
Order By: Relevance
“…Decentralized clustering on distributed data using P2P has been studied recently. Some solutions introduce distributed k -means algorithms that construct a global set of artificial points to act as a proxy for the entire dataset [1,16]. There are some solutions that consider P2P random networks and work in static settings, however they are aimed at computing basic average of centroids.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Decentralized clustering on distributed data using P2P has been studied recently. Some solutions introduce distributed k -means algorithms that construct a global set of artificial points to act as a proxy for the entire dataset [1,16]. There are some solutions that consider P2P random networks and work in static settings, however they are aimed at computing basic average of centroids.…”
Section: Related Workmentioning
confidence: 99%
“…The third method is agml, a dicentralized version of P2P k -means that allows nodes to exchage summaries representing their local data, then apply clustering using generated summaries [5]. Also, we implemented gdc as a P2P k -means algothim that that allows nodes to generate and exchange a global set of artificial points to act as a proxy for the entire dataset [16]. Finally, we use golf that is a P2P k -means implemented using gossip protocol [2].…”
Section: K -Means Clustering Algorithmsmentioning
confidence: 99%
“…Recently, two K‐means‐based models, distributed PCA and K‐means and KPCA+ K‐means clustering, were developed based on the PCA concept and kernel PCA concept. Mashayekhi et al proposed GDCluster, a general fully decentralized clustering method, which is capable of clustering dynamic and distributed datasets . In GDCluster, nodes continuously cooperate through decentralized gossip‐based communication to maintain summarized views of the dataset.…”
Section: Data Mining Techniques In Distributed Environmentmentioning
confidence: 99%
“…Mashayekhi et al proposed GDCluster, a general fully decentralized clustering method, which is capable of clustering dynamic and distributed datasets. 126 In GDCluster, nodes continuously cooperate through decentralized gossip-based communication to maintain summarized views of the dataset. Other approaches for DC are still in progress.…”
Section: Distributed Clusteringmentioning
confidence: 99%
“…A number of applications based on Epidemic protocols have been proposed to serve different purposes in different environments. For example, Epidemic protocols have been employed to implement applications for Peer-to-Peer (P2P) overlay networks [1,2,3], distributed computing [4], mobile ad hoc networks (MANET) [5], wireless sensor networks (WSN) [6,7,8,9,10], failure detection [11], distributed data mining [12,13,14] and exascale high performance computing [15,16,17].…”
Section: Introductionmentioning
confidence: 99%