2010
DOI: 10.5120/1148-1503
|View full text |Cite
|
Sign up to set email alerts
|

Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction

Abstract: Clustering analysis is the problem of partitioning a set of objects O = {o1… on} into c self-similar subsets based on available data. In general, clustering of unlabeled data poses three major problems: 1) assessing cluster tendency, i.e., how many clusters to seek? 2) Partitioning the data into c meaningful groups, and 3) validating the c clusters that are discovered. We address the first problem, i.e., determining the number of clusters c prior to clustering. Many clustering algorithms require number of clus… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2010
2010
2021
2021

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 6 publications
0
3
0
Order By: Relevance
“…Asadi et al [6] represent the structure of unlabeled data sets as a RDI (Reordered Dissimilarity Image). RDI highlights potential clusters as a set of "dark block" along the diagonal of image.…”
Section: Related Workmentioning
confidence: 99%
“…Asadi et al [6] represent the structure of unlabeled data sets as a RDI (Reordered Dissimilarity Image). RDI highlights potential clusters as a set of "dark block" along the diagonal of image.…”
Section: Related Workmentioning
confidence: 99%
“…However, they require the output of a clustering process while SSDA could run without a related clustering process. Some other schemes, like DBSCAN [19] or VAT-based schemes [4,46,61] can operate independently as well, but they incur large computational overhead, i.e., O(l 2 ). On the other hand, SSDA requires only O(l) overhead to achieve a similar accuracy level.…”
Section: The Impact Of Snr On Ssda Detection Accuracymentioning
confidence: 99%
“…In Figure 3.11(b), we compare the accuracy between EA-k model and the SSDA algorithm in [28], which can estimate up to 4 colliding tags. We implement SSDA using the original settings in [28] and set the k range to [1,4] for fair comparison. We observe from Figure 3.11(b) that SSDA achieves higher accuracy (≈ 90%) than our EA-k model does.…”
Section: About Vertical Informationmentioning
confidence: 99%