2020
DOI: 10.14778/3407790.3407813
|View full text |Cite
|
Sign up to set email alerts
|

LOG-Means

Abstract: Clustering is a fundamental primitive in manifold applications. In order to achieve valuable results, parameters of the clustering algorithm, e.g., the number of clusters, have to be set appropriately, which is a tremendous pitfall. To this end, analysts rely on their domain knowledge in order to define parameter search spaces. While experienced analysts may be able to define a small search space, especially novice analysts often define rather large search spaces due to the lack of in-depth domain knowledge. T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 36 publications
0
4
0
Order By: Relevance
“…To better highlight the genes that are most impacted by Ume6 depletion, we applied k-means clustering, which groups genes by their Euclidian distance while minimizing variation. Using the “elbow method,” we found k = 3 to be optimal for subdividing our 144 genes in the Ume6 regulon ( Thorndike 1953 ; Fritz et al . 2020 ).…”
Section: Resultsmentioning
confidence: 99%
“…To better highlight the genes that are most impacted by Ume6 depletion, we applied k-means clustering, which groups genes by their Euclidian distance while minimizing variation. Using the “elbow method,” we found k = 3 to be optimal for subdividing our 144 genes in the Ume6 regulon ( Thorndike 1953 ; Fritz et al . 2020 ).…”
Section: Resultsmentioning
confidence: 99%
“…In the specific cases presented in this work, prior information guided the decision on the number of clusters, i.e., the number of synthetically generated intervals in the synthetic example guided us to select four clusters, and the number of lithologies described in the Elvira deposit in the real example guided us to select 18 clusters. In the case of not having any prior information, there are methodologies to estimate in advance the number of classes needed to capture the highest percentage of variability possible, i.e., elbow [30] or Log-Mean [31] methods.…”
Section: Unsupervised Domainingmentioning
confidence: 99%
“…A variety of cluster validity indices have been proposed to determine the cluster number. For instance, DBI, SI, DU, Gap statistic, XBI, PBM, CHI, Elbow method, Hypersphere density-based method [16], Log_means [17], and so on.…”
Section: Indices For Determining Cluster Number and Validation Ofmentioning
confidence: 99%