2018
DOI: 10.1002/sam.11379

The next‐generation K‐means algorithm

Abstract: Typically, when referring to a model‐based classification, the mixture distribution approach is understood. In contrast, we revive the hard‐classification model‐based approach developed by Banfield and Raftery (1993) for which K‐means is equivalent to the maximum likelihood (ML) estimation. The next‐generation K‐means algorithm does not end after the classification is achieved, but moves forward to answer the following fundamental questions: Are there clusters, how many clusters are there, what are the statist…
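The equivalence mentioned in the abstract can be sketched as follows. The notation here (observations $x_i \in \mathbb{R}^m$, clusters $C_k$, means $\mu_k$, common variance $\sigma^2$) is assumed for illustration and is not taken from the paper; the sketch covers only the standard special case of spherical Gaussian clusters with equal variance under hard classification.

```latex
% Classification log-likelihood for n points in R^m, each assigned to
% exactly one of K clusters C_1, ..., C_K with x_i ~ N(mu_k, sigma^2 I):
\ell(C, \mu, \sigma^2)
  = -\frac{nm}{2}\,\log\!\left(2\pi\sigma^2\right)
    - \frac{1}{2\sigma^2} \sum_{k=1}^{K} \sum_{i \in C_k}
      \lVert x_i - \mu_k \rVert^2 .

% For any fixed sigma^2, maximizing over the partition and the means
% is the same as minimizing the within-cluster sum of squares,
% which is the K-means criterion:
\max_{C,\,\mu}\ \ell(C, \mu, \sigma^2)
  \iff
  \min_{C,\,\mu}\ \sum_{k=1}^{K} \sum_{i \in C_k}
      \lVert x_i - \mu_k \rVert^2 .
```

For a fixed partition the inner minimum is attained at the cluster averages, $\hat\mu_k = |C_k|^{-1}\sum_{i \in C_k} x_i$, which is the centroid update step of K-means.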

Cited by 19 publications (10 citation statements)
References 33 publications
“…Reads per kilobase of exon model per million mapped reads (RPKM) was used to evaluate the genes expression in each sample. Clustering of organs in control group was performed using scaled RPKM by K-means algorithm (Demidenko, 2018) with 10,000 iterations. DEGs between infected and control were identified by edgeR (Robinson et al, 2010) in R program using the threshold |log2(fold change)| ≥ 1 and P value < 0.05.…”
Section: Methods
Citation type: mentioning
confidence: 99%
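The quantities named in this statement can be illustrated with a minimal Python sketch. The RPKM formula is the standard definition; the function names, the use of z-scores for "scaled", and the sample numbers are assumptions made for illustration. The sketch only applies the quoted thresholds to values that are already computed and does not reproduce edgeR's statistical model.

```python
from statistics import mean, pstdev


def rpkm(reads, gene_length_bp, total_mapped_reads):
    """Reads per kilobase of exon model per million mapped reads."""
    return reads * 1e9 / (gene_length_bp * total_mapped_reads)


def zscore(values):
    """Scale values to zero mean and unit variance.

    Assumption: the quoted text does not say how RPKM was scaled.
    """
    mu, sd = mean(values), pstdev(values)
    return [(v - mu) / sd for v in values]


def is_deg(log2_fold_change, p_value, lfc_min=1.0, p_max=0.05):
    """Apply the quoted threshold: |log2(fold change)| >= 1 and P < 0.05."""
    return abs(log2_fold_change) >= lfc_min and p_value < p_max


# Hypothetical numbers, for illustration only.
example_rpkm = rpkm(reads=100, gene_length_bp=1000, total_mapped_reads=1_000_000)
scaled = zscore([1.0, 2.0, 3.0])
calls = [is_deg(-1.0, 0.01), is_deg(0.9, 0.001), is_deg(2.0, 0.05)]
```

Note that the P-value cut-off is strict, so a gene with P exactly equal to 0.05 is not called differentially expressed.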
“…The samples were then clustered with the k-means algorithm (nstart = 25, iter.max = 1000). The optimal number of clusters was evaluated with the elbow method [99] by depicting with-in-Sum-of-Squares (WSS). Clustering results were subsequently visualized via the ‘autoplot’ function from ggfortify v0.4.11 [100].…”
Section: Methods
Citation type: mentioning
confidence: 99%
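The procedure described in this statement (k-means with several random restarts, then an elbow plot of the within-cluster sum of squares) can be sketched in plain Python. This is an illustrative re-implementation, not the cited authors' code: `nstart` and `iter_max` mirror the quoted settings, while the function names, the seed, and the synthetic three-blob data are assumptions.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def centroid(points):
    """Coordinate-wise average of a non-empty list of points."""
    return tuple(sum(c) / len(points) for c in zip(*points))


def kmeans_once(points, k, iter_max, rng):
    """One run of Lloyd's algorithm from k randomly chosen data points."""
    centers = rng.sample(points, k)
    labels = None
    for _ in range(iter_max):
        new_labels = [min(range(k), key=lambda j: sq_dist(p, centers[j]))
                      for p in points]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = centroid(members)
    wss = sum(sq_dist(p, centers[lab]) for p, lab in zip(points, labels))
    return wss, labels, centers


def kmeans(points, k, nstart=25, iter_max=1000, seed=0):
    """Best of nstart runs, judged by within-cluster sum of squares (WSS)."""
    rng = random.Random(seed)
    runs = (kmeans_once(points, k, iter_max, rng) for _ in range(nstart))
    return min(runs, key=lambda run: run[0])


def wss_curve(points, k_max, **kwargs):
    """WSS for k = 1..k_max; the 'elbow' is where the curve flattens."""
    return [kmeans(points, k, **kwargs)[0] for k in range(1, k_max + 1)]


# Synthetic data (assumption): three well-separated blobs of 20 points each.
data_rng = random.Random(1)
blob_centers = [(0, 0), (10, 0), (0, 10)]
data = [(cx + data_rng.gauss(0, 0.5), cy + data_rng.gauss(0, 0.5))
        for cx, cy in blob_centers for _ in range(20)]
curve = wss_curve(data, k_max=6)
```

On data like this the WSS drops sharply up to k = 3 and only marginally afterwards, which is the visual cue the elbow method relies on. The elbow is a heuristic and can be ambiguous when clusters overlap.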
“…K-means methodology is a machine-learning technique that identifies and groups analysis units (in our case BHA) based on their similarities of characteristics [28]. K-means methodology will be used to identify clusters of SARS-CoV-2 incidence by BHA, taking into account the rest of the variables described above. In addition, we will identify the spatial-temporal cluster of SARS-CoV-2 infection incidence and seroprevalence by BHA using SaTScan.…”
Section: Methods and Analysis
Citation type: mentioning
confidence: 99%