2019
DOI: 10.18637/jss.v091.i10
|View full text |Cite
|
Sign up to set email alerts
|

Beyond Tandem Analysis: Joint Dimension Reduction and Clustering in R

Abstract: We present the R package clustrd which implements a class of methods that combine dimension reduction and clustering of continuous or categorical data. In particular, for continuous data, the package contains implementations of factorial K-means and reduced K-means; both methods combine principal component analysis with K-means clustering. For categorical data, the package provides MCA K-means, i-FCB and cluster correspondence analysis, which combine multiple correspondence analysis with K-means. Two examples … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
42
0
4

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 51 publications
(47 citation statements)
references
References 20 publications
1
42
0
4
Order By: Relevance
“…Hierarchical clustering was performed with the ComplexHeatmap (Gu et al, 2016) R package (clustering distance = Manhattan, clustering method = Ward.D2). Cluster correspondence analysis (van de Velden et al, 2017) of the 45 categorical variables (combinations of histone marks and decision-tree labels) across the 8,030 selected genes was performed with the R package clustrd (Markos et al, 2019). To select the optimal number of clusters and dimensions, we first run the function tuneclus() with the following parameters: nclusrange = 3:10, ndimrange = 2:9, method = "clusCA", nstart = 100, seed = 1234.…”
Section: Clustering Analysismentioning
confidence: 99%
“…Hierarchical clustering was performed with the ComplexHeatmap (Gu et al, 2016) R package (clustering distance = Manhattan, clustering method = Ward.D2). Cluster correspondence analysis (van de Velden et al, 2017) of the 45 categorical variables (combinations of histone marks and decision-tree labels) across the 8,030 selected genes was performed with the R package clustrd (Markos et al, 2019). To select the optimal number of clusters and dimensions, we first run the function tuneclus() with the following parameters: nclusrange = 3:10, ndimrange = 2:9, method = "clusCA", nstart = 100, seed = 1234.…”
Section: Clustering Analysismentioning
confidence: 99%
“…Our primary sample consisted of 1210 adults—557 women and 652 men, ranging in age from 18 to 80 ( M = 30.90, SD = 11.99, Mdn = 27)—who completed the HSQ using a 5-point rating scale. Some of the responses ( n = 810) come from data collected by open psychometrics and shared in the R package clustrd [ 19 ]; we collected the remaining responses ( n = 400) using the Prolific.co survey panel. This 5-point dataset is the primary sample reported throughout the manuscript.…”
Section: Methodsmentioning
confidence: 99%
“…For tandem clustering, we use the R package "clustrd" on CRAN (https://CRAN.R-project. org/package=clustrd) (Markos et al 2019) in which RKM and FKM are implemented. As a baseline, simple LBG k-means (Linde et al 1980) and k-means with initialization procedure 12 (KM_I12) (Steinley and Brusco 2007) were used.…”
Section: Algorithms Combining Dr With Clusteringmentioning
confidence: 99%