2012
DOI: 10.18637/jss.v046.i06
|View full text |Cite
|
Sign up to set email alerts
|

HDclassif: AnRPackage for Model-Based Clustering and Discriminant Analysis of High-Dimensional Data

Abstract: This paper presents the R package HDclassif which is devoted to the clustering and the discriminant analysis of high-dimensional data. The classification methods proposed in the package result from a new parametrization of the Gaussian mixture model which combines the idea of dimension reduction and model constraints on the covariance matrices. The supervised classification method using this parametrization is called high dimensional discriminant analysis (HDDA). In a similar manner, the associated clustering … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
74
0

Year Published

2012
2012
2018
2018

Publication Types

Select...
6
1
1

Relationship

4
4

Authors

Journals

citations
Cited by 82 publications
(76 citation statements)
references
References 24 publications
2
74
0
Order By: Relevance
“…Firstly, the package HDclassif [7] for the R software provides routines for model-based clustering and classification of high-dimensional data. In particular, the hddc function implements the subspace clustering method of [20].…”
Section: Subspace Clustering Methodsmentioning
confidence: 99%
“…Firstly, the package HDclassif [7] for the R software provides routines for model-based clustering and classification of high-dimensional data. In particular, the hddc function implements the subspace clustering method of [20].…”
Section: Subspace Clustering Methodsmentioning
confidence: 99%
“…We can start doing cluster analysis on neighborhoods based on those vectors. Since the number of dimensions (120) is larger than the number of individuals (42), the clustering method High-Dimensional Data Clustering (HDDC) [12]), which is implemented in the R package HDclassif [13], is used for cluster analysis. This method has two desirable properties: the ability of dealing with high-dimensional low-sample data, and the optimal number of clusters automatically decided based on Bayesian Information Criterion.…”
Section: Cluster Analysis On Neighborhoodsmentioning
confidence: 99%
“…We here focus on the problem of discriminant analysis and we consider the 3-class NIR data set presented in [26]. The 3-class NIR data set contains 221 NIR spectra of manufactured textiles of various compositions, the classification problem consisting in the determination of a physical property which can take three discrete values [7]. The NIR spectra were measured on a XDS rapid content analyzer instrument (FOSS) in reflectance mode in the range 1100-2500 nm at 0.5 nm apparent resolution (2800 data points per spectrum).…”
Section: Why It Is Important Not To Reduce the Dimension Before Classmentioning
confidence: 99%
“…We consider the HDDA method which is implemented in the HDclassif package [7] for R. HDDA is the supervised classification method associated with the HD-GMM model presented in Section 4.2 and it is a subspace classification method.…”
Section: Discriminant Analysismentioning
confidence: 99%