2023
DOI: 10.18637/jss.v106.i09
|View full text |Cite
|
Sign up to set email alerts
|

intRinsic: An R Package for Model-Based Estimation of the Intrinsic Dimension of a Dataset

Abstract: This article illustrates intRinsic, an R package that implements novel state-of-the-art likelihood-based estimators of the intrinsic dimension of a dataset, an essential quantity for most dimensionality reduction techniques. In order to make these novel estimators easily accessible, the package contains a small number of high-level functions that rely on a broader set of efficient, low-level routines. Generally speaking, intRinsic encompasses models that fall into two categories: homogeneous and heterogeneous … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 41 publications
0
4
0
Order By: Relevance
“…A closed-form expression for the posterior distribution is not available, so we rely on Markov Chain Monte Carlo (MCMC) techniques to simulate a posterior sample. The interested reader can find more technical discussions of this model specification and the validity of the underlying hypothesis in the Supplementary Material of related papers 14 , 15 . In these references, one can also find more details about the Gibbs sampler algorithm used for fitting the model and the post-processing tools adopted to deal with computational issues such as label-switching.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…A closed-form expression for the posterior distribution is not available, so we rely on Markov Chain Monte Carlo (MCMC) techniques to simulate a posterior sample. The interested reader can find more technical discussions of this model specification and the validity of the underlying hypothesis in the Supplementary Material of related papers 14 , 15 . In these references, one can also find more details about the Gibbs sampler algorithm used for fitting the model and the post-processing tools adopted to deal with computational issues such as label-switching.…”
Section: Methodsmentioning
confidence: 99%
“…A sparse mixture modelling approach 19 , 20 is employed in this analysis, with mixture components, and for the Dirichlet priors of the mixture weights. Three matrices are produced as the output 15 : Membership labels (dim: ) where each column contains the MCMC sample of the membership labels for every observation; Cluster probabilities (dim: ) where each column contains the MCMC sample of the mixing weights for each mixture component; Intrinsic dimensions (dim: ) where each column contains an MCMC sample for every ID parameter estimated in each cluster. The MCMC chains produced by Hidalgo may exhibit label-switching issues, which prevents direct extraction of inference from the MCMC output.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations