2020
DOI: 10.1088/2632-2153/aba9ee
|View full text |Cite
|
Sign up to set email alerts
|

Determination of latent dimensionality in international trade flow

Abstract: Currently, high-dimensional data is ubiquitous in data science, which necessitates the development of techniques to decompose and interpret such multidimensional (aka tensor) datasets. Finding a low dimensional representation of the data, that is, its inherent structure, is one of the approaches that can serve to understand the dynamics of low dimensional latent features hidden in the data. Moreover, decomposition methods with non-negative constraints are shown to extract more insightful factors. Nonnegative R… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…The stability approach was recently utilized for RASCAL [26]. Here we integrated our algorithm with stability approach model selection, and for completeness, provide the main steps of this approach below:…”
Section: Rescal With Automatic Model Selectionmentioning
confidence: 99%
See 1 more Smart Citation
“…The stability approach was recently utilized for RASCAL [26]. Here we integrated our algorithm with stability approach model selection, and for completeness, provide the main steps of this approach below:…”
Section: Rescal With Automatic Model Selectionmentioning
confidence: 99%
“…To address the bottlenecks associated with decomposing such large tensors, we introduce a new efficient distributed algorithm for non-negative RESCAL factorization with a superior scaling and speed, capable of working on modern heterogeneous CPU/GPU architectures to factorize extralarge dense and sparse tensors. We integrated our algorithm with a model selection method based on the stability of the extracted latent communities (the columns of matrix A) [26], and call it pyDRESCALk. We evaluate pyDRESCALk on several extra-large synthetic tensors as well as on realworld data, and show that in all cases the predicted latent communities are highly correlated with the predetermined ground-truth solutions and pyDRESCALk determines accurately the latent dimension.…”
Section: Introductionmentioning
confidence: 99%
“…If we perfectly reconstruct the input tensor, our tensor factorization carries little information about peer groups or other shared structure; if our rank is too low, we lose vital information. Truong et al discussed this problem in [18], and introduced the Non-Negative RESCAL method. Minimal multirank in RESCAL is chosen to be the rank with low relative error and high silhouette score.…”
Section: X X ≈Xmentioning
confidence: 99%