2021
DOI: 10.1016/j.is.2020.101710
|View full text |Cite
|
Sign up to set email alerts
|

Fast, scalable and geo-distributed PCA for big data analytics

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 19 publications
0
3
0
Order By: Relevance
“…PCA is an unsupervised statistical method for reducing dimensions of a database by linear combinations of a starting set of variables based on their maximum variance [37] and can convert original variables into new independent variables named principal components (PCs) [7]. It can make a preliminary judgement on the distribution status, natural aggregation, and abnormal samples [38].…”
Section: Discussionmentioning
confidence: 99%
“…PCA is an unsupervised statistical method for reducing dimensions of a database by linear combinations of a starting set of variables based on their maximum variance [37] and can convert original variables into new independent variables named principal components (PCs) [7]. It can make a preliminary judgement on the distribution status, natural aggregation, and abnormal samples [38].…”
Section: Discussionmentioning
confidence: 99%
“…If the dimensionality of X is large, then the computation of the eigenvalues will be time consuming, and may cause memory overflow (e.g. Adnan et al, 2021).…”
Section: Methodsmentioning
confidence: 99%
“…This limitation triggered the develop-ment of PCA extensions, including sparsity assumptions (e.g. Fan et al, 2018;Adnan et al, 2021), and massive parallelism (Lazcano et al, 2017). Battulga et al (2020) suggested a hash-tree PCA to accelerate conventional PCA by sampling similar objects while preserving the original data distribution.…”
Section: Introductionmentioning
confidence: 99%