2018
DOI: 10.48550/arxiv.1804.02502
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Principal Component Analysis: A Natural Approach to Data Exploration

Felipe L. Gewers,
Gustavo R. Ferreira,
Henrique F. de Arruda
et al.

Abstract: Principal component analysis (PCA) is often used for analysing data in the most diverse areas. In this work, we report an integrated approach to several theoretical and practical aspects of PCA. We start by providing, in an intuitive and accessible manner, the basic principles underlying PCA and its applications. Next, we present a systematic, though no exclusive, survey of some representative works illustrating the potential of PCA applications to a wide range of areas. An experimental investigation of the ab… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0
1

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 187 publications
(239 reference statements)
0
6
0
1
Order By: Relevance
“…66 Data standardization is recommended when the raw variables have been measured on a significantly different scale. 67 Other distance-based algorithms such as k-NN, Kmeans, and SVM are also dramatically affected by the scale of the features, while tree-based algorithms and Nai ̈ve Bayes classifiers are less affected. Unfortunately, data normalization is often ignored or not clarified in the environmental application practice of ML.…”
Section: ■ Concluding Remarksmentioning
confidence: 99%
“…66 Data standardization is recommended when the raw variables have been measured on a significantly different scale. 67 Other distance-based algorithms such as k-NN, Kmeans, and SVM are also dramatically affected by the scale of the features, while tree-based algorithms and Nai ̈ve Bayes classifiers are less affected. Unfortunately, data normalization is often ignored or not clarified in the environmental application practice of ML.…”
Section: ■ Concluding Remarksmentioning
confidence: 99%
“…The running index (last 3 months) of the newly created image is maintained for later search and retrieval [5]. Due to the continuous nature of the application, hash-based indexing is the preferred choice over the technique of collectively learning representations from static datasets such as Principal component analysis (PCA) [3]. As the catalogue changes, so do the best key components, which require frequent recalculations.…”
Section: Similar Image Retrieval (Sir)mentioning
confidence: 99%
“…In order to better understand the variation of the diversity with the parameters, we flattened the obtained values of D and calculated the respective PCA (Principal Component Analysis) projection [31,32] (see Figure 4). An interesting result concerns the separation of the cases into three regions in terms of the average degree, identified by respective ellipses in Figure 4.…”
Section: A No Reconnection Constraintmentioning
confidence: 99%