2012
DOI: 10.1002/wics.1231
|View full text |Cite
|
Sign up to set email alerts
|

Principal component analysis for interval data

Abstract: Principal component analysis for classical data is a method used frequently to reduce the effective dimension underlying a data set from p random variables to s p linear functions of those p random variables and their observed values. With contemporary large data sets, it is often the case that the data are aggregated in some meaningful scientific way such that the resulting data are symbolic data (such as lists, intervals, histograms, and the like); though symbolic data can and do occur naturally and in small… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0
1

Year Published

2016
2016
2023
2023

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 20 publications
(10 citation statements)
references
References 13 publications
0
9
0
1
Order By: Relevance
“…Consequently, several data analysis tools, including regression [23], [24], canonical analysis [25], and multi-dimensional scaling [26], have been developed for symbolic and interval-valued data. Given the popularity of PCA in data analysis, several interval-valued PCA algorithms have also been proposed [27], [28], [29], [30], most of which leverage the specific statistical and geometric meanings of principal components of a system of variables. As discussed above, interval NMF and PMF [9] also have been studied to resolve alignment approximation in face analysis and rating approximation in collaborative filtering.…”
Section: Analysis Of Symbolic and Interval-valued Datamentioning
confidence: 99%
“…Consequently, several data analysis tools, including regression [23], [24], canonical analysis [25], and multi-dimensional scaling [26], have been developed for symbolic and interval-valued data. Given the popularity of PCA in data analysis, several interval-valued PCA algorithms have also been proposed [27], [28], [29], [30], most of which leverage the specific statistical and geometric meanings of principal components of a system of variables. As discussed above, interval NMF and PMF [9] also have been studied to resolve alignment approximation in face analysis and rating approximation in collaborative filtering.…”
Section: Analysis Of Symbolic and Interval-valued Datamentioning
confidence: 99%
“…Initially, the use of interval data is motivated by the need to quickly and efficiently monitor large datasets [28], in addition to its ability to deal with missing values without the need to remove entire samples. Generating intervals by aggregation is a form of batch processing, which may not always be ideal.…”
Section: Moving Window Interval Data Aggregationmentioning
confidence: 99%
“…Consequently, several data analysis tools, including regression [23], [24], canonical analysis [25], and multi-dimensional scaling [26], have been developed for symbolic and interval-valued data. Given the popularity of PCA in data analysis, several interval-valued PCA algorithms have also been proposed [27], [28], [29], [30], most of which leverage the specific statistical and geometric meanings of principal components of a system of variables. As discussed above, interval NMF and PMF [9] also have In contrast, we develop a more general interval-valued latent semantic alignment algorithm which can be integrated in common matrix factorization approaches that directly leverages interval-valued properties.…”
Section: Analysis Of Symbolic and Interval-valued Datamentioning
confidence: 99%