2019
DOI: 10.1186/s12859-018-2572-9
|View full text |Cite
|
Sign up to set email alerts
|

A whitening approach to probabilistic canonical correlation analysis for omics data integration

Abstract: BackgroundCanonical correlation analysis (CCA) is a classic statistical tool for investigating complex multivariate data. Correspondingly, it has found many diverse applications, ranging from molecular biology and medicine to social science and finance. Intriguingly, despite the importance and pervasiveness of CCA, only recently a probabilistic understanding of CCA is developing, moving from an algorithmic to a model-based perspective and enabling its application to large-scale settings.ResultsHere, we revisit… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
31
0
2

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 40 publications
(33 citation statements)
references
References 28 publications
0
31
0
2
Order By: Relevance
“…For example, Bylesjö et al [ 1 ] used the genomic variation present in the genomic dataset to harness the inter-individual heterogeneity resulting from baseline fluctuation and differentiate it from treatment induced variation. However, integrative analysis sometime needs robust models to account for individual variations [ 53 , 59 ].…”
Section: Challenges In Metabolomics and Multi-omics Data Integratimentioning
confidence: 99%
“…For example, Bylesjö et al [ 1 ] used the genomic variation present in the genomic dataset to harness the inter-individual heterogeneity resulting from baseline fluctuation and differentiate it from treatment induced variation. However, integrative analysis sometime needs robust models to account for individual variations [ 53 , 59 ].…”
Section: Challenges In Metabolomics and Multi-omics Data Integratimentioning
confidence: 99%
“…Canonical Correlation Analysis (CCA) 70 was applied in order to investigate patterns of association between genes related to CMSP and NMI genes, considering the observations from the SC2 group alone. We retained the first two canonical variates (CVs) for subsequent interpretations.…”
Section: Canonical Correlation Analysismentioning
confidence: 99%
“…The sample median and sample interquartile range were calculated using R software version 4.0.2 (https://www.r-project.org/index.html). The NP-MANOVA, CCA, MVR, and Ridge Regression analysis were all performed on the R software version 4.0.2.Specifically, for NP-MANOVA, it was used the npmv package68 for CCA the whitening70 ,DFA, and CANCOR packages, for Ridge Regression the glmnet package and for PCA the psych package. MVR was implemented by the authors following the results of García et al(2003)…”
mentioning
confidence: 99%
“…Jendoubi et al proposed a CCA probabilistic model in the form of a two-layer latent variable model and used for integrated analysis of gene expression data, lipid concentration, and methylation level omics datasets. It provided a new strategy to unify the spheroidization process, multiple regression, and the corresponding probability model [ 31 ]. In addition, deep learning has been widely applied in many areas for big data analysis.…”
Section: Introductionmentioning
confidence: 99%