2016
DOI: 10.1093/bib/bbv108
|View full text |Cite
|
Sign up to set email alerts
|

Dimension reduction techniques for the integrative analysis of multi-omics data

Abstract: State-of-the-art next-generation sequencing, transcriptomics, proteomics and other high-throughput ‘omics' technologies enable the efficient generation of large experimental data sets. These data may yield unprecedented knowledge about molecular pathways in cells and their role in disease. Dimension reduction approaches have been widely used in exploratory analysis of single omics data sets. This review will focus on dimension reduction approaches for simultaneous exploratory analyses of multiple data sets. Th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
253
0
3

Year Published

2017
2017
2023
2023

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 326 publications
(256 citation statements)
references
References 82 publications
(125 reference statements)
0
253
0
3
Order By: Relevance
“…This statistical technique projects the information in multi-variable/-parameter datasets onto low dimensions, typically two dimensions, which is useful here to facilitate meaningful comparisons and clustering of the commercial antibodies by means of optimal linear combinations (principal components, PCs) of the original metrics, that are represented visually in a biplot [33, 34]. For example, Hemmink et al [35] have used PCA to analyse immunological datasets following an influenza pathogenesis study in pigs showing the correlation of cytokine production, with viral titre over time.…”
Section: Methodsmentioning
confidence: 99%
“…This statistical technique projects the information in multi-variable/-parameter datasets onto low dimensions, typically two dimensions, which is useful here to facilitate meaningful comparisons and clustering of the commercial antibodies by means of optimal linear combinations (principal components, PCs) of the original metrics, that are represented visually in a biplot [33, 34]. For example, Hemmink et al [35] have used PCA to analyse immunological datasets following an influenza pathogenesis study in pigs showing the correlation of cytokine production, with viral titre over time.…”
Section: Methodsmentioning
confidence: 99%
“…Some methods search for a “consensus” after clustering patients by each platform separately [37], or cluster with protein-protein interactions [29], or patient similarity networks [80, 81]. Other methods formulate the problem as a “multi-view” matrix factorization and dimension reduction , or as a probabilistic model (reviewed in [82]). In all cases, a key challenge is the selection of features from each platform as inputs to the clustering algorithms; for example, it is possible to summarize mutations, gene expression, and DNA methylation events as binary alterations [80], and then treat any missing data as a non-alteration event.…”
Section: Analysis Approaches To Determine Molecular Subtypes and Cancmentioning
confidence: 99%
“…Although factor models that aim to address this have previously been proposed (e.g. Meng et al , , ; Tenenhaus et al , ; preprint: Singh et al , ), these methods either lack sparsity, which can reduce interpretability, or require a substantial number of parameters to be determined using computationally demanding cross‐validation or post hoc. Further challenges faced by existing methods are computational scalability to larger data sets, handling of missing values and non‐Gaussian data modalities, such as binary readouts or count‐based traits.…”
Section: Introductionmentioning
confidence: 99%