2017
DOI: 10.1101/196915
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Enter the matrix: factorization uncovers knowledge from omics Names/Affiliations

Abstract: SummaryHigh-dimensional data is currently standard for biological inquiry. Biological systems are comprised of interrelated gene regulatory mechanisms, gene-gene interactions, and cellular interactions. These interactions induce low-dimensional structure within the high-dimensional data. Matrix factorization, also known as compressed sensing, learns low-dimensional mathematical representations from high-dimensional data. These factorization techniques can embed assumptions about pleiotropy, epistasis, inter-re… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 136 publications
0
2
0
Order By: Relevance
“…There are also technical considerations, as measuring combinations of genes will reduce the multiple hypothesis testing burden, can be useful for feature engineering, and is more likely to lead to more robust results than analyses of individual gene measurements (Cleary et al, 2017). These methods have advantages over two-group gene set-based comparisons because they provide more context for genes, are better fit to the underlying data, and remove the difficulty of identifying the most useful comparisons a priori (Stein-O'Brien et al, 2017a). Unsupervised machine learning (ML) methods including matrix factorization-and autoencoder-based approaches have successfully extracted biologically meaningful low-dimensional representations of gene expression data that can distinguish disease types, predict drug response, and identify new pathway regulators (Dincer et al, 2018;Stein-O'Brien et al, 2017b;Tan et al, 2017;Way and Greene, 2017).…”
Section: Introductionmentioning
confidence: 99%
“…There are also technical considerations, as measuring combinations of genes will reduce the multiple hypothesis testing burden, can be useful for feature engineering, and is more likely to lead to more robust results than analyses of individual gene measurements (Cleary et al, 2017). These methods have advantages over two-group gene set-based comparisons because they provide more context for genes, are better fit to the underlying data, and remove the difficulty of identifying the most useful comparisons a priori (Stein-O'Brien et al, 2017a). Unsupervised machine learning (ML) methods including matrix factorization-and autoencoder-based approaches have successfully extracted biologically meaningful low-dimensional representations of gene expression data that can distinguish disease types, predict drug response, and identify new pathway regulators (Dincer et al, 2018;Stein-O'Brien et al, 2017b;Tan et al, 2017;Way and Greene, 2017).…”
Section: Introductionmentioning
confidence: 99%
“…GRiNCH is based on non-negative matrix factorization (NMF), a powerful dimensionality reduction method used to recover interpretable low-dimensional structure from high-dimensional datasets [32][33][34].…”
Section: Introductionmentioning
confidence: 99%