2022
DOI: 10.1101/2022.07.09.499398
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Inferring cellular and molecular processes in single-cell data with non-negative matrix factorization using Python, R, and GenePattern Notebook implementations of CoGAPS

Abstract: Non-negative matrix factorization (NMF) is an unsupervised learning method well suited to high-throughput biology. Still, inferring biological processes requires additional post hoc statistics and annotation for interpretation of features learned from software packages developed for NMF implementation. Here, we aim to introduce a suite of computational tools that implement NMF and provide methods for accurate, clear biological interpretation and analysis. A generalized discussion of NMF covering its benefits,… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 69 publications
0
3
0
Order By: Relevance
“…CoGAPS 60 , 61 (v3.22), an R package that utilizes non-negative matrix factorization to uncover latent patterns of coordinated gene expression representative of shared biological functions, was used to identify transcriptional patterns associated with VSMC subpopulations, with the npatterns parameter set to 8, in scRNAseq analysis of murine aortas. ProjectR 62 (v1.2), an R package that enables integration and analysis of multiple scRNAseq data sets by identifying transcriptional patterns shared among datasets, was used to project these patterns into scRNAseq analysis of the human aortic root and ascending aorta.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…CoGAPS 60 , 61 (v3.22), an R package that utilizes non-negative matrix factorization to uncover latent patterns of coordinated gene expression representative of shared biological functions, was used to identify transcriptional patterns associated with VSMC subpopulations, with the npatterns parameter set to 8, in scRNAseq analysis of murine aortas. ProjectR 62 (v1.2), an R package that enables integration and analysis of multiple scRNAseq data sets by identifying transcriptional patterns shared among datasets, was used to project these patterns into scRNAseq analysis of the human aortic root and ascending aorta.…”
Section: Methodsmentioning
confidence: 99%
“…Gata4-expressing VSMC2 are intrinsically "poised" towards a less-differentiated, maladaptive proinflammatory transcriptional signature. To examine the biological features of VSMC1 and VSMC2, and whether they were recapitulated in both murine and patient-derived LDS VSMCs, we used the Coordinated Gene Activity in Pattern Sets (CoGAPS) algorithm to identify latent patterns of coordinated gene expression in the Tgfbr1 M318R/+ VSMC mouse dataset 60,61 . Two patterns, transcriptional patterns 4 and 5, were found to be enriched in the VSMC2 and VSMC1 subclusters, respectively, in the Tgfbr1 M318R/+ VSMC mouse dataset (Fig.…”
Section: Expression Of Cluster-defining Transcripts For the Vsmc2 And...mentioning
confidence: 99%
“…Spots with greater than 70% purity of ductal cells were further annotated to assign agent types for the associated tumor and normal cells in each spot. For this annotation, we used our transfer learning method ProjectR 123 version 1.8.0 to distinguish proliferative signaling (modeled as an epithelial phenotype) from co-occurrence of EMT and inflammatory signaling (modeled as the mesenchymal phenotype) as defined in CoGAPS non-negative matrix factorization analysis of our reference scRNA-seq atlas of PDAC tumors using methods described previously 67,87,124 . To locate fibroblasts, Seurat version 4.1.0 was used to compute module scores from a pan-CAF gene signature as described previously 67 .…”
Section: Methodsmentioning
confidence: 99%
“…Single-cell data provide opportunities to isolate subpopulations for inferring intercellular interactions and identifying patterns of cell signaling associated with individual cell types or cell stressors. To investigate the inflammatory signaling responsible for major histocompatibility class (MHC) II expression in tumor cells demonstrated in prior work 1719 , we focused on inflammatory signaling in epithelial tumor cells learned from applying our Bayesian non-negative matrix factorization method CoGAPS 20 for discovery of gene expression patterns in this comprehensive atlas. Even with a larger cohort for analysis, in silico analyses of these data are limited in their ability to evaluate mechanism and implications of intercellular interactions.…”
Section: Main Textmentioning
confidence: 99%