2021
DOI: 10.1186/s13059-021-02378-z
|View full text |Cite
|
Sign up to set email alerts
|

GRiNCH: simultaneous smoothing and detection of topological units of genome organization from sparse chromatin contact count matrices with matrix factorization

Abstract: High-throughput chromosome conformation capture assays, such as Hi-C, have shown that the genome is organized into organizational units such as topologically associating domains (TADs), which can impact gene regulatory processes. The sparsity of Hi-C matrices poses a challenge for reliable detection of these units. We present GRiNCH, a constrained matrix-factorization-based approach for simultaneous smoothing and discovery of TADs from sparse contact count matrices. GRiNCH shows superior performance against se… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
31
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 23 publications
(31 citation statements)
references
References 86 publications
0
31
0
Order By: Relevance
“…Applying NMF to a Hi-C matrix yields low-dimensional factors or latent features which can be used to cluster the row or the column entities, i.e., genomic regions. These clusters of genomic regions correspond to densely interacting regions of the genome such as topologically associating domains (TADs) ( Lee and Roy, 2021 ). MVNMF extends NMF to a multi-task setting with multiple input matrices, each corresponding to a task or a time point (e.g.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Applying NMF to a Hi-C matrix yields low-dimensional factors or latent features which can be used to cluster the row or the column entities, i.e., genomic regions. These clusters of genomic regions correspond to densely interacting regions of the genome such as topologically associating domains (TADs) ( Lee and Roy, 2021 ). MVNMF extends NMF to a multi-task setting with multiple input matrices, each corresponding to a task or a time point (e.g.…”
Section: Resultsmentioning
confidence: 99%
“…We use a simple heuristic to pick , the number of the factors, which also is the number of clusters. Based on our previous work on single-task NMF to Hi-C data, we set such that the expected size of each cluster is about 1 Mb in length, which corresponds to the average size of TADs ( Lee and Roy, 2021 ). For example, for an input matrix that corresponds to a chromosome of size 10 Mb, we set Here, we used 56-282 factors to capture TAD like structures, corresponding to the size of the rat chromosomes.…”
Section: Methodsmentioning
confidence: 99%
“…Lee et al [ 85 ] developed a clustering method based on constrained non-negative matrix-factorization (NMF), called GRiNCH. GRiNCH is based on NMF, which is a strong dimensionality reduction technique to infer meaningful low-dimensional structures from higher-dimensional data.…”
Section: Methodsmentioning
confidence: 99%
“…Only in last two years, many techniques have been proposed to detect TADs across different species. These novel methods include the most recent TADBD, SpectralTAD, OnTAD, Constrained HAC, MSTD, GRiNCH, and deDoc [ 1 , 3 , 56 , 57 , 59 85 ]. Up-to-now, there are almost 31 TAD callers and Table 1 summarizes the characteristics of these methods.…”
Section: Introductionmentioning
confidence: 99%
“…It is worth mentioning that a set of methods were particularly developed for some variant Hi-C techniques such as ChIA-PET [76] , capture-Hi-C [77] ), HiChIP [78] and Split-Pool Recognition of Interactions by Tag Extension (SPRITE) [79] due to their sparse or asymmetric interaction matrices. GRiNCH [80] is one such tool based on a non-negative matrix factorization and graph regularization, to identify TADs of genome organization from sparse chromosome interaction matrices.…”
Section: Computational Methods In Mapping 3d Chromatin Domains and In...mentioning
confidence: 99%