2019
DOI: 10.1038/s41467-019-13423-8
|View full text |Cite
|
Sign up to set email alerts
|

In silico prediction of high-resolution Hi-C interaction matrices

Abstract: The three-dimensional (3D) organization of the genome plays an important role in gene regulation bringing distal sequence elements in 3D proximity to genes hundreds of kilobases away. Hi-C is a powerful genome-wide technique to study 3D genome organization. Owing to experimental costs, high resolution Hi-C datasets are limited to a few cell lines. Computational prediction of Hi-C counts can offer a scalable and inexpensive approach to examine 3D genome organization across multiple cellular contexts. Here we pr… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
65
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 63 publications
(68 citation statements)
references
References 53 publications
3
65
0
Order By: Relevance
“…We show that gradient boosting achieve the best predictions, yielding accuracies of ~ 95%. Moreover, we observe that chromatin features close to anchor regions contribute most to the predictions, in line with previous observations [1517]. We also find that gradient boosting models trained with transcription-associated signals alone, unlike the other algorithms tested, are able to generate accurate predictions in K562 cell line, with transcription factors at the anchors being among the most predictive features.…”
Section: Resultssupporting
confidence: 91%
See 1 more Smart Citation
“…We show that gradient boosting achieve the best predictions, yielding accuracies of ~ 95%. Moreover, we observe that chromatin features close to anchor regions contribute most to the predictions, in line with previous observations [1517]. We also find that gradient boosting models trained with transcription-associated signals alone, unlike the other algorithms tested, are able to generate accurate predictions in K562 cell line, with transcription factors at the anchors being among the most predictive features.…”
Section: Resultssupporting
confidence: 91%
“…Therefore, in silico predictions that take advantage of the wealth of publicly available sequencing data emerges as a rational strategy to generate virtual chromatin interaction maps in new cell types for which experimental maps are still lacking. To date, several studies have been devoted to predict chromatin loops based on one-dimensional (1D) genomic information with accurate results [1317]. In such works, authors have modeled loops using different designs and machine learning approaches.…”
Section: Introductionmentioning
confidence: 99%
“…from replicates, without accounting for the distance dependence signal leads to inflated correlation values 40 . To mitigate this effect, it is possible to stratify the correlation over the distance between interacting regions 40,41 . The Hi-C skeleton transformation employed already accounts for the distance dependence in terms of DNA polymer proximity.…”
Section: Distance Stratified Correlationmentioning
confidence: 99%
“…In addition, we chose to train MATCHA using multi-way interaction data only in this work, but the model can easily include other functional genomic signals as features of the corresponding genomic bins. This would in principle extend the existing work on predicting pairwise chromatin loops based on functional genomic data (Huang et al, 2015;Whalen et al, 2016;Zhang et al, 2018;Kai et al, 2018;Zhang et al, 2019). Finally, the multi-way chromatin interactions extracted by MATCHA could also be the foundation to connect transcriptional regulation and 3D genome organization (Stadhouders et al, 2019;Kim and Shendure, 2019;Tian et al, 2020).…”
Section: Discussionmentioning
confidence: 85%