2020
DOI: 10.1186/s13073-020-00769-8
|View full text |Cite
|
Sign up to set email alerts
|

Modeling and analysis of Hi-C data by HiSIF identifies characteristic promoter-distal loops

Abstract: Current computational methods on Hi-C analysis focused on identifying Mb-size domains often failed to unveil the underlying functional and mechanistic relationship of chromatin structure and gene regulation. We developed a novel computational method HiSIF to identify genome-wide interacting loci. We illustrated HiSIF outperformed other tools for identifying chromatin loops. We applied it to Hi-C data in breast cancer cells and identified 21 genes with gained loops showing worse relapse-free survival in endocri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6

Relationship

3
3

Authors

Journals

citations
Cited by 12 publications
(12 citation statements)
references
References 66 publications
1
11
0
Order By: Relevance
“…Annotation to the reference genome were performed on four defined genomic regions: the TSS/TES regions between −5kb and + 1kp to the TSS (transcription start site)/TES (transcription end site) of protein-coding genes, and the gene body region between TSS and TES, and the 5′distance regions were calculated from 1 Mb to 5 kb upstream of the TSS. These four defined regions are similar to the previous publications [50] , [51] in differential methylation analysis and identification of promoter-distal loops. Gene expression levels are measured as reads per kilobase of transcript per million mapped reads (RPKM) of RNA-Seq experiments and were computed by applying the featureCounts [52] and our in-house Python code on aligned BAM files.…”
Section: Methodssupporting
confidence: 88%
“…Annotation to the reference genome were performed on four defined genomic regions: the TSS/TES regions between −5kb and + 1kp to the TSS (transcription start site)/TES (transcription end site) of protein-coding genes, and the gene body region between TSS and TES, and the 5′distance regions were calculated from 1 Mb to 5 kb upstream of the TSS. These four defined regions are similar to the previous publications [50] , [51] in differential methylation analysis and identification of promoter-distal loops. Gene expression levels are measured as reads per kilobase of transcript per million mapped reads (RPKM) of RNA-Seq experiments and were computed by applying the featureCounts [52] and our in-house Python code on aligned BAM files.…”
Section: Methodssupporting
confidence: 88%
“…We applied HiSIF [ 39 ] to identify significant interaction fragments (SIFs) for the Hi-C data in both 2D monolayers and 3D spheroids. At the optimal parameters set (t = 1, 20 Kb, FDR = 0.1), we obtained a total of 577,585 for MCF7_2D, 431,056 for MCF7_3D, 327,064 for MCF7TR_2D and 319,644 for MCF7TR_3D, respectively.…”
Section: Resultsmentioning
confidence: 99%
“…We used HiSIF [ 39 ] to identify significant interaction fragments (SIFs) with all valid pairs from HiC-Pro [ 35 ] and HISIF parameters of t = 1, s = 2, p = 1 29, w = 50, 500, 20,000. We further used FDR ≤ 0.1 as the cutoff to select the final set of SIFs.…”
Section: Methodsmentioning
confidence: 99%
“…For example, NPS/iNPS [24] , [25] applied a gaussian convolution algorithm to detect inflection points to find candidate nucleosomes, and a novel iterative correction (ICE) algorithm [26] was developed for balancing the biases in Hi-C data. The computational algorithms are often further implemented as software tools for user-friendly and interactive interfaces or downloadable executable files in public code repositories, such as NucHMM [27] for identifying nucleosome states, JuiceBox [28] for visualizing and analyzing Hi-C data, and HiSIF [29] for detecting significant interacting fragments. In addition, a series of methods and tools are sometimes combined into a workflow/pipeline to achieve its meaningful biological output.…”
Section: Introductionmentioning
confidence: 99%