2017
DOI: 10.1093/gigascience/gix032
|View full text |Cite
|
Sign up to set email alerts
|

GSuite HyperBrowser: integrative analysis of dataset collections across the genome and epigenome

Abstract: Background:Recent large-scale undertakings such as ENCODE and Roadmap Epigenomics have generated experimental data mapped to the human reference genome (as genomic tracks) representing a variety of functional elements across a large number of cell types. Despite the high potential value of these publicly available data for a broad variety of investigations, little attention has been given to the analytical methodology necessary for their widespread utilisation.Findings:We here present a first principled treatm… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
7
3

Relationship

2
8

Authors

Journals

citations
Cited by 23 publications
(26 citation statements)
references
References 38 publications
0
26
0
Order By: Relevance
“…Generally, the SNP densities decreased up to 1.5-times and the difference in the densities of the pre-miRNA and its flanks re-appeared. After additionally removing the rare SNPs [82] the densities decreased much stronger and their hierarchy was completely restored (seed < miRNA < pre-miRNA < flanks, Additional file 7). The common SNPs are likely older, they have been subjected to selective forces over time [83] and produces the difference between seed and the rest of miRNA.…”
Section: Resultsmentioning
confidence: 99%
“…Generally, the SNP densities decreased up to 1.5-times and the difference in the densities of the pre-miRNA and its flanks re-appeared. After additionally removing the rare SNPs [82] the densities decreased much stronger and their hierarchy was completely restored (seed < miRNA < pre-miRNA < flanks, Additional file 7). The common SNPs are likely older, they have been subjected to selective forces over time [83] and produces the difference between seed and the rest of miRNA.…”
Section: Resultsmentioning
confidence: 99%
“…Motif occurrences in constituent enhancers of super enhancers were identified using FIMO (find individual motif occurrences) at p-value threshold of 10 -7 and enriched pathways were identified using hypergeometric testing at a threshold of FDR-adjusted P-value <0.05. We used the GSuite hyperbrowser program (Simovski et al 2017) to perform a statistical analysis of over-representation of accessibility peaks with ENCODE datasets. To determine which ENCODE datasets exhibit the strongest similarity to accessibility regions we used the Forbes coefficient to obtain rankings of tracks, and Monte Carlo simulation to provide a statistical assessment of the robustness of the rankings of data tracks, using a null model derived from randomizing the positions of the accessibility regions relative to query tracks.…”
Section: Discussionmentioning
confidence: 99%
“…To answer the question we implemented a hypothesis-testing framework inspired by work done in statistical genomics [37,38].…”
Section: Statistical Frameworkmentioning
confidence: 99%