2020
DOI: 10.1186/s13059-020-02194-x
|View full text |Cite
|
Sign up to set email alerts
|

STARRPeaker: uniform processing and accurate identification of STARR-seq active regions

Abstract: STARR-seq technology has employed progressively more complex genomic libraries and increased sequencing depths. An issue with the increased complexity and depth is that the coverage in STARR-seq experiments is non-uniform, overdispersed, and often confounded by sequencing biases, such as GC content. Furthermore, STARR-seq readout is confounded by RNA secondary structure and thermodynamic stability. To address these potential confounders, we developed a negative binomial regression framework for uniformly proce… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
10

Relationship

1
9

Authors

Journals

citations
Cited by 41 publications
(34 citation statements)
references
References 60 publications
0
34
0
Order By: Relevance
“…The reads of the two replicates from each sample were sorted and merged and reads falling into regions from the ENCODE blacklist of the mouse reference genome were removed. STARRPeaker (version 1.0) ( 71 ) was used to identify potential enhancers with default parameters and an adjusted P -value threshold of 0.05. Potential enhancers called from STARR-seq data were associated with expressed TE-associated DHSs by overlap using BEDTools.…”
Section: Methodsmentioning
confidence: 99%
“…The reads of the two replicates from each sample were sorted and merged and reads falling into regions from the ENCODE blacklist of the mouse reference genome were removed. STARRPeaker (version 1.0) ( 71 ) was used to identify potential enhancers with default parameters and an adjusted P -value threshold of 0.05. Potential enhancers called from STARR-seq data were associated with expressed TE-associated DHSs by overlap using BEDTools.…”
Section: Methodsmentioning
confidence: 99%
“…This method allows for the simultaneous screening of the entire genome for enhancer activity [ 95 , 102 , 106 ]. There are a number of available methods for analysis of STARR-seq data and identification of enhancer peaks [ 107 , 108 ]. Drawbacks of STARR-seq are twofold; the first being that many enhancers are “context dependent”, meaning that their position in the genome is important, and the STARR-seq approach removes DNA fragments from their genomic context.…”
Section: Direct Methods For Enhancer Discoverymentioning
confidence: 99%
“…SRP118092 ( https://trace.ncbi.nlm.nih.gov/Traces/sra/?study=SRP118092 ) [ 49 ]) and processed as described by Wang et al [ 50 ]. By analyzing this dataset, we identified ~ 47,000 promoter-distal genomic regions that are enriched for HiDRA signals which was referred to as “STARR active enhancers.” STARR-seq active regions and read density profiles in K562 cells were collected [ 51 ].…”
Section: Methodsmentioning
confidence: 99%