2021
DOI: 10.1093/nargab/lqab012
|View full text |Cite
|
Sign up to set email alerts
|

F-Seq2: improving the feature density based peak caller with dynamic statistics

Abstract: Genomic and epigenomic features are captured at a genome-wide level by using high-throughput sequencing (HTS) technologies. Peak calling delineates features identified in HTS experiments, such as open chromatin regions and transcription factor binding sites, by comparing the observed read distributions to a random expectation. Since its introduction, F-Seq has been widely used and shown to be the most sensitive and accurate peak caller for DNase I hypersensitive site (DNase-seq) data. However, the first releas… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(11 citation statements)
references
References 28 publications
0
11
0
Order By: Relevance
“…However, to gauge performance and ensure viability, some proxy for ground truth is needed. Following [Zhao and Boyle, 2021], we constructed a union set , , of conservative Irreproducible Discovery Rate (IDR) [Li et al, 2011] peaks from ENCODE transcription factor ChIP-seq experiments in the lymphoblast cell line. We assume that the majority of annotated transcription factor binding sites will correspond to open chromatin regions, but we note that variability in binding at a snapshot in time, the incomplete annotation of all transcription factor binding, and cases where factors can bind to non-accessible chromatin introduce notable limitations.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…However, to gauge performance and ensure viability, some proxy for ground truth is needed. Following [Zhao and Boyle, 2021], we constructed a union set , , of conservative Irreproducible Discovery Rate (IDR) [Li et al, 2011] peaks from ENCODE transcription factor ChIP-seq experiments in the lymphoblast cell line. We assume that the majority of annotated transcription factor binding sites will correspond to open chromatin regions, but we note that variability in binding at a snapshot in time, the incomplete annotation of all transcription factor binding, and cases where factors can bind to non-accessible chromatin introduce notable limitations.…”
Section: Resultsmentioning
confidence: 99%
“…We then computed precision as and recall as where D X denotes the consensus peaks obtained from method X and set intersections in the numerators are computed using . The F β -score was then calculated as the harmonic mean of precision and recall where recall is weighed β times as much as precision, that is: As in [Zhao and Boyle, 2021], we use the ℱ β -score as the primary metric for comparison of methods since it intuitively combines both precision and recall and is less affected by extreme regions of the precision-recall curve that do not correspond to realistic use-cases.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…F-Seq2 [21] was implemented as a Linux command line program. It was downloaded from Github https://github.com/Boyle-Lab/F-Seq2.…”
Section: F-seq2mentioning
confidence: 99%