2005
DOI: 10.1038/nbt1053
|View full text |Cite
|
Sign up to set email alerts
|

Assessing computational tools for the discovery of transcription factor binding sites

Abstract: The prediction of regulatory elements is a problem where computational methods offer great hope. Over the past few years, numerous tools have become available for this task. The purpose of the current assessment is twofold: to provide some guidance to users regarding the accuracy of currently available tools in various settings, and to provide a benchmark of data sets for assessing future tools.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

15
1,154
3
12

Year Published

2006
2006
2017
2017

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 1,117 publications
(1,184 citation statements)
references
References 17 publications
15
1,154
3
12
Order By: Relevance
“…This is in practice a complex task because the application domain may be skewed in two ways 4 . First, for many relevant bioinformatics problems the prevalence of positives in nature q P = ( TP + FN )/( TP + TN + FP + FN ) does not necessarily match the training set q P and is hard to estimate 2, 5 . Second, the yields (or costs) for correct and incorrect classification of positives and negatives in the machine learning paradigm ( Y TP , Y TN , Y FP , Y FN ) may be different from each other and highly context-dependent 1, 3 .…”
Section: Introductionmentioning
confidence: 99%
“…This is in practice a complex task because the application domain may be skewed in two ways 4 . First, for many relevant bioinformatics problems the prevalence of positives in nature q P = ( TP + FN )/( TP + TN + FP + FN ) does not necessarily match the training set q P and is hard to estimate 2, 5 . Second, the yields (or costs) for correct and incorrect classification of positives and negatives in the machine learning paradigm ( Y TP , Y TN , Y FP , Y FN ) may be different from each other and highly context-dependent 1, 3 .…”
Section: Introductionmentioning
confidence: 99%
“…Computational prediction of cis-regulatory binding sites is widely acknowledged as a difficult task [1]. Binding sites are notoriously variable from instance to instance and they can be located considerable distances from the gene being regulated in higher eukaryotes.…”
Section: Introductionmentioning
confidence: 99%
“…A statistic comparing the accuracy of the main tools to discover TFBSs is found in Tompa [114], but it is very difficult to compare the performance of methods, in particular on complex genomes like the human genome.…”
Section: Promoter Analysismentioning
confidence: 99%