2019
DOI: 10.1093/bioinformatics/btz714
|View full text |Cite
|
Sign up to set email alerts
|

AssessORF: combining evolutionary conservation and proteomics to assess prokaryotic gene predictions

Abstract: Motivation A core task of genomics is to identify the boundaries of protein coding genes, which may cover over 90% of a prokaryote's genome. Several programs are available for gene finding, yet it is currently unclear how well these programs perform and whether any offers superior accuracy. This is in part because there is no universal benchmark for gene finding and, therefore, most developers select their own benchmarking strategy. Results … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

3
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 38 publications
3
8
0
Order By: Relevance
“…Functional annotations of predicted genes were performed using eggNOG-mapper v2 [7] with default parameters. While in principle alternative gene predictions can impact the subsequent functional annotation, previous empirical investigations found negligible performance variation among different tools [23] , [24] . For this reason, our tests focused on benchmarking functional annotation prediction by using a single state-of-the-art gene prediction tool.…”
Section: Resultsmentioning
confidence: 96%
“…Functional annotations of predicted genes were performed using eggNOG-mapper v2 [7] with default parameters. While in principle alternative gene predictions can impact the subsequent functional annotation, previous empirical investigations found negligible performance variation among different tools [23] , [24] . For this reason, our tests focused on benchmarking functional annotation prediction by using a single state-of-the-art gene prediction tool.…”
Section: Resultsmentioning
confidence: 96%
“…We minimized these potential errors by analyzing only genomes that had passed a high-quality CheckM filter ( Parks et al, 2015 ), yielding the 597 genomes used in our genomic analyses. However, even high-quality genomes are prone to errors of ORF annotation, especially in the identification of correct translation start sites ( Korandla et al, 2020 ), which will impact the predictions of gene and intergenic spacer sizes. Currently, there is no computational program for ORF prediction that is flawless, including GenBank ( Korandla et al, 2020 ), and we expect that future work will improve the annotations of ORFs used in our study.…”
Section: Resultsmentioning
confidence: 99%
“…al., 2020) which will impact predictions of gene and intergenic spacer sizes. Currently, there are no computational program for ORF prediction that is flawless, including GenBank (Korandla et. al., 2020), and we expect that future work will improve the annotations of ORFs used in our study.…”
Section: Resultsmentioning
confidence: 99%
“…= −0.11, p-value 0.06). A potential problem in the interpretation of this result stems from uncertainties in the identification of ORFs, most notably by errors in the identification of the correct site of initiation of protein coding regions (Korandla et. al., 2020).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation