2004
DOI: 10.1016/s1672-0229(04)02028-5
|View full text |Cite
|
Sign up to set email alerts
|

A Brief Review of Computational Gene Prediction Methods

Abstract: With the development of genome sequencing for many organisms, more and more raw sequences need to be annotated. Gene prediction by computational methods for finding the location of protein coding regions is one of the essential issues in bioinformatics. Two classes of methods are generally adopted: similarity based searches and ab initio prediction. Here, we review the development of gene prediction methods, summarize the measures for evaluating predictor quality, highlight open problems in this area, and disc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
52
0
1

Year Published

2007
2007
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 87 publications
(53 citation statements)
references
References 28 publications
(31 reference statements)
0
52
0
1
Order By: Relevance
“…At the moment, nearly 50 complete genomes of eukaryotes are publicly available, and many more are in the pipeline to be sequenced in the next few years (Liolios et al 2006). The proliferation of genome sequencing projects has driven the search for fast ways of sequence-based structural annotation, which involves the identification of genes and the modeling of their correct gene structure (Claverie et al 1997;Mathé et al 2002;Zhang 2002;Wang et al 2004). Although great progress has been achieved in gene prediction, for instance by using comparative approaches (Wasserman et al 2000;Liu et al 2004;Jin et al 2006;Wang and Zhang 2006), one of the more difficult tasks in the annotation of whole genomes remains the accurate identification and delineation of promoters (Fickett and Hatzigeorgiou 1997;Ohler 2000Ohler , 2001Bajic et al 2004Bajic et al , 2006a.…”
mentioning
confidence: 99%
“…At the moment, nearly 50 complete genomes of eukaryotes are publicly available, and many more are in the pipeline to be sequenced in the next few years (Liolios et al 2006). The proliferation of genome sequencing projects has driven the search for fast ways of sequence-based structural annotation, which involves the identification of genes and the modeling of their correct gene structure (Claverie et al 1997;Mathé et al 2002;Zhang 2002;Wang et al 2004). Although great progress has been achieved in gene prediction, for instance by using comparative approaches (Wasserman et al 2000;Liu et al 2004;Jin et al 2006;Wang and Zhang 2006), one of the more difficult tasks in the annotation of whole genomes remains the accurate identification and delineation of promoters (Fickett and Hatzigeorgiou 1997;Ohler 2000Ohler , 2001Bajic et al 2004Bajic et al , 2006a.…”
mentioning
confidence: 99%
“…The proposed methods performed well in a good number of cases. The accuracy measures for evaluating the different methods used in this paper are exon-intron discrimination factor D [23], sensitivity (S N ), specificity (S P ), miss rate (M R ), wrong rate (W R ) [3,15] and approximate correlation [28]. The discriminating factor is defined as Lowest of exon peaks Highest peak in noncoding regions D …”
Section: Resultsmentioning
confidence: 99%
“…For the last few decades, the major task of DNA and protein analysis, has been on string matching, either with a goal of obtaining a precise solution, e.g., with dynamic programming, or more commonly a fast solution, e.g., with heuristic techniques such as BLAST and several versions of FASTA [3]. But any of the string matching methodologies could not lead to satisfactory results.…”
Section: Introductionmentioning
confidence: 99%
“…There are several methods that have been applied for detecting coding regions in a genome, among which the most used are those based on sequence comparison and ab initio methods (Wang et al, 2004). Homology gene finding methods are based in given a DNA sequence identifying its homologous sequences in other genomes stored in databases.…”
Section: Introductionmentioning
confidence: 99%