2008
DOI: 10.1007/s10791-008-9053-0

Combining gene sequence similarity and textual information for gene function annotation in the literature

Abstract: Annotation of the functions of genes and proteins is an essential step in genome analysis. Information extraction techniques have been proposed to obtain the function information of genes and proteins in the biomedical literature. However, the performance of most information extraction techniques of function annotation in the biomedical literature is not satisfactory due to the large variability in the expression of concepts in the biomedical literature. This paper proposes a framework to improve the gene func…

Cited by 6 publications (2 citation statements)
References 27 publications
“…Recent years have observed development of advanced approaches for sequence‐based function prediction, 15–19 which achieve an improved accuracy and coverage in genome‐scale function assignment. Moreover, many function prediction methods have been developed that utilize other types of data, such as protein–protein interaction data, 20, 21 gene expression data, 22 and text mining, 23, 24 or combination of such heterogeneous data. 25, 26 Of recent particular importance is functional characterization of proteins from their tertiary structures as an increasing number of protein structures of unknown function have been solved by ongoing structural genomics projects.…”
Section: Introduction (mentioning)
confidence: 99%
“…Such methods include those which use BLAST or PSI-BLAST search results systematically by applying algorithmic techniques and making use of the Gene Ontology (GO) vocabulary structure [80] (e.g. Gotcha [81], GoFigure [82], OntoBlast [83], PFP [78, 79, 84, 85], ESG [86], and ConFunc [87]). Another direction of recent development is to consider phylogenetic trees aiming more specific function prediction among protein subfamilies (e.g.…”
Section: Sequence-based Function Prediction Methods That Use Weakly S… (mentioning)
confidence: 99%