2008
DOI: 10.1093/bioinformatics/btn299
|View full text |Cite
|
Sign up to set email alerts
|

Inter-species normalization of gene mentions with GNAT

Abstract: The test data set, lexica, and links toexternal data are available at http://cbioc.eas.asu.edu/gnat/

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
86
0

Year Published

2010
2010
2013
2013

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 113 publications
(86 citation statements)
references
References 12 publications
0
86
0
Order By: Relevance
“…Thus, we backed off from some approaches presented previously, such as GNAT for entity mention normalization and an alignment-based pattern matching algorithm (see [20], [19]). The focus of all building blocks in our systems is on annotation time, and our goal was to provide a service that can handle a full-text article in 10 seconds and still have reasonable accuracy; our most accurate composition/tuning parameters should still be able to analyze a full text in about two minutes.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…Thus, we backed off from some approaches presented previously, such as GNAT for entity mention normalization and an alignment-based pattern matching algorithm (see [20], [19]). The focus of all building blocks in our systems is on annotation time, and our goal was to provide a service that can handle a full-text article in 10 seconds and still have reasonable accuracy; our most accurate composition/tuning parameters should still be able to analyze a full text in about two minutes.…”
Section: Methodsmentioning
confidence: 99%
“…NER systems in recent years have tended towards primarily employing machine learning techniques, including conditional random fields, due to the consistently high performance these techniques provide when trained on a high-quality corpora such as the BioCreative II gene mention corpus [22]. In contrast, however, systems for identifying proteins (also called normalization and grounding, EMN) have largely focused on dictionary-based techniques; some notable recent systems include GNAT [20] and GeNo [23]. For BioCreative II.5, we have identified several attributes which are useful for supporting the identification of proteins found, including: 1) the ability to associate a confidence with each mention found; 2) improving consistency via enforcing a one-senseper-document assumption; 3) generating a list of candidate proteins (identifications) to which each mention could refer.…”
Section: Named Entity Recognition and Identificationmentioning
confidence: 99%
See 3 more Smart Citations