2017
DOI: 10.1186/s13326-017-0157-6
|View full text |Cite
|
Sign up to set email alerts
|

Entity recognition in the biomedical domain using a hybrid approach

Abstract: BackgroundThis article describes a high-recall, high-precision approach for the extraction of biomedical entities from scientific articles.MethodThe approach uses a two-stage pipeline, combining a dictionary-based entity recognizer with a machine-learning classifier. First, the OGER entity recognizer, which has a bias towards high recall, annotates the terms that appear in selected domain ontologies. Subsequently, the Distiller framework uses this information as a feature for a machine learning algorithm to se… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 39 publications
(44 citation statements)
references
References 39 publications
0
44
0
Order By: Relevance
“…The concept annotations of version 3.0 of the corpus are now available in Knowtator 1, Knowtator 2, brat, and UIMA XMI formats (though no longer in AO RDF or Genia XML formats). Version 1.0 was used for evaluations in previous publications (e.g., [4], [5], and [6]), so performance of the sequence-to-sequence method was also evaluated using that first release. The sequence-to-sequence approach was evaluated on version 3.0 as well.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The concept annotations of version 3.0 of the corpus are now available in Knowtator 1, Knowtator 2, brat, and UIMA XMI formats (though no longer in AO RDF or Genia XML formats). Version 1.0 was used for evaluations in previous publications (e.g., [4], [5], and [6]), so performance of the sequence-to-sequence method was also evaluated using that first release. The sequence-to-sequence approach was evaluated on version 3.0 as well.…”
Section: Methodsmentioning
confidence: 99%
“…Groza et al [9] approached the task as in information retrieval using case sensitivity and information gain. Basaldella et al [6] described a system relying on high-recall dictionary lookups followed by a highprecision machine learning classifier. We compare our results to each of these systems.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Several text mining methods and tools have been developed to solve the problem of named entity recognition by using different approaches [14]. However, to handle with identifying biological entities issues, there are four possible and common approaches [2].…”
Section: Problem Statementmentioning
confidence: 99%
“…NER is followed by a normalization step mapping the entities to a fixed set of identifiers, such as HGNC gene symbols [Yates et al, 2017] or Disease Ontology terms [Kibbe et al, 2015]. General approaches such as LINNAEUS [Gerner et al, 2010], Tagger [Pafilis et al, 2013], taggerOne [Leaman and Lu, 2016], or OGER [Basaldella et al, 2017] recognize diverse biomedical entities in text, while specialized tools recognize mentions of genetic variants [Allot et al, 2018] or chemicals [Jessop et al, 2011].…”
Section: Introductionmentioning
confidence: 99%