2021
DOI: 10.1186/s12859-021-04141-4
|View full text |Cite
|
Sign up to set email alerts
|

Concept recognition as a machine translation problem

Abstract: Background Automated assignment of specific ontology concepts to mentions in text is a critical task in biomedical natural language processing, and the subject of many open shared tasks. Although the current state of the art involves the use of neural network language models as a post-processing step, the very large number of ontology classes to be recognized and the limited amount of gold-standard training data has impeded the creation of end-to-end systems based entirely on machine learning. … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(13 citation statements)
references
References 53 publications
0
13
0
Order By: Relevance
“…We combine the best-performing ignorance classifiers (extending the work of [82] to create a corpus of 91 articles) with state-of-the-art biomedical concept classifiers [117] to create an ignorance-base that allows us to explore it by a topic and by experimental results. The ignorance-base can be queried by ignorance category, specific lexical cues, ontology concepts, or any combination of the three.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…We combine the best-performing ignorance classifiers (extending the work of [82] to create a corpus of 91 articles) with state-of-the-art biomedical concept classifiers [117] to create an ignorance-base that allows us to explore it by a topic and by experimental results. The ignorance-base can be queried by ignorance category, specific lexical cues, ontology concepts, or any combination of the three.…”
Section: Methodsmentioning
confidence: 99%
“…The continued annotation effort used Knowtator [129] and Protege [130] as in previous work [82], allowing the ignorance taxonomy (see Table 2) to be easily browsable like an ontology, and helping the annotators select the correct level of specificity for each lexical cue. The classification frameworks and models were also from our previous work [117,82].…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Most recent publications in this area have separated the ontology annotation task to two sub-tasks -1) span detection: detecting the part of text that corresponds to an ontology concept, and 2) concept normalization: identifying the ontology concept most appropriate for the identified piece of text [27,28]. Using the CRAFT corpus as a training set, the study reports that Bidirectional encoder representations from transformers for biomedical text mining (BioBERT) resulted in the best performance (0.81 F1) for the span detection sub-task.…”
Section: Related Workmentioning
confidence: 99%