2017
DOI: 10.1186/s12859-017-1805-7
|View full text |Cite
|
Sign up to set email alerts
|

CNN-based ranking for biomedical entity normalization

Abstract: BackgroundMost state-of-the-art biomedical entity normalization systems, such as rule-based systems, merely rely on morphological information of entity mentions, but rarely consider their semantic information. In this paper, we introduce a novel convolutional neural network (CNN) architecture that regards biomedical entity normalization as a ranking problem and benefits from semantic information of biomedical entities.ResultsThe CNN-based ranking method first generates candidates using handcrafted rules, and t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
83
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 104 publications
(84 citation statements)
references
References 11 publications
1
83
0
Order By: Relevance
“…Later, some work [11,12] proposed to convert mentions and candidate entities into a common vector space, and then disambiguated candidate entities by a scoring function (e.g., cosine similarity). In recent years, neural network-based approaches have shown considerable success in entity normalization [13][14][15]. These methods used neural architectures to learn the context representations around an entity mention and calculated the context-entity similarity scores to determine which candidate is a correct assignment.…”
Section: Introductionmentioning
confidence: 99%
“…Later, some work [11,12] proposed to convert mentions and candidate entities into a common vector space, and then disambiguated candidate entities by a scoring function (e.g., cosine similarity). In recent years, neural network-based approaches have shown considerable success in entity normalization [13][14][15]. These methods used neural architectures to learn the context representations around an entity mention and calculated the context-entity similarity scores to determine which candidate is a correct assignment.…”
Section: Introductionmentioning
confidence: 99%
“…Abstracts Total Unique Training set 692 5932 1538 Test set 100 960 427 Total 792 6892 1965 Table 1: NCBI Disease Corpus Statistics domain sub-word level information 2 in solving the task of disease normalization. 3) Unlike existing systems (D'Souza and Ng, 2015), (Li et al, 2017), we present a robust and portable candidate generation approach without making use of external resources or hand-engineered sieves to deal with morphological variations. Our system achieves state-of-the-art performance on NCBI disease dataset (Dogan et al, 2014) 2 Dataset…”
Section: Datasetmentioning
confidence: 99%
“…Accuracy (D'Souza and Ng, 2015) 84.65 (Li et al, 2017) 86 We choose the evaluation measure as accuracy. Since, the highest similar candidate is of primary interest in the task of entity linking, so we choose the top-K ( Where K = 1).…”
Section: Model Namementioning
confidence: 99%
“…Besides the bacteria biotopes, there exist a significant amount of prior work on biomedical named entity normalization for different types of biomedical entities including genes/proteins (Morgan et al, 2008;Wermter et al, 2009;Lu et al, 2011; and diseases (Leaman et al, 2013;Li et al, 2017). However, the need for manually annotated training data makes the adaptation of such methods to new entities difficult.…”
Section: Named Entity Normalizationmentioning
confidence: 99%