2018
DOI: 10.1093/database/bay096
|View full text |Cite
|
Sign up to set email alerts
|

Wide-scope biomedical named entity recognition and normalization with CRFs, fuzzy matching and character level modeling

Abstract: We present a system for automatically identifying a multitude of biomedical entities from the literature. This work is based on our previous efforts in the BioCreative VI: Interactive Bio-ID Assignment shared task in which our system demonstrated state-of-the-art performance with the highest achieved results in named entity recognition. In this paper we describe the original conditional random field-based system used in the shared task as well as experiments conducted since, including better hyperparameter tun… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(11 citation statements)
references
References 26 publications
1
10
0
Order By: Relevance
“…However, traditional ML-based methods need extensive feature engineering, which is time-consuming and labor intensive. Based on their previous work [29], Kaewphan et al [30] further developed a BiLSTM-CRF based model, which used character embeddings learned by a Convolutional Neural Network (CNN) and the predictions from their original NERsuite model [29] as inputs of the recognition model. Neural network-based methods bring significant improvement in PNER performance (3.2% F1score improvement under strict criteria than before).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, traditional ML-based methods need extensive feature engineering, which is time-consuming and labor intensive. Based on their previous work [29], Kaewphan et al [30] further developed a BiLSTM-CRF based model, which used character embeddings learned by a Convolutional Neural Network (CNN) and the predictions from their original NERsuite model [29] as inputs of the recognition model. Neural network-based methods bring significant improvement in PNER performance (3.2% F1score improvement under strict criteria than before).…”
Section: Discussionmentioning
confidence: 99%
“…Typically, hand-crafted rules are clear and effective, but they are inflexible and hard to expand to a new dataset. Kaewphan et al [30] used the same method as their previous work [29] to perform PNEN, but based on their new recognition method. Compared with their previous results, they achieved 1.8% micro-averaged F1-score improvement from 0.397 to 0.415.…”
Section: Comparison With Related Work For Pnenmentioning
confidence: 99%
“…There are methods aimed at NER that have been developing during the last years (Kaewphan et al, 2018 ; Korvigo et al, 2018 ; Hemati and Mehler, 2019 ; Hong and Lee, 2020 ; Huang et al, 2020 ; Kilicoglu et al, 2020 ). Most of them are based on algorithms for NER related either to chemicals or biological objects.…”
Section: Introductionmentioning
confidence: 99%
“…Zero-Shot Linking with Approximate Dictionary Matching. Using Sim-String [10], we represent our restricted set of aliases from UMLS as character n-grams, similarly to previous works [11,12]. After experimenting with different sizes, we found that char n-grams of size 3 performed best.…”
Section: Solutionmentioning
confidence: 78%