2020
DOI: 10.1016/j.jbi.2019.103356
|View full text |Cite
|
Sign up to set email alerts
|

Terminologies augmented recurrent neural network model for clinical named entity recognition

Abstract: ObjectiveWe aimed to enhance the performance of a supervised model for clinical named-entity recognition (NER) using medical terminologies. In order to evaluate our system in French, we built a corpus for 5 types of clinical entities. MethodsWe used a terminology-based system as baseline, built upon UMLS and SNOMED. Then, we evaluated a biGRU-CRF, and an hybrid system using the prediction of the terminology-based system as feature for the biGRU-CRF. In English, we evaluated the NER systems on the i2b2-2009 Med… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
2

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 31 publications
(24 citation statements)
references
References 17 publications
0
22
2
Order By: Relevance
“…Moreover, we took the condition of the intake, and not the reason for the intake, into consideration (which is more specific), and we added a tag regarding the class name; therefore, overall F-measures cannot be compared. Compared with results from a study [33] using a different French-language corpus that obtained a token-level F-measure of 90.4, our system's raw results were higher. Comparisons should be made with caution because the corpus used in [33], though in the same language, was from a different source and contained only 147 documents.…”
Section: Related Workcontrasting
confidence: 62%
See 2 more Smart Citations
“…Moreover, we took the condition of the intake, and not the reason for the intake, into consideration (which is more specific), and we added a tag regarding the class name; therefore, overall F-measures cannot be compared. Compared with results from a study [33] using a different French-language corpus that obtained a token-level F-measure of 90.4, our system's raw results were higher. Comparisons should be made with caution because the corpus used in [33], though in the same language, was from a different source and contained only 147 documents.…”
Section: Related Workcontrasting
confidence: 62%
“…Compared with results from a study [33] using a different French-language corpus that obtained a token-level F-measure of 90.4, our system's raw results were higher. Comparisons should be made with caution because the corpus used in [33], though in the same language, was from a different source and contained only 147 documents.…”
Section: Related Workcontrasting
confidence: 62%
See 1 more Smart Citation
“…French was the third language (seven mentions) as in the work by Lerner et al 6 , followed by three other European languages with less than five mentions: German, Italian 7 , and Spanish 8 ;…”
Section: Principal Findingsmentioning
confidence: 84%
“…Yet, we can consider that the papers that did not explicitly indicate the language should also be dedicated to the processing of data in English ; • Chinese became the second language processed in medical NLP papers with 17 mentions. Among the papers published in 2019, we can mention Guan et al [3] working on the generation of synthetic medical record texts, Chen et al [4] aiming at identifying named entities, and Zheng et al [5] interested by the detection of medical text similarity ; • French was the third language (seven mentions) as in the work by Lerner et al [6], followed by three other European languages with less than five mentions: German, Italian [7], and Spanish [8] ; • Other languages identified in the abstracts accounted for one or two papers and included both languages spoken by millions of people (Arabic, Portuguese, Russian) and languages spoken by small communities (Basque, Danish, Japanese, Korean, Lithuanian, Persian, Romanian, Turkish, and Urdu).…”
Section: The Languages Addressedmentioning
confidence: 99%