2014
DOI: 10.1136/amiajnl-2013-002381
|View full text |Cite
|
Sign up to set email alerts
|

A comprehensive study of named entity recognition in Chinese clinical text

Abstract: Our evaluation on the independent test set showed that most types of feature were beneficial to Chinese NER systems, although the improvements were limited. The system achieved the highest performance by combining word segmentation and section information, indicating that these two types of feature complement each other. When the same types of optimized feature were used, CRF and SSVM outperformed SVM and ME. More specifically, SSVM achieved the highest performance of the four algorithms, with F-measures of 93… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
97
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
5
2
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 169 publications
(97 citation statements)
references
References 12 publications
0
97
0
Order By: Relevance
“…In particular, we choose a linear chain CRF; previous research shows that these perform well for various natural language processing tasks, especially Named Entity Recognition [Lei et al, 2014]. In contrast to general purpose noun phrase extractors used by some other existing models for this task, a CRF can easily exploit the information of the given annotated files as features.…”
Section: Methodsmentioning
confidence: 99%
“…In particular, we choose a linear chain CRF; previous research shows that these perform well for various natural language processing tasks, especially Named Entity Recognition [Lei et al, 2014]. In contrast to general purpose noun phrase extractors used by some other existing models for this task, a CRF can easily exploit the information of the given annotated files as features.…”
Section: Methodsmentioning
confidence: 99%
“…Named entities are key fundamental elements in paragraphs and notes [5]. In clinical narratives, named entities are meaningful phrases such as diseases, symptoms, tests, medications, treatments and so on [6], which constitute the primary lines of the document and could be quickly parsed and reused to support writing assistant.…”
Section: Named Entity Recognitionmentioning
confidence: 99%
“…The early named entity recognition (NER) mostly uses rule-based methods, but they are highly dependent on language, field and text style of corpuses with limitation on cost effectiveness. Nowadays, NER researches mostly adopt statistical machine learning methods, such as the common use of HMM [1][2], SVM [3] [4], conditional random fields (CRF) [6] [7] etc., and more researches indicate that CRF is of favorable effect in NER work [1] [3].…”
Section: Introductionmentioning
confidence: 99%