2022
DOI: 10.1016/j.neucom.2021.10.101
|View full text |Cite
|
Sign up to set email alerts
|

Chinese named entity recognition: The state of the art

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
52
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 110 publications
(52 citation statements)
references
References 39 publications
0
52
0
Order By: Relevance
“…In the future, a much more reliable glossary of names should be established by making use of the organizational registers and donation registers of local Chinese societies to enhance the recognition accuracy of named entities (Humbel et al , 2021; Xu et al , 2020). Furthermore, future research may focus on collecting more external information, such as character representations and cross-lingual information, and maintaining more and higher quality CNER data sets like ENER, to improve the model performance of CNER (Liu et al , 2022).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In the future, a much more reliable glossary of names should be established by making use of the organizational registers and donation registers of local Chinese societies to enhance the recognition accuracy of named entities (Humbel et al , 2021; Xu et al , 2020). Furthermore, future research may focus on collecting more external information, such as character representations and cross-lingual information, and maintaining more and higher quality CNER data sets like ENER, to improve the model performance of CNER (Liu et al , 2022).…”
Section: Discussionmentioning
confidence: 99%
“…In text analysis of different languages, the research related to English NER (ENER) was developed early, but it is relatively less difficult to implement because only the characteristics of the words themselves need to be considered without the problem of word segmentation. In contrast, in the case of CNER, the inherent specificity of the text must first be analysed for further lexical analysis, which will present a higher technical challenge in comparison with ENER (Liu et al , 2022; Sun and Wang, 2010). Until recently, numerous digital humanities tools commonly applied NER to develop digital humanities tools to support digital humanities research.…”
Section: Literature Reviewmentioning
confidence: 99%
“…For dataset division, four datasets were involved in the experiment, namely KIWID, BOSON, ClueNER, and People's Daily. We obtained the public data according to Table 2 in study (Liu et al, 2022). This study randomly divided KIWID, BOSON, and ClueNER into training, validation, and test sets according to a ratio of 8:1:1, respectively [refer to Zhang et al (2021)].…”
Section: Dataset Divisionmentioning
confidence: 99%
“…As the amount of data increases, the workload of rule extraction increases, the difficulty of maintaining rule consistency increases, and rule‐based and dictionary‐based methods cannot address the heterogeneity and complexity of text and the thus achieve high GNER performance (Qiu, Xie, Wu, Tao, et al., 2019; Santoso et al., 2021). Compared with rule‐based methods, the statistical learning methods can learn from large amount of annotating training datasets to guide the recognition and extraction NER (Liu et al., 2022; Molina‐Villegas et al., 2021; Peng et al., 2021). The popular deep learning methods for named entity recognition generally base on word embeddings, which are able to learn similar representations for semantically or functionally similar words (Fang et al., 2021; Santoso et al., 2021; Tian et al., 2021).…”
Section: Introductionmentioning
confidence: 99%