2020
DOI: 10.1609/aaai.v34i10.7233
|View full text |Cite
|
Sign up to set email alerts
|

Using Chinese Glyphs for Named Entity Recognition (Student Abstract)

Abstract: Most Named Entity Recognition (NER) systems use additional features like part-of-speech (POS) tags, shallow parsing, gazetteers, etc. Adding these external features to NER systems have been shown to have a positive impact. However, creating gazetteers or taggers can take a lot of time and may require extensive data cleaning. In this work instead of using these traditional features we use lexicographic features of Chinese characters. Chinese characters are composed of graphical components called radicals and th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 14 publications
(8 citation statements)
references
References 2 publications
0
5
0
Order By: Relevance
“…This indicates that Simplified Lattice does not exploit the full potential of BERT-wwm-ext. GLYNN [21] improved the performance of Chinese NER by using a CNN encoder to integrate glyph features from character images, indicating that Chinese NER can benefit from the pictogram features. The SLRL-NER model integrates lexicon, character, and radical-level features leads to 0.47 and 0.25 increments of F1 score over the state of-the-art model on OntoNotes 4.0 and Resume, respectively.…”
Section: B Experimental Resultsmentioning
confidence: 99%
“…This indicates that Simplified Lattice does not exploit the full potential of BERT-wwm-ext. GLYNN [21] improved the performance of Chinese NER by using a CNN encoder to integrate glyph features from character images, indicating that Chinese NER can benefit from the pictogram features. The SLRL-NER model integrates lexicon, character, and radical-level features leads to 0.47 and 0.25 increments of F1 score over the state of-the-art model on OntoNotes 4.0 and Resume, respectively.…”
Section: B Experimental Resultsmentioning
confidence: 99%
“…The interactive knowledge between the glyph and context is ignored. Second, the entity type and quantity of modern Chinese are far richer and more complex than those of ancient Chinese, it has been proven that historical texts are meaningless to NER, to some extent [ 25 ].…”
Section: Related Workmentioning
confidence: 99%
“…Meng et al [36] proposed a Tianzige-CNN to capture the graphic features of the Traditional Chinese that contain more pictographic information. Song and Sehanobish [29] also proposed Table 9: An example from OntoNotes4.0 dataset. Characters colored in blue and red represent the correct and incorrect recognized entity, respectively.…”
Section: Related Workmentioning
confidence: 99%