2020
DOI: 10.3390/sym12101673
|View full text |Cite
|
Sign up to set email alerts
|

POISE: Efficient Cross-Domain Chinese Named Entity Recognization via Transfer Learning

Abstract: To improve the performance of deep learning methods in case of a lack of labeled data for entity annotation in entity recognition tasks, this study proposes transfer learning schemes that combine the character to be the word to convert low-resource data symmetry into high-resource data. We combine character embedding, word embedding, and the embedding of the label features using high- and low-resource data based on the BiLSTM-CRF model, and perform the feature-transfer and parameter-sharing tasks in two domain… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 30 publications
0
5
0
Order By: Relevance
“…In order to evaluate the performance of NEFTL-Boud model in migrating data to complete the task of named entity recognition, experiments will be compared with the following four Chinese NER methods or models based on migration learning: (1) POISE [27]. (2) NER-CWS [24].…”
Section: Baseline Comparison Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to evaluate the performance of NEFTL-Boud model in migrating data to complete the task of named entity recognition, experiments will be compared with the following four Chinese NER methods or models based on migration learning: (1) POISE [27]. (2) NER-CWS [24].…”
Section: Baseline Comparison Methodsmentioning
confidence: 99%
“…1. ACL paper data set: this data set is the resume data collected in the paper ACL 2018 Chinese NER using Lattice LSTM [27], and the format of the data is shown in Table 4. Each line is composed of a word and its corresponding labels, with tabs as separators, and sentences are separated by a blank line.…”
Section: Datasetmentioning
confidence: 99%
“…B is the beginning of the entity, I is the non-beginning part of the entity and O is the non-entity part. There are seven types of tags to be predicted, namely, I-PER, I-ORG, I-LOC, B-PER, B-ORG, B-LOC and O [34].…”
Section: Labeling Methods and Model Evaluation Indexmentioning
confidence: 99%
“…Therefore, transfer learning is proposed to bridge this gap. Transfer learning involves using pre-trained models from images in other domains and fine-tuning the parameters of the model on the target dataset to make it applicable to the target task [19,20]. This process is widely used in the field of medical image analysis [21].…”
Section: Introductionmentioning
confidence: 99%