Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021
DOI: 10.18653/v1/2021.emnlp-main.291
|View full text |Cite
|
Sign up to set email alerts
|

A Fine-Grained Domain Adaption Model for Joint Word Segmentation and POS Tagging

Abstract: Domain adaption for word segmentation and POS tagging is a challenging problem for Chinese lexical processing. Self-training is one promising solution for it, which struggles to construct a set of high-quality pseudo training instances for the target domain. Previous work usually assumes a universal sourceto-target adaption to collect such pseudo corpus, ignoring the different gaps from the target sentences to the source domain. In this work, we start from joint word segmentation and POS tagging, presenting a … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 28 publications
0
0
0
Order By: Relevance
“…For example, Dong and Schäfer (2011) use an ensemble learning model for reliable newly labeled data selection in classification. Wang et al (2020) show that sampling strategies may vary across datasets, and Jiang et al (2021) propose using out-of-vocabulary numbers to measure source-target domain distance in Chinese word segmentation and POS tagging. In this work, we propose adopting grammar rules as a selection criterion and combining it with confidence-based selection.…”
Section: Related Workmentioning
confidence: 99%
“…For example, Dong and Schäfer (2011) use an ensemble learning model for reliable newly labeled data selection in classification. Wang et al (2020) show that sampling strategies may vary across datasets, and Jiang et al (2021) propose using out-of-vocabulary numbers to measure source-target domain distance in Chinese word segmentation and POS tagging. In this work, we propose adopting grammar rules as a selection criterion and combining it with confidence-based selection.…”
Section: Related Workmentioning
confidence: 99%