Chinese part‐of‐speech (POS) tagging is an essential task for Chinese downstream natural language processing tasks. The accuracy of the Chinese POS task will drop dramatically by word‐based methods because of the segmentation errors and the word sparsity. Also, there are several Chinese POS tagging sets with different criteria. Some of them only have a small‐scale annotated corpus and are hard to train. To this end, we propose a modified word‐based transformer neural network architecture. Meanwhile, we utilize an adversarial transfer learning method that splits the architecture into shared and private parts. This work directly improves the ability of the word‐based model, instead of adopting a joint character‐based method. Extensive experiments show that our method achieves state‐of‐the‐art performance on all datasets, and more importantly, our method improves performance effectively for the word‐based Chinese sequence labeling task.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.