“…Chinese word segmentation was performed before applying character sequence labeling (Guo et al., 2004; Mao et al., 2008; Zhu and Wang, 2019). The pre-processing segmentation features included, among others, character positional embeddings (Peng and Dredze, 2015; He and Sun, 2017a,b), segmentation tags (Zhu and Wang, 2019), and word embeddings (Peng and Dredze, 2015; Liu et al., 2019; E and Xiang, 2017). The other was to train the NER and CWS tasks jointly, incorporating task-shared word boundary information from CWS into NER (Xu et al., 2013; Peng and Dredze, 2016; Cao et al., 2018).…”
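
To make the idea of segmentation tags as a pre-processing feature concrete, here is a minimal sketch of how a word-segmented sentence could be turned into per-character position tags for a character-level NER tagger. The BMES scheme (Begin, Middle, End, Single), the function name `bmes_tags`, and the example sentence are assumptions chosen for illustration; the cited works may use a different tag set or encoding.

```python
from typing import List, Tuple


def bmes_tags(words: List[str]) -> List[Tuple[str, str]]:
    """Map a segmented sentence to (character, position tag) pairs.

    Hypothetical helper: assumes the BMES scheme, where
    B = first character of a multi-character word,
    M = interior character, E = last character,
    S = single-character word.
    """
    tagged: List[Tuple[str, str]] = []
    for word in words:
        if len(word) == 1:
            tagged.append((word, "S"))
            continue
        last = len(word) - 1
        for i, ch in enumerate(word):
            if i == 0:
                tag = "B"
            elif i == last:
                tag = "E"
            else:
                tag = "M"
            tagged.append((ch, tag))
    return tagged


if __name__ == "__main__":
    # Output of some upstream word segmenter (assumed, not from the source).
    segmented = ["南京市", "长江大桥", "很", "长"]
    for ch, tag in bmes_tags(segmented):
        print(ch, tag)
```

In a pipeline of the first kind described above, each tag would typically be looked up in a small embedding table and concatenated with the character embedding before sequence labeling; that wiring is omitted here.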