2011
DOI: 10.1016/j.ins.2010.09.008

Integrating unsupervised and supervised word segmentation: The role of goodness measures

citations
Cited by 50 publications
(27 citation statements)
references
References 28 publications
“…Therefore, we used the sentences from the NTCIR-8 JE test set as the development set for JE task. The word segmentation was done by BaseSeg (Zhao et al, 2006; Zhao and Kit, 2008; Zhao and Kit, 2011; Zhao et al, 2013) for Chinese and Mecab 2 for Japanese.…”
Section: Methods (mentioning)
confidence: 99%
“…Therefore, we used the sentences from the NTCIR-8 JE test set as the development set. Word segmentation was done by BaseSeg (Zhao et al, 2006; Zhao and Kit, 2008; Zhao and Kit, 2011; for Chinese and Mecab 2 for Japanese. To learn the classifiers for each translation task, the training set and development set were put together to obtain symmetric word alignment using GIZA++ (Och and Ney, 2003) and the grow-diag-final-and heuristic (Koehn et al, 2003).…”
Section: Methods (mentioning)
confidence: 99%
“…Two key techniques, word segmentation (Zhao et al, 2006a; Zhao and Kit, 2008b; Zhao et al, 2006b; Zhao and Kit, 2008a; Zhao and Kit, 2007; Zhao and Kit, 2011; Zhao et al, 2010) and language model (LM), are also popularly used for C-SC. Most of those approaches can fall into four categories.…”
Section: Related Work (mentioning)
confidence: 99%