Proceedings of the Second SIGHAN Workshop on Chinese Language Processing - 2003
DOI: 10.3115/1119250.1119280
|View full text |Cite
|
Sign up to set email alerts
|

HHMM-based Chinese lexical analyzer ICTCLAS

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
174
0

Year Published

2005
2005
2019
2019

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 358 publications
(180 citation statements)
references
References 3 publications
1
174
0
Order By: Relevance
“…the one-month People's Daily Corpus of 1998, and the first version of Penn Chinese Treebank [15]. We compared our approach against two state-of-the-art systems: one is based on a bi-gram word segmentation model [7], and the other based on a wordbased hidden Markov model [3]. For simplicity, we only considered three kinds of unknown words (personal name, location name, and organization name) in the all methods.…”
Section: Experiments and Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…the one-month People's Daily Corpus of 1998, and the first version of Penn Chinese Treebank [15]. We compared our approach against two state-of-the-art systems: one is based on a bi-gram word segmentation model [7], and the other based on a wordbased hidden Markov model [3]. For simplicity, we only considered three kinds of unknown words (personal name, location name, and organization name) in the all methods.…”
Section: Experiments and Discussionmentioning
confidence: 99%
“…Chooi-Ling Goh et al [16], Jianfeng Gao et al [8] and Huaping Zhang [3] adopted character information to handle unknown words; X. Luo [11], Yao Meng [12] and Shengfen Luo [17] each presented characterbased parsing models for Chinese parsing or new-word extraction. T. Nakagawa used word-level information and character-level information for word segmentation [6].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…We utilized the free ICTCLAS [8] tool due to its mature performance, and the free NiuParser [9] tool due to its sophisticated models based on the statistic, such as Conditional Random Fields, Average Perceptron, Maximum Entropy, and Recurrent Neural Network.…”
Section: Methodsmentioning
confidence: 99%
“…ICT-CLAS [17] is adopted for segmentation and a stop word list including particle (e.g. , ), location words (e.g.…”
Section: Sentiment Featuresmentioning
confidence: 99%