Proceedings of the Eighth SIGHAN Workshop on Chinese Language Processing 2015
DOI: 10.18653/v1/w15-3121
|View full text |Cite
|
Sign up to set email alerts
|

NTOU Chinese Spelling Check System in Sighan-8 Bake-off

Abstract: This paper describes details of NTOU Chinese spelling check system in SIGHAN-8 Bakeoff. Besides the basic architecture of the previous system participating in last two CSC tasks, three new preference rules were proposed to deal with Simplified Chinese characters, variants, sentence-final particles, and DE-particles. A new sentence likelihood function was proposed based on frequencies of space-removed version of Google n-gram datasets. Two formal runs were submitted where the best one was created by the system … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2019
2019
2025
2025

Publication Types

Select...
3
3
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(14 citation statements)
references
References 17 publications
(13 reference statements)
0
14
0
Order By: Relevance
“…The early days of CSC focused on error detection and correction by means of rules, the design of which required linguistic knowledge and feature engineering [8]. Lack of migration and generalization capabilities.…”
Section: Related Work 21 Chinese Spell Checkingmentioning
confidence: 99%
“…The early days of CSC focused on error detection and correction by means of rules, the design of which required linguistic knowledge and feature engineering [8]. Lack of migration and generalization capabilities.…”
Section: Related Work 21 Chinese Spell Checkingmentioning
confidence: 99%
“…We compare REALISE with the following baselines: KUAS , NTOU (Chu and Lin, 2015), NCTU-NTUT (Wang and Liao, 2015), HanSpeller++ , LMC (Xie et al, 2015) mainly utilize heuristics or traditional machine learning algorithms, such as n-gram language model, Conditional Random Field and Hidden Markov Model. Sequence Labeling (Wang et al, 2018) treats CSC as a sequence labeling problem and applies a BiLSTM model.…”
Section: Baselinesmentioning
confidence: 99%
“…The CSC task is to detect and correct spelling errors in Chinese sentences. Early works design various rules to deal with different errors Chu and Lin, 2015). Next, traditional machine learning algorithms are brought to this field, such as Conditional Random Field and Hidden Markov Model (Wang and Liao, 2015;.…”
Section: Chinese Spell Checkingmentioning
confidence: 99%
“…Transfer Learning using BERT Spell check requires analyzing the syntactic plausibility of sentences. In Chinese, a common spelling mistake is confusing the pairs 的 and 地 (Chu and Lin, 2015). Both has the same pronunciation ("de").…”
Section: Improving Chinese Spell Check Bymentioning
confidence: 99%