2021
DOI: 10.1016/j.eij.2020.12.001
|View full text |Cite
|
Sign up to set email alerts
|

Synthetic data with neural machine translation for automatic correction in arabic grammar

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
17
0

Year Published

2021
2021
2025
2025

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(17 citation statements)
references
References 20 publications
0
17
0
Order By: Relevance
“…The traditional machine translation method is the SMT model, which counts information such as word pairs, parallel phrase pairs, and parallel syntactic structures from large-scale parallel corpus to establish a statistical model for the translation process. The research methods mainly include word-based statistical methods [12], phrase-based statistical methods [13], and syntactic structure-based statistical methods [14]. Among them, the phrase-based statistical machine translation takes phrases (that is, any consecutive words) as the basic translation unit, which can well solve the dependency relationship between the local contexts of sentences, and the translation quality has been greatly improved compared with the word-based statistical method.…”
Section: Introductionmentioning
confidence: 99%
“…The traditional machine translation method is the SMT model, which counts information such as word pairs, parallel phrase pairs, and parallel syntactic structures from large-scale parallel corpus to establish a statistical model for the translation process. The research methods mainly include word-based statistical methods [12], phrase-based statistical methods [13], and syntactic structure-based statistical methods [14]. Among them, the phrase-based statistical machine translation takes phrases (that is, any consecutive words) as the basic translation unit, which can well solve the dependency relationship between the local contexts of sentences, and the translation quality has been greatly improved compared with the word-based statistical method.…”
Section: Introductionmentioning
confidence: 99%
“…In the Decoder phase, the model still uses GRU units. The hidden layer state in the decoding network is called d = { d 1 , d 2 ,…, d Ty }, and the probability distribution information of the current output sequence y 1 , y 2 ,…, y Ty is as follows [ 25 ]: g is a function of the nonlinear multi-layer structure. The user calculates the probability distribution information of output y i .…”
Section: Methodsmentioning
confidence: 99%
“…Solyman et al [73] proposed an unsupervised method to generate large-scale synthetic training data to overcome the challenge of the scarcity of training data for Arabic. The method is based on confusion function to increase the amount of training set.…”
Section: ) Error Analysismentioning
confidence: 99%
“…Some research investigated MT samples with native speakers so they could review the linguistic aspects of MT errors [13], [40]; other research works used neural networks to detect errors [41], [73] or to correct them [42].…”
Section: ) Error Analysismentioning
confidence: 99%