2021
DOI: 10.1016/j.heliyon.2021.e08115
|View full text |Cite
|
Sign up to set email alerts
|

Augmented words to improve a deep learning-based Indonesian syllabification

Abstract: Recent deep learning-based syllabification models generally give low error rates for high-resource languages with big datasets but sometimes produce high error rates for the low-resource ones. In this paper, two procedures: massive data augmentation and validation, are proposed to improve a deep learning-based syllabification, using a combination of bidirectional long short-term memory (BiLSTM), convolutional neural networks (CNN), and conditional random fields (CRF) for a low-resource Indonesian language. The… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 20 publications
0
1
0
Order By: Relevance
“…According to Suyanto et al (2021), rule-based systems perform better for low-resourced languages with limited to no gold standard corpora. This was also demonstrated in the case of Sesotho where the rule-based system outperformed the T E Xbased patterning system (Sibeko and van Zaanen, 2022a).…”
Section: Orthographiesmentioning
confidence: 99%
“…According to Suyanto et al (2021), rule-based systems perform better for low-resourced languages with limited to no gold standard corpora. This was also demonstrated in the case of Sesotho where the rule-based system outperformed the T E Xbased patterning system (Sibeko and van Zaanen, 2022a).…”
Section: Orthographiesmentioning
confidence: 99%