2020
DOI: 10.1017/s135132492000011x
|View full text |Cite
|
Sign up to set email alerts
|

Compression versus traditional machine learning classifiers to detect code-switching in varieties and dialects: Arabic as a case study

Abstract: The occurrence of code-switching in online communication, when a writer switches among multiple languages, presents a challenge for natural language processing tools, since they are designed for texts written in a single language. To answer the challenge, this paper presents detailed research on ways to detect code-switching in Arabic text automatically. We compare the prediction by partial matching (PPM) compression-based classifier, implemented in Tawa, and a traditional machine learning classifier sequentia… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 22 publications
0
1
0
Order By: Relevance
“…In this way, the speed of data extraction of each part is increased, and the grouping analysis algorithm is adopted to provide the targeted output of relevant languages. is step can be carried out before language communication, so as to greatly improve the speed of language conversion [9].…”
Section: Introductionmentioning
confidence: 99%
“…In this way, the speed of data extraction of each part is increased, and the grouping analysis algorithm is adopted to provide the targeted output of relevant languages. is step can be carried out before language communication, so as to greatly improve the speed of language conversion [9].…”
Section: Introductionmentioning
confidence: 99%