Proceedings of ArabicNLP 2023 2023
DOI: 10.18653/v1/2023.arabicnlp-1.30
|View full text |Cite
|
Sign up to set email alerts
|

Arabic dialect identification: An in-depth error analysis on the MADAR parallel corpus

Helene Olsen,
Samia Touileb,
Erik Velldal

Abstract: This paper provides a systematic analysis and comparison of the performance of state-of-theart models on the task of fine-grained Arabic dialect identification using the MADAR parallel corpus. We test approaches based on pretrained transformer language models in addition to Naive Bayes models with a rich set of various features. Through a comprehensive data-and error analysis, we provide valuable insights into the strengths and weaknesses of both approaches. We discuss which dialects are more challenging to di… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 13 publications
0
0
0
Order By: Relevance