Findings of the Association for Computational Linguistics: EMNLP 2023 (2023)
DOI: 10.18653/v1/2023.findings-emnlp.923

Dialect-to-Standard Normalization: A Large-Scale Multilingual Evaluation

Olli Kuparinen,
Aleksandra Miletić,
Yves Scherrer

Abstract: Text normalization methods have been commonly applied to historical language or user-generated content, but less often to dialectal transcriptions. In this paper, we introduce dialect-to-standard normalization (i.e., mapping phonetic transcriptions from different dialects to the orthographic norm of the standard variety) as a distinct sentence-level character transduction task and provide a large-scale analysis of dialect-to-standard normalization methods. To this end, we compile a multilingual dataset covering…

Cited by 3 publications
References 24 publications