Proceedings of the 11th Forum for Information Retrieval Evaluation 2019
DOI: 10.1145/3368567.3368579
|View full text |Cite
|
Sign up to set email alerts
|

Code-Mixed to Monolingual Translation Framework

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…As for work on CS MT, there are many efforts (Sinha and Thakur, 2005;Dhar et al, 2018;Mahata et al, 2019;Menacer et al, 2019;Song et al, 2019;Tarunesh et al, 2021;Xu and Yvon, 2021;Chen et al, 2022;Hamed et al, 2022c). To the best of our knowledge, none of these efforts presented an extensive comparison covering different segmentation techniques.…”
Section: Related Workmentioning
confidence: 99%
“…As for work on CS MT, there are many efforts (Sinha and Thakur, 2005;Dhar et al, 2018;Mahata et al, 2019;Menacer et al, 2019;Song et al, 2019;Tarunesh et al, 2021;Xu and Yvon, 2021;Chen et al, 2022;Hamed et al, 2022c). To the best of our knowledge, none of these efforts presented an extensive comparison covering different segmentation techniques.…”
Section: Related Workmentioning
confidence: 99%
“…Menacer et al [9] used this approach to translate Arabic-English code-mixed text into English. Mahata et al [15] used deep learning for Bengali-English code-mixed normalisation, employing a character-level long-short term memory network to perform language identification. Next, the English text was translated using a neural MT system, whilst the Bengali text was transliterated back to its Devanagri form.…”
Section: A Code-mixed Text Normalisationmentioning
confidence: 99%
“…The translation module translates codemixed text to monolingual text -here, Indonesian-English code-mixed text to Indonesian-language text. We slightly modified the MLF approach used by Barik et al by grouping neighbouring tokens with the same language in the text as one language segment; thus the translation is not performed for each token, but for each segment, similar to the approach taken in [15]. Because Barik et al separately applied translation to individual tokens, it potentially produces incorrect translation results since it prevents the MT system to know the correct context of the token.…”
Section: ) Translationmentioning
confidence: 99%
“…LID is critical for some subsequent natural language processing tasks in code-mixed documents ( Gundapu & Mamidi, 2018 ). Applying LID in the code-mixed text has become a foundation work of various NLP systems, including sentiment analysis ( Ansari & Govilkar, 2018 ; Mahata, Das & Bandyopadhyay, 2021 ), translation ( Barik, Mahendra & Adriani, 2019 ; Mahata et al, 2019 ), and emotion classification ( Yulianti et al, 2021 ). The absence of LID in pre-processing tasks can affect those NLP systems.…”
Section: Introductionmentioning
confidence: 99%