Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 2021
DOI: 10.18653/v1/2021.findings-acl.75
|View full text |Cite
|
Sign up to set email alerts
|

Can Cognate Prediction Be Modelled as a Low-Resource Machine Translation Task?

Abstract: Cognate prediction is the task of generating, in a given language, the likely cognates of words in a related language, where cognates are words in related languages that have evolved from a common ancestor word. It is a task for which little data exists and which can aid linguists in the discovery of previously undiscovered relations. Previous work has applied machine translation (MT) techniques to this task, based on the tasks' similarities, without, however, studying their numerous differences or optimising … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 33 publications
0
5
0
Order By: Relevance
“…The new framework has the advantage of being easy to use, easy to extend, and fast to apply, while at the same time yielding promising results on a newly compiled collection of datasets from three different languages families. Given that our framework can be easily extended, by varying the individual components of the worfklow, we hope that it will provide a solid basis for future work on phonological reconstruction, as well as the prediction of words from cognate reflexes (Bodt and List, 2022;Dekker and Zuidema, 2021;Beinborn et al, 2013;Fourrier et al, 2021) in computational historical linguistics.…”
Section: Discussionmentioning
confidence: 99%
“…The new framework has the advantage of being easy to use, easy to extend, and fast to apply, while at the same time yielding promising results on a newly compiled collection of datasets from three different languages families. Given that our framework can be easily extended, by varying the individual components of the worfklow, we hope that it will provide a solid basis for future work on phonological reconstruction, as well as the prediction of words from cognate reflexes (Bodt and List, 2022;Dekker and Zuidema, 2021;Beinborn et al, 2013;Fourrier et al, 2021) in computational historical linguistics.…”
Section: Discussionmentioning
confidence: 99%
“…If a method has systematic errors but otherwise does a good job in prediction, B-Cubed F-Scores penalize results less strongly than edit distance. As a final evaluation score, we followed Fourrier et al (2021) in providing BLEU scores (Papineni et al, 2002). These scores are usually used to investigate how well an automated translation corresponds to the translated target test.…”
Section: Discussionmentioning
confidence: 99%
“…The task can be seen as a form of zero-shot learning (Xian et al, 2018), where a model must learn to predict the "reflexes" of a potentially unknown ancestral word form, with no examples of the relevant cognate set provided during the training phase. When considering the landscape of machine learning methods available and the approaches so far proposed (Dinu and Ciobanu, 2014;Bodt and List, 2022;Meloni et al, 2021;Beinborn et al, 2013;Dekker and Zuidema, 2021;Fourrier et al, 2021;List et al, 2022a), including other submissions to this challenge (Jäger, 2022;Celano, 2022;Kirov et al, 2022), it is possible to identify two main strategies for the task. The first one treats the problem as one of classification, potentially refining sequence results with probabilities from a character model, while the second employs sequence transformation methods, especially those akin to seq2seq approaches (Sutskever et al, 2014), making the task one analogous to that of "translation".…”
Section: Introductionmentioning
confidence: 99%