2022
DOI: 10.3384/ecp190009
|View full text |Cite
|
Sign up to set email alerts
|

Exploring Linguistic Acceptability in Swedish Learners’ Language

Abstract: We present our initial experiments on binary classification of sentences into linguistically correct versus incorrect ones in Swedish using the DaLAJ dataset (Volodina et al., 2021a). The nature of the task is bordering on linguistic acceptability judgments, on the one hand, and on grammatical error detection task, on the other. The experiments include models trained with different input features and on different variations of the training, validation, and test splits. We also analyze the results focusing on d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 14 publications
0
0
0
Order By: Relevance
“…A number of derivative resources have been developed since 2016 based on the two SweLL corpora, such as wordlists for language learners -SweLLex [52] and later Sen*Lex [45] -for studies on lexical competences of L2 learners; DaLAJ [60] for studies on linguistic acceptability [61], CoDeRooMor [62] for studying derivational morphology of Swedish, MuClaGED [63] for error classification, synthetic datasets imitating real-life errors [64] and many others. The Swedish MultiGED dataset 11 based on SweLL-gold has been used for the MultiGED shared task [48] and we plan new shared tasks based on the SweLL corpora in the near future.…”
Section: Swell Impact: a Game Changer In Swedish L2?mentioning
confidence: 99%
“…A number of derivative resources have been developed since 2016 based on the two SweLL corpora, such as wordlists for language learners -SweLLex [52] and later Sen*Lex [45] -for studies on lexical competences of L2 learners; DaLAJ [60] for studies on linguistic acceptability [61], CoDeRooMor [62] for studying derivational morphology of Swedish, MuClaGED [63] for error classification, synthetic datasets imitating real-life errors [64] and many others. The Swedish MultiGED dataset 11 based on SweLL-gold has been used for the MultiGED shared task [48] and we plan new shared tasks based on the SweLL corpora in the near future.…”
Section: Swell Impact: a Game Changer In Swedish L2?mentioning
confidence: 99%
“…• advanced: C1 level (C2 missing in Coctaill). This version of DaLAJ is an official improved variant of the previously tested experimental version presented in Klezl et al (2022).…”
Section: Dataset Descriptionmentioning
confidence: 99%
“…Aceitabilidade linguística é a tarefa de determinar se uma sentenc ¸a está gramaticalmente correta. Essa tarefa vem do campo da linguística generativa [Klezl et al 2022] que se baseia em julgamentos intuitivos de falantes nativos sobre se uma sentenc ¸a é aceitável ou não [T Schütze 2016]. Essa tarefa possui diversas aplicac ¸ões na área de Processamento de Línguas Naturais (PLN), por exemplo: analisar a robustez de modelos de língua [Yin et al 2020] e verificar se tais modelos adquirem conhecimentos gramaticais [Zhang et al 2021, Choshen et al 2022.…”
Section: Introduc ¸ãOunclassified