2021
DOI: 10.1075/ijlcr.20005.rub
|View full text |Cite
|
Sign up to set email alerts
|

Assessing the impact of automatic dependency annotation on the measurement of phraseological complexity in L2 Dutch

Abstract: The extraction of phraseological units operationalized in phraseological complexity measures (Paquot, 2019) relies on automatic dependency annotations, yet the suitability of annotation tools for learner language is often overlooked. In the present article, two Dutch dependency parsers, Alpino (van Noord, 2006) and Frog (van den Bosch et al., 2007), are evaluated for their performance in automatically annotating three types of dependency relations (verb + direct object, adjectival modifier, and adverbial modif… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 23 publications
0
4
0
Order By: Relevance
“…We will first perform the annotation of verbal tenses in both corpora using our script. However, since automatic language processing tools are developed on the basis of well-formed data, it is to be expected that learner corpora, due to the inclusion of errors, will lead to annotation errors (Granger, 2011;Štindlová et al, 2011;Krivanek and Meurers, 2013;Rubin, 2021;. Therefore, we decided to also manually annotate the learner corpus.…”
Section: Manual Annotationmentioning
confidence: 99%
“…We will first perform the annotation of verbal tenses in both corpora using our script. However, since automatic language processing tools are developed on the basis of well-formed data, it is to be expected that learner corpora, due to the inclusion of errors, will lead to annotation errors (Granger, 2011;Štindlová et al, 2011;Krivanek and Meurers, 2013;Rubin, 2021;. Therefore, we decided to also manually annotate the learner corpus.…”
Section: Manual Annotationmentioning
confidence: 99%
“…For more precise insights into linguistic development, some studies have constrained the lexical combinations that are used (e.g., adjective + noun or noun + noun combinations). Even more recently, researchers have begun to use dependency parses to analyze lexical items in particular grammatical relationships (e.g., verb + object; Kyle & Eguchi, 2021;Paquot, 2018Paquot, , 2019Rubin, 2021). Lexicogrammatical features: A number of studies have investigated the relationship between L2 proficiency and the use of lexicogrammatical features that are common in academic writing such as various types of noun phrase elaboration (e.g., Biber et al, 2014;Grant & Ginther, 2000;Picoral et al, 2021).…”
Section: Lexical Bigramsmentioning
confidence: 99%
“…evaluated the accuracy of verb argument construction identification using a sample of 100 sentences from a corpus of L2 essays. Similar procedures have been used in a number of other studies (e.g., Díez-Bedmar & Pérez-Paredes, 2020;Paquot, 2019;Rubin, 2021). While small-scale accuracy analyses are important for establishing the effectiveness of particular linguistic analysis tools for a particular data set, these datasets are rarely made publicly available and do not necessarily follow the annotation guidelines or formatting conventions of well-known treebanks.…”
Section: Evaluations Of System Performance On L2 Datamentioning
confidence: 99%
“…We will first perform the annotation of verbal tenses in both corpora using our script. However, since automatic language processing tools are developed on the basis of well-formed data, it is to be expected that learner corpora, due to the inclusion of errors, will lead to annotation errors (Granger, 2011;Štindlová et al, 2011;Krivanek and Meurers, 2013;Rubin, 2021;Volodina et al, 2022). Therefore, we decided to also manually annotate the learner corpus.…”
Section: Manual Annotationmentioning
confidence: 99%