2021
DOI: 10.1007/s10579-021-09558-0

Resources for Turkish dependency parsing: introducing the BOUN Treebank and the BoAT annotation tool

Cited by 17 publications
(8 citation statements)
References 28 publications
“…This treebank contains linguistic examples from a grammar book to increase the coverage of different morphosyntactic constructions while minimizing the annotation effort. Two relatively larger and more recent dependency treebanks are the Boğaziçi University (BOUN) treebank (Türk et al, 2021) and the Turkish web treebank (TWT, Kayadelen et al, 2020). The BOUN treebank annotates a selection of sentences from the TNC (Aksan et al, 2012, see Section 2.1) covering a number of different text types.…”
Section: Treebank (mentioning)
confidence: 99%
“…It uses a validation script developed by UD to display errors. BoAT-v1 was used to create the BOUN Treebank [2,8], a manually annotated Turkish dependency treebank comprising close to 10 thousand sentences.…”
Section: BoAT-v1 (mentioning)
confidence: 99%
“…Annotation tools with drag-drop and mouse-based interfaces, although appealing, are not well suited for agglutinative languages as they require alternating among input modalities, disrupting the flow. BoAT-v1 [2] is an annotation tool that was developed to support dependency annotation of morphologically rich languages (MRLs) to produce treebanks compliant with the UD framework [1]. The experience during the use of it revealed several points of improvement for such annotation tools.…”
Section: Introduction (mentioning)
confidence: 99%
“…We evaluated the Stanford's neural parser as the baseline system and the proposed hybrid parser with rule-based and morphology-based enhancement methods on the IMST-UD Treebank, the Turkish PUD Treebank [51], and the BOUN Treebank [52] which is a newly introduced treebank for Turkish. In all of the experiments, the default set of parameters are used for the deep network that produces the parse trees.…”
Section: Experiments, A. Experimental Settings (mentioning)
confidence: 99%
“…The training set of the IMST-UD Treebank consists of 3,685 sentences. To be able to observe the effect of the gradual increase of the training data more accurately, we additionally used the BOUN Treebank [52], a newly introduced Turkish treebank annotated in UD style. Being the largest dependency treebank in Turkish, the BOUN Treebank includes a total of 9,761 manually annotated sentences (7,803 training, 979 development, and 979 test sentences) from various topics including biographical texts, national newspapers, instructional texts, popular culture articles, and essays.…”
Section: The Effect Of Training Data Size (mentioning)
confidence: 99%