Proceedings of the Second Workshop on Universal Dependencies (UDW 2018) 2018
DOI: 10.18653/v1/w18-6015
|View full text |Cite
|
Sign up to set email alerts
|

The First Komi-Zyrian Universal Dependencies Treebanks

Abstract: Two Komi-Zyrian treebanks were included in the Universal Dependencies 2.2 release. This article contextualizes the treebanks, discusses the process through which they were created, and outlines the future plans and timeline for the next improvements. Special attention is paid to the possibilities of using UD in the documentation and description of endangered languages.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
4
1
1

Relationship

2
8

Authors

Journals

citations
Cited by 23 publications
(22 citation statements)
references
References 10 publications
0
21
0
1
Order By: Relevance
“…Treebanks provide a valuable resource for both linguistic and language technology research. The work on Komi-Permyak (Rueter, Partanen & Ponomareva, 2020) and Komi-Zyrian (Partanen et al, 2018) treebanks have been coordinated so that they also contain parallel sentences. Additionally, the tagging schemes are aligned with one another.…”
Section: Related Workmentioning
confidence: 99%
“…Treebanks provide a valuable resource for both linguistic and language technology research. The work on Komi-Permyak (Rueter, Partanen & Ponomareva, 2020) and Komi-Zyrian (Partanen et al, 2018) treebanks have been coordinated so that they also contain parallel sentences. Additionally, the tagging schemes are aligned with one another.…”
Section: Related Workmentioning
confidence: 99%
“…[24,25,26]). Viime vuosina vastaavia oikoluettuja aineistoja on julkaistu laajemminkin [18,21], ja osasta on alettu muodostaa korpuksia [17] sekä puupankkeja [19]. Tekstintunnistuksesta alkavalla työllä voi siis osoittaa olevan kauaskantoisia vaikutuksia, ja tällaiset materiaalit otetaan nopeasti uuden tutkimuksen raakaaineiksi.…”
Section: Aiempi Tutkimusunclassified
“…Finally, I aim to have wide coverage of Uralic languages in the Universal Dependency project treebanks, and further study and experiment in the state-of-the-art methodology in large variety of NLP and typological research topics that have been empowered by the project. At the moment there are 6 Uralic treebanks available: Finnish (Haverinen et al, 2014;Voutilainen et al, 2012), Estonian (Muischnek et al, 2016), Hungarian (Vincze et al, 2010), North Saami (Sheyanova and Tyers, 2017), Komi (Partanen et al, 2018), and Erzya (Rueter and Tyers, 2018), out of some 30 that can easily have treebanks.…”
Section: Introductionmentioning
confidence: 99%