Proceedings of the 28th International Conference on Computational Linguistics: System Demonstrations 2020
DOI: 10.18653/v1/2020.coling-demos.1
|View full text |Cite
|
Sign up to set email alerts
|

Ve’rdd. Narrowing the Gap between Paper Dictionaries, Low-Resource NLP and Community Involvement

Abstract: We present an open-source online dictionary editing system, Ve rdd, that offers a chance to reevaluate and edit grassroots dictionaries that have been exposed to multiple amateur editors. The idea is to incorporate community activities into a state-of-the-art finite-state language description of a seriously endangered minority language, Skolt Sami. Problems involve getting the community to take part in things above the pencil-and-paper level. At times, it seems that the native speakers and the dictionary orien… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0
1

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1
1

Relationship

4
2

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 7 publications
0
8
0
1
Order By: Relevance
“…At the current, stage our dictionary editing system, Ve rdd [4,3], contains words for multiple endangered languages and their translations in a graph structure. This data could be extended by predicting new relations into the graph with semantic models such as word embeddings.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…At the current, stage our dictionary editing system, Ve rdd [4,3], contains words for multiple endangered languages and their translations in a graph structure. This data could be extended by predicting new relations into the graph with semantic models such as word embeddings.…”
Section: Discussionmentioning
confidence: 99%
“…In the UD we are dealing with, translation sentences might appear in the comments. The UD of the endangered languages can be obtained directly from Universal Dependencies' website 4 . At the time of writing, 1,690, 167, 104 and 435 sen-tences were in Erzya's [44], Moksha's [42], Skolt Sami's [30] and Komi-Zyrian's UDs 5 [32], respectively.…”
Section: Universal Dependenciesmentioning
confidence: 99%
“…Our system Ve rdd [1] was the reason I got an opportunity to visit the Sami Culture Center Sajos 5 in Inari, Finland to collaborate with two Skolt Sami dictionary editors. Skolt Sami (sms) is a severely endangered language with only 300 native speakers according to UNESCO.…”
Section: Endangered But How Endangered?mentioning
confidence: 99%
“…It is noteworthy, that mostly, in lexicography and grammar, tests, are nothing more than example word forms and their preferred annotations, be it grammatical analysis or perhaps spelling corrections, this, naturally, is in and of itself interesting for a linguist even without the testing feature. As a recent feature for the documentation comments, the output format has been updated to GitHub's github-pages format and a prototypical examples can be found at the time of writing in the GiellaLT github space 18 .…”
Section: Documentationmentioning
confidence: 99%