2014
DOI: 10.5788/17-0-554
|View full text |Cite
|
Sign up to set email alerts
|

Dictionary Writing System (DWS) + Corpus Query Package (CQP): The Case of TshwaneLex

Abstract: Abstract:In this article the integrated corpus query functionality of the dictionary compilation software TshwaneLex is analysed. Attention is given to the handling of both raw corpus data and annotated corpus data. With regard to the latter it is shown how, with a minimum of human effort, machine learning techniques can be employed to obtain part-of-speech tagged corpora that can be used for lexicographic purposes. All points are illustrated with data drawn from English and Northern Sotho. The tools and techn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(1 citation statement)
references
References 6 publications
0
1
0
Order By: Relevance
“…Van Sterkenburg 2003: 195 ff. andDe Pauw 2007 on the development of digital resources). Even if electronic text corpora had been produced by scanning, they would have been of only limited value as the revised and standardised orthography of Khoekhoegowab had not yet established itself in the literature and lemmas would hence not have been recognised automatically.…”
Section: The Beginnings Of the Nama Dictionary Projectmentioning
confidence: 99%
“…Van Sterkenburg 2003: 195 ff. andDe Pauw 2007 on the development of digital resources). Even if electronic text corpora had been produced by scanning, they would have been of only limited value as the revised and standardised orthography of Khoekhoegowab had not yet established itself in the literature and lemmas would hence not have been recognised automatically.…”
Section: The Beginnings Of the Nama Dictionary Projectmentioning
confidence: 99%