Proceedings of the Workshop on Language Technology for Digital Humanities in Central and (South-)Eastern Europe 2017
DOI: 10.26615/978-954-452-046-5_002
|View full text |Cite
|
Sign up to set email alerts
|

Tools for Building a Corpus to Study the Historical and Geographical Variation of the Romanian Language

Abstract: Contemporary standard language corpora are ideal for NLP. There are few morphologically and syntactically annotated corpora for Romanian, and those existing or in progress only deal with the Contemporary Romanian standard. However, the necessity to study the dynamics of natural languages gave rise to balanced corpora, containing non-standard texts. In this paper, we describe the creation of tools for processing non-standard Romanian to build a big balanced corpus. We want to preserve in annotated form as many … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 7 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?