Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage 2017
DOI: 10.1145/3078081.3078093

Digitization of Old Romanian Texts Printed in the Cyrillic Script

Cited by 3 publications (4 citation statements)
references
References 0 publications
“…For this, a very useful tool has been created and, for the first time ever, seventeenth-century Cyrillic Romanian letters have been made editable by an optical character recognizer (OCR) built in Chisinau (Colesnicov et al, 2016), (Cojocaru et al, 2017).…”
Section: Discussion (mentioning)
confidence: 99%
“…These texts were cleaned of the meta-text, specialists comments and notes, then processed by a concordancer program that builds indexes and makes statistics of the number of occurrences for each form in all the 500 books in text format opened. (Cojocaru et al, 2017). In the case of the Romanian language written with Cyrillic, we used the same lexicon for the Old-Romanian POS-tagger and for the OCR program, having two variants, with the word forms written in Latin letters and written in Cyrillic letters, trying to help our colleagues to increase the performances of the OCR for Romanian Cyrillic letters.…”
Section: The Lexicon For the Old Ro Pos-tagger (mentioning)
confidence: 99%
“…The letters differ from one book to another and there are many transitional alphabets with mixed Latin and Cyrillic letters. Our colleagues from the Institute of Mathematics and Computer Science of the Academy of Sciences of Moldova are now building an OCR which began operating satisfactorily for some texts (Cojocaru et al, 2017).…”
Section: The Lexicon For the Old Ro Pos-tagger (mentioning)
confidence: 99%
“…The researchers of Chisinau have processed the print in two steps. In the first step, they obtained an editable form of the book with Old Cyrillic characters (Cojocaru et al, 2017). The computational linguists of Iasi have remarked that three characters are not in the "times extro" set.…”
Section: The Ocr Program For the Old Cyrillic Books (mentioning)
confidence: 99%