2022
DOI: 10.3390/jimaging8100285
|View full text |Cite
|
Sign up to set email alerts
|

CorDeep and the Sacrobosco Dataset: Detection of Visual Elements in Historical Documents

Abstract: Recent advances in object detection facilitated by deep learning have led to numerous solutions in a myriad of fields ranging from medical diagnosis to autonomous driving. However, historical research is yet to reap the benefits of such advances. This is generally due to the low number of large, coherent, and annotated datasets of historical documents, as well as the overwhelming focus on Optical Character Recognition to support the analysis of historical documents. In this paper, we highlight the importance o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 48 publications
0
6
0
Order By: Relevance
“…Electronic copies of these books are available via the project's database, comprising over 70,000 pages, 23,000 of which contain visual elements. These visual elements were collected both manually and with the help of neural networks (Büttner et al, 2022). The Sphaera dataset is stored in a large knowledge graph modeled according to the CIDOC-CRM standards (Bekiari et al, 2021), where information about the editions, as well as fine-grained information about their content is stored (Kräutli & Valleriani, 2018;El-Hajj et al, 2022).…”
Section: Datasetmentioning
confidence: 99%
See 3 more Smart Citations
“…Electronic copies of these books are available via the project's database, comprising over 70,000 pages, 23,000 of which contain visual elements. These visual elements were collected both manually and with the help of neural networks (Büttner et al, 2022). The Sphaera dataset is stored in a large knowledge graph modeled according to the CIDOC-CRM standards (Bekiari et al, 2021), where information about the editions, as well as fine-grained information about their content is stored (Kräutli & Valleriani, 2018;El-Hajj et al, 2022).…”
Section: Datasetmentioning
confidence: 99%
“…The second added class refers to illustrations of material objects, namely those which one can refer to as machines. This data was collected from Branca (1629); Zonca (1607); Ramelli (1588); Besson (1595) using CorDeep (https://cordeep.mpiwg-berlin.mpg.de), a web service designed to extract and classify visual elements from historical documents (Büttner et al, 2022). In total, the dataset contains 5,879 pages distributed across the four classes, as shown in Table 1; each class is represented by a single sample in Fig.…”
Section: Datasetmentioning
confidence: 99%
See 2 more Smart Citations
“…Afterwards, the results were injected into Kraken [23] to generate text lines and the OCR output. Likewise, Büttner et al [24] applied a YoloV5 network to detect graphical elements, such as initials, decorations, printer's marks, or content illustrations, in different historical documents.…”
Section: Instance-level Segmentation For Page Layout Analysismentioning
confidence: 99%