2009 10th International Conference on Document Analysis and Recognition 2009
DOI: 10.1109/icdar.2009.271
|View full text |Cite
|
Sign up to set email alerts
|

A Realistic Dataset for Performance Evaluation of Document Layout Analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
59
0
2

Year Published

2010
2010
2023
2023

Publication Types

Select...
4
3
2

Relationship

2
7

Authors

Journals

citations
Cited by 110 publications
(61 citation statements)
references
References 2 publications
0
59
0
2
Order By: Relevance
“…It is used in high-profile applications such as evaluation datasets for layout analysis of contemporary documents [9], datasets and extensive evaluation infrastructure for historical documents (within the scope of the IMPACT project) as well as international competitions (ICDAR competition series) [10].…”
Section: Discussionmentioning
confidence: 99%
“…It is used in high-profile applications such as evaluation datasets for layout analysis of contemporary documents [9], datasets and extensive evaluation infrastructure for historical documents (within the scope of the IMPACT project) as well as international competitions (ICDAR competition series) [10].…”
Section: Discussionmentioning
confidence: 99%
“…The importance of the availability of realistic datasets for meaningful performance evaluation has been repeatedly discussed and the authors have addressed the issue for contemporary documents by creating the PRImA Layout Analysis dataset with ground truth [4] and making it available to all researchers. The overall dataset contains a wide selection of contemporary documents (with complex as well as simple layouts) together with comprehensive ground truth and extensive metadata.…”
Section: The Datasetmentioning
confidence: 99%
“…Since the 2009 edition of the ICDAR Page Segmentation competition a more extensive evaluation scheme has been used [3], allowing for higher level goal-oriented evaluation and much more detailed region comparison, going far beyond simple precision/recall metrics. In addition, the used datasets have been selected from curated repositories [4][5] containing realistic and representative documents. This edition (RDCL2017) is based on the same principles established and refined by the 2011, 2013, and 2015 competitions on historical and contemporary document layout analysis [6] but its focus is on documents with complex layouts.…”
Section: Introductionmentioning
confidence: 99%
“…For instance, Antonacopoulos et al 26 proposed a dataset to evaluate techniques of document layout analysis. That dataset contains 1, 240 images from websites, newspaper pages, magazines pages.…”
Section: Evaluation Datasetsmentioning
confidence: 99%