2011 International Conference on Document Analysis and Recognition 2011
DOI: 10.1109/icdar.2011.59
|View full text |Cite
|
Sign up to set email alerts
|

The SCRIBO Module of the Olena Platform: A Free Software Framework for Document Image Analysis

Abstract: Abstract-Electronic documents are being more and more usable thanks to better and more affordable network, storage and computational facilities. But in order to benefit from computeraided document management, paper documents must be digitized and analyzed. This task may be challenging at several levels. Data may be of multiple types thus requiring different adapted processing chains. The tools to be developed should also take into account the needs and knowledge of users, ranging from a simple graphical applic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 29 publications
(22 citation statements)
references
References 11 publications
0
22
0
Order By: Relevance
“…Therefore, the tools presented in this paper will be available in the next release of the Scribo module [21], as part of the Olena image processing platform [22], [23], along with an online demonstrator.…”
Section: Discussionmentioning
confidence: 99%
“…Therefore, the tools presented in this paper will be available in the next release of the Scribo module [21], as part of the Olena image processing platform [22], [23], along with an online demonstrator.…”
Section: Discussionmentioning
confidence: 99%
“…Chen et al [9] proposes a method based on whitespace rectangles extraction and grouping: initially the foreground CCs are extracted and linked into chains according to their horizontal adjacency relationship; whitespace rectangles are then extracted from the gap between horizontally adjacent CCs; progressively CCs and whitespaces are grouped and filtered to form text lines and afterward text blocks. Lazzara et al [20] provides a chain of steps to first recognize text regions and successively non-text elements. Foreground CCs are extracted, then delimiters (such as lines, whitespaces and tab-stop) are detected with object alignment and morphological algorithms.…”
Section: Related Workmentioning
confidence: 99%
“…Chen et al 14 proposes a method based on whitespace rectangles extraction and grouping: initially the foreground CCs are extracted and linked into chains according to their horizontal adjacency relationship; whitespace rectangles are then extracted from the gap between horizontally adjacent CCs; progressively CCs and whitespaces are grouped and filtered to form text lines and afterward text blocks. Lazzara et al 15 provides a chain of steps to first recognize text regions and successively non-text elements. Foreground CCs are extracted, then delimiters (such as lines, whitespaces and tab-stop) are detected with object alignment and morphological algorithms.…”
Section: Related Workmentioning
confidence: 99%