2004
DOI: 10.1016/j.ins.2003.05.015

Thick 2D relations for document understanding

Abstract: We use a propositional language of qualitative rectangle relations to detect the reading order from document images. To this end, we define the notion of a document encoding rule and we analyze possible formalisms to express document encoding rules such as LaTeX and SGML. Document encoding rules expressed in the propositional language of rectangles are used to build a reading order detector for document images. In order to achieve robustness and avoid brittleness when applying the system to real life docum…

Cited by 6 publications (4 citation statements)
References 24 publications
“…Structural representations stand as a kind of alternative approach with some preliminary results, but to be further investigated. Graph-Based Representation of an image is a rich domain, relations between regions or points of interest can be modeled in many ways, among them, we can cite the representations issued from Delaunay triangulation [84], Allen algebra [83] or a neighboring graph.…”
Section: Analysis and Discussion
Citation type: mentioning
confidence: 99%
“…We can mention GBR methods using Bi-dimensional Allen Algebra (Ref. [83]) or Delaunay triangulation (Ref. [84]).…”
Section: Recognition Rate Comparison GBR vs BoW
Citation type: mentioning
confidence: 99%
“…The output of the spatial reasoner is a (cyclic) graph where edges represent instances of the partial ordering relation BeforeInReading. A reading order is then defined as a full path in this graph, and is determined by means of an extension of a standard topological sort [12]. Due to the generality of the document encoding rule used by the spatial reasoner, it is likely that one obtains more than one reading order, especially for complex documents with many blocks.…”
Section: Background and Related Work
Citation type: mentioning
confidence: 99%
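
The statement above describes a reading order as a full path through a graph of BeforeInReading edges, found with an extension of topological sort, and notes that several reading orders may result. The cited extension itself is not reproduced here. As a rough illustration only, the sketch below enumerates every ordering consistent with a set of BeforeInReading pairs, assuming the graph is acyclic; the function name `reading_orders` and the block labels are hypothetical.

```python
from typing import Dict, Iterator, List, Sequence, Set, Tuple


def reading_orders(
    blocks: Sequence[str],
    before_edges: Sequence[Tuple[str, str]],
) -> Iterator[List[str]]:
    """Yield every total order of `blocks` that respects `before_edges`.

    Each pair (a, b) stands for "a BeforeInReading b". This is plain
    backtracking over topological sorts, NOT the extension used in the
    cited system. A cyclic input simply yields no ordering.
    """
    successors: Dict[str, Set[str]] = {b: set() for b in blocks}
    indegree: Dict[str, int] = {b: 0 for b in blocks}
    for a, b in before_edges:
        if b not in successors[a]:
            successors[a].add(b)
            indegree[b] += 1

    order: List[str] = []
    placed: Set[str] = set()

    def extend() -> Iterator[List[str]]:
        if len(order) == len(blocks):
            yield list(order)
            return
        for block in blocks:
            # A block may come next only if all its predecessors are placed.
            if block not in placed and indegree[block] == 0:
                placed.add(block)
                order.append(block)
                for s in successors[block]:
                    indegree[s] -= 1
                yield from extend()
                # Undo the choice before trying the next candidate.
                for s in successors[block]:
                    indegree[s] += 1
                order.pop()
                placed.remove(block)

    yield from extend()


if __name__ == "__main__":
    blocks = ["title", "col1", "col2", "footer"]
    edges = [("title", "col1"), ("title", "col2"),
             ("col1", "footer"), ("col2", "footer")]
    for candidate in reading_orders(blocks, edges):
        print(candidate)
```

With a general rule that leaves the two columns unordered, the example admits two orderings, which mirrors the remark that more than one reading order is likely for documents with many blocks.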
“…In [Aiello et al, 2002] we presented a system which, by using spatial reasoning [Aiello and Smeulders, 2004] and text processing techniques [Sparck Jones, 1972, Salton and McGill, 1983, Baeza-Yates and Ribeiro-Neto, 1999], is able to extract the reading order from document images for documents having very different layouts. In particular, a spatial language for rectangles [Balbiani et al, 1998], based on the work on interval relations of Allen [Allen, 1983], is used to describe both the elements in a document and general rules to analyze them.…”
Section: Three Algorithms For Article Clustering
Citation type: mentioning
confidence: 99%
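
The statement above mentions a spatial language for rectangles built on Allen's interval relations. One common way to obtain such a language, assumed here rather than taken from the cited papers, is to apply the 13 Allen relations independently to the x and y projections of two axis-aligned bounding boxes. The names `allen` and `rect_relation` and the `(x0, y0, x1, y1)` box layout are hypothetical choices for this sketch.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1), with x0 < x1 and y0 < y1


def allen(a_start: float, a_end: float, b_start: float, b_end: float) -> str:
    """Return the Allen relation of interval A to interval B.

    Assumes proper intervals (start < end). Exact equality tests make
    this brittle on noisy coordinates; it is a sketch of the crisp
    relations only.
    """
    if a_end < b_start:
        return "before"
    if b_end < a_start:
        return "after"
    if a_end == b_start:
        return "meets"
    if b_end == a_start:
        return "met_by"
    if a_start == b_start and a_end == b_end:
        return "equals"
    if a_start == b_start:
        return "starts" if a_end < b_end else "started_by"
    if a_end == b_end:
        return "finishes" if a_start > b_start else "finished_by"
    if b_start < a_start and a_end < b_end:
        return "during"
    if a_start < b_start and b_end < a_end:
        return "contains"
    # Only partial overlap remains at this point.
    return "overlaps" if a_start < b_start else "overlapped_by"


def rect_relation(a: Box, b: Box) -> Tuple[str, str]:
    """Pair of Allen relations for the x and y projections of two boxes."""
    ax0, ay0, ax1, ay1 = a
    bx0, by0, bx1, by1 = b
    return allen(ax0, ax1, bx0, bx1), allen(ay0, ay1, by0, by1)


if __name__ == "__main__":
    heading = (0, 0, 10, 5)
    body = (0, 5, 10, 12)
    print(rect_relation(heading, body))
```

In image coordinates with y growing downward, a heading box that sits directly on top of a body box of the same width gets the pair `("equals", "meets")`. A rule language over such pairs could then state, for instance, which combinations imply that one block is read before another.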