Flexible and Robust Model Matching based on Association Graph for Form Image Understanding

Ishitani, Yasuto

doi:10.1007/pl00010982

Cited by 15 publications

(4 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The problems with these approaches are that the feature vector does not reflect the hierarchical layout of the forms. Another type of representation tries to utilise hierarchical structure of blocks as an X-Y tree [2,14,7]. X-Y tree representation is possible only for forms with boxes and thus has limited application area as many forms have horizontal marker lines instead of boxes.…”

Section: Past Workmentioning

confidence: 99%

A hierarchical method for automated identification and segmentation of forms

Mandal

Chowdhury

Das

et al. 2005

Eighth International Conference on Document Analysis and Recognition (ICDAR'05)

View full text Add to dashboard Cite

In this paper we propose a fully automatic hierarchical method for identification of forms using global as well as local features. Moments of certain orders are considered as global shape features and are utilised to reduce the search space by selecting a subset of forms present in the database. The type of the candidate form is then identified within this subset through detail analysis using local geometrical and topological features. The candidate form is then segmented to extract the user-filled information.

show abstract

Section: Past Workmentioning

confidence: 99%

A hierarchical method for automated identification and segmentation of forms

Mandal

Chowdhury

Das

et al. 2005

Eighth International Conference on Document Analysis and Recognition (ICDAR'05)

View full text Add to dashboard Cite

show abstract

“…In this case, a text region in each data field of a table is hierarchically extracted. This subsystem also extracts table features consisting of the geometric information of a table, that of ruled lines, and that of data fields from each table region [4]. In the proposed method, a data field is described as a rectangle and is called a cell.…”

Section: Figure 2 Document Transformation Systemmentioning

confidence: 99%

“…In the proposed method, the table having simplified cells is identified and is modified to construct the regular row structure of cells from the input table as follows. First, an isolated table is detected from the input document by ruled line extraction and grouping process [4]. Next, each cell is extracted from these ruled lines and is classified.…”

Section: Table Structure Analysis For Xml Document Transformationmentioning

confidence: 99%

Table structure analysis based on cell classification and cell modification for XML document transformation

Ishitani

Fume

Sumita

2005

Eighth International Conference on Document Analysis and Recognition (ICDAR'05)

Self Cite

View full text Add to dashboard Cite

A new method of table structure analysis based on cell classification and cell modification is proposed in this paper as the basis of an OCR which can convert a variety of printed tables into XML documents in accordance with a specified XML schema. The outline of this method is described as follows. Firstly, cell features defined by ruled lines, which correspond to data fields, are extracted from the input image of a table. After that, each cell is classified to identify the irregular table whose ruled lines are not gridded and is modified to form regular cell arrangement. Next, the hierarchical table structure consisting of a regular row structure of cells is extracted from the modified regular table and is described using a DOM tree. In this case, logical objects within a cell are extracted and are converted into a sub-tree in the DOM tree. Finally, this DOM tree is transformed into a target XML document by an XML parser with information extraction process. Experimental results show the method is effective in transforming various printed tables to various XML documents.

show abstract

“…The method is found to be hard to apply to analysis of filled-in form, because it is considered to be limited to empty fields. Ishitani[10] uses a hierarchical matching strategy based on sub-graph matching, local matching stages by line matching, and the interactions between them. But the similarity measure totally depends on the number of vertical and horizontal lines.…”

mentioning

confidence: 99%