2013 12th International Conference on Document Analysis and Recognition 2013
DOI: 10.1109/icdar.2013.248
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Classification of Structurally Similar Document Images

Abstract: In this paper, we present a learning based approach for computing structural similarities among document images for unsupervised exploration in large document collections. The approach is based on multiple levels of content and structure. At a local level, a bag-of-visual words based on SURF features provides an effective way of computing content similarity. The document is then recursively partitioned and a histogram of codewords is computed for each partition. Structural similarity is computed using a random… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
45
0
2

Year Published

2015
2015
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 40 publications
(47 citation statements)
references
References 21 publications
0
45
0
2
Order By: Relevance
“…The first version of the dataset, listed in the results as SmallTobacco, is a sample of 3482 images from the collection, selected and labelled in another work [20]. This version of the dataset was used in a number of related papers [20,22,17]. Each image has one of ten labels.…”
Section: A Datasetsmentioning
confidence: 99%
See 1 more Smart Citation
“…The first version of the dataset, listed in the results as SmallTobacco, is a sample of 3482 images from the collection, selected and labelled in another work [20]. This version of the dataset was used in a number of related papers [20,22,17]. Each image has one of ten labels.…”
Section: A Datasetsmentioning
confidence: 99%
“…Each dataset was split into three subsets for the purposes of experimentation. The SmallTobacco dataset was split as in the related work [20,22,17]: 800 images were used for training, 200 for validation, and the remainder for testing. Since this is a small dataset, 10 random splits in those proportions were created; results reflect the median performance from those splits.…”
Section: A Datasetsmentioning
confidence: 99%
“…Local feature points, such as SIFT [21], are widely used as local features due to their description capabilities. Regarding the second step, image encoding, BOW were originally used to encode the feature point's distribution in a global image representation [12,16]. Fisher vectors and VLAD later showed improvement over the BOW [23,14].…”
Section: Previous Workmentioning
confidence: 99%
“…the structure of text blocks, figures, and tables. Such descriptions are finally used to perform classification [2,16] or to compute similarities [8,24]. The second type 1 , Ahmad Montaser Awal 2 , and Teddy Furon 1…”
Section: Introductionmentioning
confidence: 99%
“…In our application, however, the robust detection of text zones is a difficult task in itself for the same reason the OCR fails. Kumar and Doermann [11] use a bag-of-words approach with SURF features combined in hierarchical histograms in order to visually compare documents. While they focus on black and white documents, we also incorporate color documents.…”
Section: Related Workmentioning
confidence: 99%