2004
DOI: 10.1007/978-3-540-28640-0_18
|View full text |Cite
|
Sign up to set email alerts
|

Multi-view hac for Semi-supervised Document Image Classification

Abstract: Abstract. This paper presents a semi-supervised document image classification system that aims to be integrated into a commercial document reading software. This system is asserted like an annotation help. From a set of unknown document images given by a human operator, the system computes regrouping hypothesis of same physical layout images and proposes them to the operator. Then he can correct them, validate them, keeping in mind that his objective is to have homogeneous groups of images. These groups will b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2010
2010
2013
2013

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 4 publications
0
1
0
Order By: Relevance
“…The authors proposed an algorithm based on a logistic regression classifier working on a set of manually segmented and labelled page images, followed by statistical classifiers for the logical layout analysis. A similar approach can be found in (Carmagnac, 2004a) in which a semisupervised document image classification system is presented. Given a set of unknown document images, the system uses unsupervised clustering to obtain grouping hypotheses for the same physical layout images.…”
Section: Related Workmentioning
confidence: 99%
“…The authors proposed an algorithm based on a logistic regression classifier working on a set of manually segmented and labelled page images, followed by statistical classifiers for the logical layout analysis. A similar approach can be found in (Carmagnac, 2004a) in which a semisupervised document image classification system is presented. Given a set of unknown document images, the system uses unsupervised clustering to obtain grouping hypotheses for the same physical layout images.…”
Section: Related Workmentioning
confidence: 99%