2010 20th International Conference on Pattern Recognition 2010
DOI: 10.1109/icpr.2010.1137
|View full text |Cite
|
Sign up to set email alerts
|

Incorporating Linguistic Model Adaptation into Whole-Book Recognition

Abstract: Whole-book recognition is a document image analysis strategy that operates on the complete set of a book's page images using automatic adaptation to improve accuracy. Our algorithm expects to be given approximate iconic and linguistic models-derived from (generally errorful) OCR results and (generally incomplete) dictionaries-and then, guided entirely by evidence internal to the test set, corrects the models yielding improved accuracy. The iconic model describes image formation and determines the behavior of a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2011
2011
2014
2014

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 4 publications
0
3
0
Order By: Relevance
“…By substituting (8) and (9) into (4), the Bayesian model for topic determination is: (10) In fact, the best matched topic can be alternatively found by imposing logarithm operation on (10). (11) In this paper, the top largest -character transition probabilities are used as the topic features.…”
Section: Topic Language Model Adaptionmentioning
confidence: 99%
See 1 more Smart Citation
“…By substituting (8) and (9) into (4), the Bayesian model for topic determination is: (10) In fact, the best matched topic can be alternatively found by imposing logarithm operation on (10). (11) In this paper, the top largest -character transition probabilities are used as the topic features.…”
Section: Topic Language Model Adaptionmentioning
confidence: 99%
“…In speech recognition, the language model adaption explores contextual information and improves recognition accuracy. Xiu [10] integrated language model adaptation into whole book recognition and greatly decreased recognition errors. Wang [11] achieved apparent accuracy improvement on ancient domain text image recognition with language model adaption.…”
Section: Topic Language Model Adaption For Recognition Of I Introducmentioning
confidence: 99%
“…This agent implements the linguistic post-processing procedure in. 10 Parameters: (1) a set of words with their normal frequency ranges; (2) a default frequency range for new words (most are rare words).…”
Section: Agent 5 Word Frequency Constraintmentioning
confidence: 99%