2014 11th IAPR International Workshop on Document Analysis Systems 2014
DOI: 10.1109/das.2014.18
|View full text |Cite
|
Sign up to set email alerts
|

Sequential Word Spotting in Historical Handwritten Documents

Abstract: In this work we present a handwritten word spotting approach that takes advantage of the a priori known order of appearance of the query words. Given an ordered sequence of query word instances, the proposed approach performs a sequence alignment with the words in the target collection. Although the alignment is quite sparse, i.e. the number of words in the database is higher than the query set, the improvement in the overall performance is sensitively higher than isolated word spotting. As application dataset… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 27 publications
0
5
0
Order By: Relevance
“…Histogram of oriented gradient (HOG) is a technique which counts occurrences of the gradient orientation in the local part of an image. In [34], an extension of the HOG descriptor for a specific case of handwriting has been described. The combination of gradient features and a flexible plus adaptable grid has been used to extract features.…”
Section: ) Pixel Level Featuresmentioning
confidence: 99%
“…Histogram of oriented gradient (HOG) is a technique which counts occurrences of the gradient orientation in the local part of an image. In [34], an extension of the HOG descriptor for a specific case of handwriting has been described. The combination of gradient features and a flexible plus adaptable grid has been used to extract features.…”
Section: ) Pixel Level Featuresmentioning
confidence: 99%
“…Matching two document images has several applications related to information retrieval like spotting keywords in historical documents [1], accessing personal notes [2], camera based interface for querying [3], retrieving from video databases [4], automatic scoring of answer sheets [5], and mining and recommending in health care documents [6]. Since ocrs do not reliably work for all types of documents, one resorts to image based methods for comparing textual content.…”
Section: Introductionmentioning
confidence: 99%
“…We validate the effectiveness of our method on an application, named as measure of document similarity (mods). 1 mods compares two handwritten 1 In parallel to measure of software similarity (moss) [7], which has emerged as the de facto standard across the universities to compare two software solutions from students. Figure 1: (a) Given two document images D i and D j , we are interested in computing a similarity score S(D i , D j ) which is invariant to (i) writers, (ii) word flow across lines, (iii) spatial shifts, and (iv) paraphrasing.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…RELATED WORK The word-spotting problem has always attracted the interest of the pattern recognition community; the seminal works in the field were in speech recognition [27], while the first application of word-spotting to handwritten text was presented a little later by Manmatha et al [26]. Since then, the importance of indexing and browsing old handwritten books leads to a numerous interesting contributions [11], [37], making it an active area of research. Space does not allow an exhaustive review of the literature.…”
Section: Introductionmentioning
confidence: 99%