2012
DOI: 10.1142/s0218001412630025

On the Influence of Word Representations for Handwritten Word Spotting in Historical Documents

Abstract: Word spotting is the process of retrieving all instances of a queried keyword from a digital library of document images. In this paper we evaluate the performance of different word descriptors to assess the advantages and disadvantages of statistical and structural models in a framework of query-by-example word spotting in historical documents. We compare four word representation models, namely sequence alignment using DTW as a baseline reference, a bag of visual words approach as statistical model, a pseudo-s…

Citations: cited by 43 publications (20 citation statements)
References: 36 publications
“…In the document image analysis literature, we can distinguish two different families of keyword spotting methods depending on the representation of the handwritten words [26]. On the one hand, sequential word representations [35] describe handwritten words as a time series by using a sliding window in the writing direction.…”
Section: Introduction (mentioning)
confidence: 99%
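
The sequential representation described in the statement above can be illustrated with a minimal sketch. It assumes a binarized word image given as a list of rows of 0/1 values and a sliding window one pixel wide; the function name `column_features` and the three per-column features (ink count, upper profile, lower profile) are illustrative assumptions, not the features used in the cited work.

```python
def column_features(image):
    """Turn a binary word image into a left-to-right sequence of feature vectors.

    image: list of rows, each a list of 0/1 values (1 = ink).
    Returns one (ink_count, upper_profile, lower_profile) tuple per column.
    The feature choice is an assumption made for illustration.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    sequence = []
    for x in range(width):
        # Rows of this column that contain ink, top to bottom.
        ink_rows = [y for y in range(height) if image[y][x]]
        ink_count = len(ink_rows)
        # Empty columns get sentinel profiles outside the image.
        upper = ink_rows[0] if ink_rows else height
        lower = ink_rows[-1] if ink_rows else -1
        sequence.append((ink_count, upper, lower))
    return sequence


if __name__ == "__main__":
    toy_word = [
        [0, 1, 0],
        [1, 1, 0],
        [0, 1, 0],
    ]
    print(column_features(toy_word))
```

Each column becomes one step of a time series, so two word images of different widths yield sequences of different lengths, which is why such representations are typically compared with an elastic alignment rather than a fixed-length distance.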
“…In word spotting literature, dynamic time warping (DTW) is one of the most commonly used methods to calculate the similarity of words [9,18,33,43,46,48]. DTW can tolerate spatial variations unlike other methods such as XOR, Euclidean Distance Mapping, Sum of Squared Differences [47].…”
Section: Related Work (mentioning)
confidence: 99%
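
To make concrete how DTW tolerates variations that a fixed-length distance cannot, the following is a minimal sketch of the textbook dynamic-programming recurrence. It assumes each word is already a sequence of equal-dimension feature vectors and uses Euclidean distance as the local cost; the name `dtw_distance` is hypothetical, and no path-length normalization or band constraint is applied, so it is not necessarily the variant used in the cited papers.

```python
import math


def dtw_distance(a, b):
    """Unnormalized DTW cost between two sequences of feature vectors.

    a, b: sequences of equal-dimension numeric tuples (assumed input format).
    Local cost is the Euclidean distance; steps allowed are
    insertion, deletion, and match.
    """
    n, m = len(a), len(b)
    # cost[i][j] = best cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


if __name__ == "__main__":
    query = [(0.0,), (1.0,), (2.0,)]
    stretched = [(0.0,), (1.0,), (1.0,), (2.0,)]  # same shape, one step repeated
    print(dtw_distance(query, stretched))
```

In this toy case the stretched sequence aligns with the query at zero cost even though the two differ in length, which a point-by-point Euclidean comparison could not handle.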
“…The first one consists of 27 pages from a collection of marriage registers in the Barcelona Cathedral from 1451 to 1905 [15]. For the second evaluation corpus, we select part of the IAM off-line dataset, which is the biggest collection of a unique writing style.…”
Section: Experimental Data and Evaluation Criteria (mentioning)
confidence: 99%