2015
DOI: 10.1007/s10032-015-0245-z
|View full text |Cite
|
Sign up to set email alerts
|

A study of Bag-of-Visual-Words representations for handwritten keyword spotting

Abstract: The Bag-of-Visual-Words (BoVW) framework has gained popularity among the document image analysis community, specifically as a representation of handwritten words for recognition or spotting purposes. Although in the computer vision field the BoVW method has been greatly improved, most of the approaches in the document image analysis domain still rely on the basic implementation of the BoVW method disregarding such latest refinements. In this paper, we present a review of those improvements and its application … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
1
3

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 48 publications
(30 citation statements)
references
References 56 publications
0
26
1
3
Order By: Relevance
“…In [7,48], profile features were combined with the shape based structural features for a partial matching scheme using dtw. Although these features Variable Length Profile+Moments [46] Profile+DFT [40] Slit HoG [79] Local Gradient Histogram [58] HMM SIFT [63] SIFT [1,2,67,70,86] BoWs SIFT [3] Fisher SIFT [4] PHOC Attributes Deep Learning [37] Neural Codes Deep Learning [35,74,75,83] PHOC Attributes Deep Learning [24] Levenshtein Embedding are fast to compute, it is susceptible to noise and common degradation present in documents.…”
Section: Classical Representationmentioning
confidence: 99%
See 1 more Smart Citation
“…In [7,48], profile features were combined with the shape based structural features for a partial matching scheme using dtw. Although these features Variable Length Profile+Moments [46] Profile+DFT [40] Slit HoG [79] Local Gradient Histogram [58] HMM SIFT [63] SIFT [1,2,67,70,86] BoWs SIFT [3] Fisher SIFT [4] PHOC Attributes Deep Learning [37] Neural Codes Deep Learning [35,74,75,83] PHOC Attributes Deep Learning [24] Levenshtein Embedding are fast to compute, it is susceptible to noise and common degradation present in documents.…”
Section: Classical Representationmentioning
confidence: 99%
“…The popularity of bag of words (bow) [11,73] framework using local gradient features such as sift and hog, led to its proliferation to document images [1,2,67,68,70,86]. Rusinol et al [67,68], presented a patch based framework using bow histograms computed from the underlying sift descriptors.…”
Section: Bag Of Word Representationmentioning
confidence: 99%
“…Thus there is no explicit use of visual word positions within the image. Traditional visual words based methods suffer when faced with similar appearances but distinct semantic concepts (Aldavert et al, 2015). In this study, we assume that establishing spatial dependencies might be useful for preserving the spatial organization of data.…”
Section: Relative Conjunction Matrixmentioning
confidence: 99%
“…The Bag of word framework is among the popular and wellknown feature representation in information retrieval [8]. This method had been applied by J. Sivic and A. Zisserman in image and video retrieval field [9].…”
Section: Bag Of Word T Echniquesmentioning
confidence: 99%