2016
DOI: 10.1007/s00521-016-2197-8
|View full text |Cite
|
Sign up to set email alerts
|

Querying out-of-vocabulary words in lexicon-based keyword spotting

Abstract: Puigcerver, J.; Toselli, AH.; Vidal, E. (2016). Querying out-of-vocabulary words in lexiconbased keyword spotting. Neural Computing and Applications. 1-10. doi:10.1007/s00521-016-2197-8. Abstract Lexicon-based handwritten text keyword spotting (KWS) has proven to be a faster and more accurate alternative to lexicon-free methods. Nevertheless, since lexicon-based KWS relies on a predefined vocabulary, fixed in the training phase, it does not support queries involving out-of-vocabulary (OOV) keywords. In this pa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…A word similarity can be computed in terms of character edit distances, possibly weighted by estimated optical dissimilarity between character pairs. Work exploring this idea, along with the use the Filler-HMM model as a back-off method, is presented in [43].…”
Section: Additional Results and Comparisonsmentioning
confidence: 99%
“…A word similarity can be computed in terms of character edit distances, possibly weighted by estimated optical dissimilarity between character pairs. Work exploring this idea, along with the use the Filler-HMM model as a back-off method, is presented in [43].…”
Section: Additional Results and Comparisonsmentioning
confidence: 99%
“…The idea is to smooth the (implicitly null) relevance probabilities of OOV keywords by relying on the indexed probabilities of "similar" in-vocabulary words. Most of our work in this direction is reviewed or presented in [49]. While reasonably good results are achieved with these methods, they always entail query response time penalties for OOV queriesand these penalties can become prohibitive for large collections of say hundreds of thousands or millions of images.…”
Section: Discussionmentioning
confidence: 99%
“…Moreover, it can recognise and retrieve results for words where there are historical or personal variations in spelling. Thus, this form of searching can produce useable results with HTR models that have higher error rates, up to 30 per cent CER (Giotis et al, 2017;Puigcerver et al, 2015Puigcerver et al, , 2017Retsinas et al, 2016;Strauß et al, 2016;Toselli et al, 2017). The platform displays the results of a Keyword Spotting query as a list of transcribed words, thumbnail images of the portion of the digitised pages on which those words appear and a confidence rating for each word.…”
Section: Contributementioning
confidence: 99%