2015
DOI: 10.1007/978-3-319-27857-5_21
|View full text |Cite
|
Sign up to set email alerts
|

A Novel Word Segmentation Method Based on Object Detection and Deep Learning

Abstract: A novel word segmentation method based on object detection and deep learning.In: Advances in Visual Computing: 11th International Symposium, ISVC 2015, Las Vegas, NV, USA, December 14-16, 2015 Abstract. The segmentation of individual words is a crucial step in several data mining methods for historical handwritten documents. Examples of applications include visual searching for query words (word spotting) and character-by-character text recognition. In this paper, we present a novel method for word segmentat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
5
1
1

Relationship

3
4

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 13 publications
0
8
0
Order By: Relevance
“…It is a powerful and relatively light-weight model that achieves state-of-the-art results in segmentation-free word spotting. The Ctrl-F-Mini allows for searches using arbitrary text queries in full-page text images, and consists of a pre-activation ResNet34 backbone [16]; a bilinear interpolation layer that resizes region proposals into a canonical output size [17], [18]; an embedding network that embeds region proposals into a word embedding space; and external region proposals generated by the Dilated Text Proposals algorithm [19]. The output consists of a set of word proposals and corresponding scores that encode the probability of a proposal being centred on a word.…”
Section: A Word Spotting Modelmentioning
confidence: 99%
“…It is a powerful and relatively light-weight model that achieves state-of-the-art results in segmentation-free word spotting. The Ctrl-F-Mini allows for searches using arbitrary text queries in full-page text images, and consists of a pre-activation ResNet34 backbone [16]; a bilinear interpolation layer that resizes region proposals into a canonical output size [17], [18]; an embedding network that embeds region proposals into a word embedding space; and external region proposals generated by the Dilated Text Proposals algorithm [19]. The output consists of a set of word proposals and corresponding scores that encode the probability of a proposal being centred on a word.…”
Section: A Word Spotting Modelmentioning
confidence: 99%
“…The contributions of this paper include: 1) Two models for segmentation-free query-by-string word spotting are introduced: An end-to-end trainable model based on Faster R-CNN [18] and previous work [19], [20]; and a simplified version that performs equally well or better in certain situations. 2) Two novel data augmentation strategies for full manuscript pages, crucial for preventing model overfitting.…”
Section: Contributionsmentioning
confidence: 99%
“…As the RPN is a sliding window approach, complementing its proposals with an external region proposal method based on connected components is likely to improve the recall rate. To extract external proposals, we use the recently introduced method from [37], which is specially designed for manuscript images, and can be considered to be based on connected components. While the method is not given a name, we call it Dilated Text Proposals (DTP) to increase the clarity of this work.…”
Section: Dilated Text Proposalsmentioning
confidence: 99%