2019 International Conference on Document Analysis and Recognition (ICDAR) 2019
DOI: 10.1109/icdar.2019.00133
|View full text |Cite
|
Sign up to set email alerts
|

Manifold Mixup Improves Text Recognition with CTC Loss

Abstract: Modern handwritten text recognition techniques employ deep recurrent neural networks. The use of these techniques is especially efficient when a large amount of annotated data is available for parameter estimation. Data augmentation can be used to enhance the performance of the systems when data is scarce. Manifold Mixup is a modern method of data augmentation that meld two images or the feature maps corresponding to these images and the targets are fused accordingly. We propose to apply the Manifold Mixup to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
9
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(9 citation statements)
references
References 26 publications
0
9
0
Order By: Relevance
“…The historical Bentham dataset [61], which consists of images of letters by the English philosopher Jeremy Bentham (1748-1832), was utilized in the work of [47]. Furthermore, the English subset of the Maurdor dataset [62] was used in [34] and contains heterogeneous images of different types of documents. Finally, the dataset "GoodNotes Handwriting Kollection" (GNHK) [63] comprises unrestricted cameracaptured images of English handwritten text from various regions, characterized by diverse styles and increased noise, was used in the work of [38].…”
Section: Recognition Tasks and Datasetsmentioning
confidence: 99%
See 3 more Smart Citations
“…The historical Bentham dataset [61], which consists of images of letters by the English philosopher Jeremy Bentham (1748-1832), was utilized in the work of [47]. Furthermore, the English subset of the Maurdor dataset [62] was used in [34] and contains heterogeneous images of different types of documents. Finally, the dataset "GoodNotes Handwriting Kollection" (GNHK) [63] comprises unrestricted cameracaptured images of English handwritten text from various regions, characterized by diverse styles and increased noise, was used in the work of [38].…”
Section: Recognition Tasks and Datasetsmentioning
confidence: 99%
“…The RIMES dataset was used in the studies [27, 29-31, 34, 37, 40, 42, 46, 49]. The second dataset was the French subset of Maurdor, which was utilized in the study [34].…”
Section: Recognition Tasks and Datasetsmentioning
confidence: 99%
See 2 more Smart Citations
“…Other point cloud Mixup methods include Rigid SubSet Mixup [29] and Point MixSwap [49]. Mixup has also been investigated for LiDAR [55], graphs [53], speaker verification [70], vision-language navigation [32], single-view 3D reconstruction [10], and language processing [28,38,45,54,68]. We focus on Mixup for images, but our approach is generic and can be applied to many Mixup variants.…”
Section: Related Workmentioning
confidence: 99%