2022 26th International Conference on Pattern Recognition (ICPR) 2022
DOI: 10.1109/icpr56361.2022.9956488
|View full text |Cite
|
Sign up to set email alerts
|

Arbitrary Shape Text Detection using Transformers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 32 publications
0
8
0
Order By: Relevance
“…This broader scope allows for the detection of more complex behaviors. In addition, the exploration of alternative approaches to object detection and tracking, such as those based on vision transformers, represents another promising direction for advancing our analytic capabilities and improving the accuracy of anomaly detection algorithms [76,77]. These efforts will contribute to a more robust and sophisticated system for monitoring and securing museum environments.…”
Section: Future Workmentioning
confidence: 99%
“…This broader scope allows for the detection of more complex behaviors. In addition, the exploration of alternative approaches to object detection and tracking, such as those based on vision transformers, represents another promising direction for advancing our analytic capabilities and improving the accuracy of anomaly detection algorithms [76,77]. These efforts will contribute to a more robust and sophisticated system for monitoring and securing museum environments.…”
Section: Future Workmentioning
confidence: 99%
“…ABCNet fits directional or curved text with parametric Bezier curves and introduces only a small computational load. Using Bezier curves to express text boxes has also been widely used in other studies [27][28][29]. Bezier curves significantly enhance text detection's flexibility, but their computational process is not conducive to high detection speed.…”
Section: Related Workmentioning
confidence: 99%
“…Location and Shape Decoupling Module Previous works [47,56] use a single feed-forward neural network (FFN) to predict the control points in image space and train the network by applying L1 loss on the control points. However, as shown in Sec.…”
Section: Unified Detection Of Text Line and Paragraphmentioning
confidence: 99%
“…Notably, we find that the conventional way of training Bezier Curve polygon prediction head, i.e. applying L1 losses on control points directly [30,47,56], fails to capture text shapes accurately on highly diverse dataset such as Hi-erText [34]. Hence, we propose a novel Location and Shape Decoupling Module (LSDM) which decouples the representation learning of location and shape.…”
Section: Introductionmentioning
confidence: 99%