2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00870

MOST: A Multi-Oriented Scene Text Detector with Localization Refinement

Cited by 85 publications (22 citation statements)
References 32 publications

“…RRPN [13] adopts rotated anchors and RRoI pooling for detecting multi-oriented texts. Different from these anchor-based methods, anchor-free methods (e.g., EAST [16], MOST [19], and DDR [20]) directly regress the offsets from boundaries or vertexes to the current point for detecting texts. LOMO [12] introduces an iterative refinement module to iteratively refine the text localization of a direct regression based on bounding box proposals.…”
Section: A Regression-based Methods
Citation type: mentioning
confidence: 99%
“…In multi-oriented scene text detection, regression-based methods are popular, including anchor-based methods [28,39] and anchor-free methods [15,16,20,74]. They usually directly predict entire texts using a rotated bounding box or quadrangle.…”
Section: Multi-oriented Object Detection
Citation type: mentioning
confidence: 99%
“…EAST [74] and DDR [16] perform rotated bounding box regression or vertex regression at each location. MOST [15] puts forward a set of strategies to improve the quality of text localization for long text significantly.…”
Section: Multi-oriented Object Detection
Citation type: mentioning
confidence: 99%
“…While state-of-the-art text detection systems such as [44,61] excel at localizing individual text entities, visual text understanding [2] requires comprehension of the semantic and geometric layout [5,7] of the textual content. In the current literature, most works focus on the individual tasks of text entities detection [3,18,61] and layout analysis [26,58] in a separate way, devoting all the power of deep learning models to task-specific performance. We argue that joint treatment of these two closely related problems can result not only in simpler and more efficient models, but also models that are more accurate across all tasks.…”
Section: Introduction
Citation type: mentioning
confidence: 99%
“…The division between text detection and geometric layout analysis tasks has led to parallel and separate research directions. Text detectors [14,18,40,61] usually treat word-level annotations, i.e. sequence of characters not interrupted by …”
Section: Introduction
Citation type: mentioning
confidence: 99%