2017
DOI: 10.1109/tmm.2016.2638622
|View full text |Cite
|
Sign up to set email alerts
|

Words Matter: Scene Text for Image Classification and Retrieval

Abstract: Abstract-Text in natural images typically adds meaning to an object or scene. In particular, text specifies which business places serve drinks (e.g. cafe, teahouse) or food (e.g. restaurant, pizzeria), and what kind of service is provided (e.g. massage, repair). The mere presence of text, its words and meaning are closely related to the semantics of the object or scene. This paper exploits textual contents in images for fine-grained business place classification and logo retrieval. There are four main contribu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
69
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 110 publications
(69 citation statements)
references
References 57 publications
0
69
0
Order By: Relevance
“…The most related work with our proposal in this paper is the one in [3], [20], where scene texts in the business place images are considered as the domain knowledge. The authors proposed to combine visual and textual cues for fine-grained business place classification.…”
Section: Related Work a Fine-grained Classificationmentioning
confidence: 99%
See 3 more Smart Citations
“…The most related work with our proposal in this paper is the one in [3], [20], where scene texts in the business place images are considered as the domain knowledge. The authors proposed to combine visual and textual cues for fine-grained business place classification.…”
Section: Related Work a Fine-grained Classificationmentioning
confidence: 99%
“…In [20], Bagof-Words are used to represent visual cues. Deep visual cues given by GoogleNet [5] are used in [3]. Sharing the same spirit as part-based methods, the authors proposed in [3] to extract 100 EdgeBoxes [21].…”
Section: Related Work a Fine-grained Classificationmentioning
confidence: 99%
See 2 more Smart Citations
“…Scene text retrieval [11,9], which aims at retrieving images based on text content, is closely related to scene text verification. The verification task could be seen as a subtask for scene text retrieval, as it only cares about the existence of text and no ranking is needed.…”
Section: Related Workmentioning
confidence: 99%