2018
DOI: 10.1109/tmm.2017.2763323
|View full text |Cite
|
Sign up to set email alerts
|

Geo-Distinctive Visual Element Matching for Location Estimation of Images

Abstract: Abstract-We propose an image representation and matching approach that substantially improves visual-based location estimation for images. The main novelty of the approach, called distinctive visual element matching (DVEM), is its use of representations that are specific to the query image whose location is being predicted. These representations are based on visual element clouds, which robustly capture the connection between the query and visual evidence from candidate locations. We then maximize the influenc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 18 publications
(12 citation statements)
references
References 49 publications
0
12
0
Order By: Relevance
“…is publishing method displays multimedia elements such as text, pictures, sound effects, and animations, which are the main elements of information exchange through the terminal screen, giving people convenient access to information and communication [8]. In terms of physical properties, web pages are computer-based information carriers and a collection of thousands of networks worldwide [9]. From its social role, people will naturally regard the network as the fourth media after newspapers, radio, and television.…”
Section: Introductionmentioning
confidence: 99%
“…is publishing method displays multimedia elements such as text, pictures, sound effects, and animations, which are the main elements of information exchange through the terminal screen, giving people convenient access to information and communication [8]. In terms of physical properties, web pages are computer-based information carriers and a collection of thousands of networks worldwide [9]. From its social role, people will naturally regard the network as the fourth media after newspapers, radio, and television.…”
Section: Introductionmentioning
confidence: 99%
“…Text-based approaches are classified into two broad categories: geoparsing and Language Model-based (LM). Geotagging approaches based on the visual content of images, such as the ones by Hayes et al [16], [17], Lin et al [35], Weyand et al [54] and Li et al [32], offer another interesting alternative solution to the problem, which is, however, beyond the scope of this paper. Similarly, multimodal approaches that combine both text and visual content to produce location estimates, such as the ones by Crandall et al [9], Kelm et al [20], Trevisiol et al [49] and Cao et al [3] are not further considered here.…”
Section: Related Workmentioning
confidence: 99%
“…Here, we focus on a state-of-the-art GLE system called Distinctive Visual Element Matching approach, which was proposed by Li et al [21]. Given a query image, a geo-location is predicted based on the evidence collected from images that have sufficient visual similarity to the query image and are also geographically close.…”
Section: Visual-based Geo-location Estimationmentioning
confidence: 99%
“…Each cell is considered a class, and is used to train a CNN classifier. Inspired by the result reported in [31], we trained a deep network and compared with the retrieval based approach by Li et al [21].…”
Section: Visual-based Geo-location Estimationmentioning
confidence: 99%
See 1 more Smart Citation