2019
DOI: 10.1016/j.rse.2019.04.014
|View full text |Cite
|
Sign up to set email alerts
|

Understanding urban landuse from the above and ground perspectives: A deep learning, multimodal solution

Abstract: This is the pre-acceptance version, to read the final version published in the journal Remote Sensing of Environment, please go to: https:// doi.org/10.1016/j.rse.2019.04.014 Landuse characterization is important for urban planning. It is traditionally performed with field surveys or manual photo interpretation, two practices that are time-consuming and labor-intensive.Therefore, we aim to automate landuse mapping at the urban-object level with a deep learning approach based on data from multiple sources (or m… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
97
1
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 128 publications
(99 citation statements)
references
References 49 publications
0
97
1
1
Order By: Relevance
“…Although the use of several sources of data (multimodal approaches) have proven to be beneficial for solving several problems it has been only applied in relatively few works described in this manuscript, such as in [19]. We believe that the performance of supervised methods used to improve OSM data can be greatly improved with the use of several data sources, such as images, tracking data, and social media data.…”
Section: Applications With Multimodal Datamentioning
confidence: 98%
“…Although the use of several sources of data (multimodal approaches) have proven to be beneficial for solving several problems it has been only applied in relatively few works described in this manuscript, such as in [19]. We believe that the performance of supervised methods used to improve OSM data can be greatly improved with the use of several data sources, such as images, tracking data, and social media data.…”
Section: Applications With Multimodal Datamentioning
confidence: 98%
“…Relating to the Buildings [5] andÎle-de-France land use [4] datasets, both were designed for multi-view scene classification. The first dataset contains 56, 259 paired aerial/street-level images of 4 different types of buildings, covering Washington DC.…”
Section: Related Workmentioning
confidence: 99%
“…• House: a single-family residence. 4 www.openstreetmap.org/ 5 https://overpass-turbo.eu/ 6 https://developers.google.com/maps/documentation/maps-static/intro 7 https://developers.google.com/maps/documentation/streetview/intro Figure 4, whereas the class distribution is presented in Figure 3. Regarding the images, all of them are 500×500 RGB images.…”
Section: B Cv-brctmentioning
confidence: 99%
See 1 more Smart Citation
“…This is again a semantic segmentation task. Finding borders between landuse classes is not trivial, which explains why researchers often complement their discriminative workflow with ground-based images (Srivastava et al, 2019).…”
Section: Fostering Applicationsmentioning
confidence: 99%