Supervised Classification of Multisensor Remotely Sensed Images Using a Deep Learning Framework

Piramanayagam, Sankaranarayanan; Saber, Eli; Schwartzkopf, W.; Koehler, Frederick W.

doi:10.3390/rs10091429

Cited by 80 publications

(50 citation statements)

References 48 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The reason why we choose Infrared, Red, and Green band data is mainly for consistency with the ISPRS Vaihingen dataset. Besides, this is also for fair comparison with models that use IRRG data only, like RIT_2 [47].…”

Section: Dataset Descriptionmentioning

confidence: 99%

Context Aggregation Network for Semantic Labeling in Aerial Images

Cheng

Yang

Wang³

et al. 2019

Remote Sensing

View full text Add to dashboard Cite

Semantic labeling for high resolution aerial images is a fundamental and necessary task in remote sensing image analysis. It is widely used in land-use surveys, change detection, and environmental protection. Recent researches reveal the superiority of Convolutional Neural Networks (CNNs) in this task. However, multi-scale object recognition and accurate object localization are two major problems for semantic labeling methods based on CNNs in high resolution aerial images. To handle these problems, we design a Context Fuse Module, which is composed of parallel convolutional layers with kernels of different sizes and a global pooling branch, to aggregate context information at multiple scales. We propose an Attention Mix Module, which utilizes a channel-wise attention mechanism to combine multi-level features for higher localization accuracy. We further employ a Residual Convolutional Module to refine features in all feature levels. Based on these modules, we construct a new end-to-end network for semantic labeling in aerial images. We evaluate the proposed network on the ISPRS Vaihingen and Potsdam datasets. Experimental results demonstrate that our network outperforms other competitors on both datasets with only raw image data.

show abstract

Section: Dataset Descriptionmentioning

confidence: 99%

Context Aggregation Network for Semantic Labeling in Aerial Images

Cheng

Yang

Wang³

et al. 2019

Remote Sensing

View full text Add to dashboard Cite

show abstract

“…In recent years, deep learning methods have been broadly utilized in various remote sensing image-based applications, including object detection [2,3,20], scene classification [21,22], land cover, and land use mapping [23,24]. Since it was proposed in 2014, deep convolutional neural network (CNN)-based semantic segmentation algorithms [25] have been applied to many pixel-wise remote sensing image analysis tasks, such as road extraction, building extraction, urban land use classification, maritime semantic labeling, vehicle extraction, damage mapping, weed mapping, and other land cover mapping tasks [5,6,[26][27][28][29][30][31]. Several recent studies used semantic segmentation methods for building extraction from remote sensing images [9][10][11][12][32][33][34][35][36][37][38].…”

Section: Introductionmentioning

confidence: 99%

Semantic Segmentation-Based Building Footprint Extraction Using Very High-Resolution Satellite Images and Multi-Source GIS Data

Fang

et al. 2019

Remote Sensing

189

View full text Add to dashboard Cite

Automatic extraction of building footprints from high-resolution satellite imagery has become an important and challenging research issue receiving greater attention. Many recent studies have explored different deep learning-based semantic segmentation methods for improving the accuracy of building extraction. Although they record substantial land cover and land use information (e.g., buildings, roads, water, etc.), public geographic information system (GIS) map datasets have rarely been utilized to improve building extraction results in existing studies. In this research, we propose a U-Net-based semantic segmentation method for the extraction of building footprints from high-resolution multispectral satellite images using the SpaceNet building dataset provided in the DeepGlobe Satellite Challenge of IEEE Conference on Computer Vision and Pattern Recognition 2018 (CVPR 2018). We explore the potential of multiple public GIS map datasets (OpenStreetMap, Google Maps, and MapWorld) through integration with the WorldView-3 satellite datasets in four cities (Las Vegas, Paris, Shanghai, and Khartoum). Several strategies are designed and combined with the U-Net–based semantic segmentation model, including data augmentation, post-processing, and integration of the GIS map data and satellite images. The proposed method achieves a total F1-score of 0.704, which is an improvement of 1.1% to 12.5% compared with the top three solutions in the SpaceNet Building Detection Competition and 3.0% to 9.2% compared with the standard U-Net–based method. Moreover, the effect of each proposed strategy and the possible reasons for the building footprint extraction results are analyzed substantially considering the actual situation of the four cities.

show abstract

“…() and Piramanayagam et al. ()) which can only be computed after extracting the DEM. The generated building map can be either building regions (such as a binary image) or segmented laser points.…”

Section: Introduction and Previous Researchmentioning

confidence: 99%

“…The main disadvantage, however, lies in an expensive training effort. Nevertheless, an nDSM is an essential input in processing (for example, as used by Marmanis et al (2018) and Piramanayagam et al (2018)) which can only be computed after extracting the DEM. The generated building map can be either building regions (such as a binary image) or segmented laser points.…”

Section: Introduction and Previous Researchmentioning

confidence: 99%

Building detection and regularisation using DSM and imagery information

Mousa

Helmholz

Belton

et al. 2019

The Photogrammetric Record

View full text Add to dashboard Cite

An automatic method for the regularisation of building outlines is presented, utilising a combination of data‐ and model‐driven approaches to provide a robust solution. The core part of the method includes a novel data‐driven approach to generate approximate building polygons from a list of given boundary points. The algorithm iteratively calculates and stores likelihood values between an arbitrary starting boundary point and each of the following boundary points using a function derived from the geometrical properties of a building. As a preprocessing step, building segments have to be identified using a robust algorithm for the extraction of a digital elevation model. Evaluation results on a challenging dataset achieved an average correctness of 96·3% and 95·7% for building detection and regularisation, respectively.

show abstract

Supervised Classification of Multisensor Remotely Sensed Images Using a Deep Learning Framework

Cited by 80 publications

References 48 publications

Context Aggregation Network for Semantic Labeling in Aerial Images

Context Aggregation Network for Semantic Labeling in Aerial Images

Semantic Segmentation-Based Building Footprint Extraction Using Very High-Resolution Satellite Images and Multi-Source GIS Data

Building detection and regularisation using DSM and imagery information

Contact Info

Product

Resources

About