Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Application 2017
DOI: 10.5220/0006128904930500
|View full text |Cite
|
Sign up to set email alerts
|

Spatio-temporal Road Detection from Aerial Imagery using CNNs

Abstract: Abstract:The main goal of this paper is to detect roads from aerial imagery recorded by drones. To achieve this, we propose a modification of SegNet, a deep fully convolutional neural network for image segmentation. In order to train this neural network, we have put together a database containing videos of roads from the point of view of a small commercial drone. Additionally, we have developed an image annotation tool based on the watershed technique, in order to perform a semi-automatic labeling of the video… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 11 publications
0
6
0
Order By: Relevance
“…Remote sensing optical data obtained from UAVs or airborne/spaceborne platforms are mainly used for the extraction of roads, as described in several previous studies [ 18 , 19 , 20 , 21 , 22 ]. Meanwhile, synthetic aperture radar (SAR) data, especially high-resolution X-band SAR datasets, focus on the quality of the road.…”
Section: Methodsmentioning
confidence: 99%
“…Remote sensing optical data obtained from UAVs or airborne/spaceborne platforms are mainly used for the extraction of roads, as described in several previous studies [ 18 , 19 , 20 , 21 , 22 ]. Meanwhile, synthetic aperture radar (SAR) data, especially high-resolution X-band SAR datasets, focus on the quality of the road.…”
Section: Methodsmentioning
confidence: 99%
“…Similarly, Hu et al [36] take a FCN-like approach to extract features from the last convolutional layer at multiple scales and encode them into global image features through feature coding techniques. In [37], this task is approached from a temporal perspective by modifying SegNet to extract roads from low resolution videos captured by Unmanned Aerial Vehicles (UAVs). The model uses feature maps from the corresponding encoder stage to upsample and adds an iterative algorithm to process the temporal dimension.…”
Section: Related Workmentioning
confidence: 99%
“…In [24], this challenge is approached from a spatio-temporal perspective by adding a temporal processing block on top of existing networks, achieving accuracy levels over 90%. In [25], the authors proposed a model based on VGGNet to learn the features of road boundaries by integrating RGB images street scenes, the semantic contour and the location in a neural network for autonomous driving applications.…”
Section: Related Workmentioning
confidence: 99%