2021
DOI: 10.1109/jstars.2021.3069909
|View full text |Cite
|
Sign up to set email alerts
|

UVid-Net: Enhanced Semantic Segmentation of UAV Aerial Videos by Embedding Temporal Information

Abstract: Semantic segmentation of aerial videos has been extensively used for decision making in monitoring environmental changes, urban planning, and disaster management. The reliability of these decision support systems is dependent on the accuracy of the video semantic segmentation algorithms. The existing CNN based video semantic segmentation methods have enhanced the image semantic segmentation methods by incorporating an additional module such as LSTM or optical flow for computing temporal dynamics of the video w… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 51 publications
(16 citation statements)
references
References 54 publications
0
16
0
Order By: Relevance
“…Based on residual network, it changed the dilated convolution kernels before extracting the correlations between geophysical objects, thus improving the segmentation accuracy and used a pixel-level method to achieve semantic segmentation. Girisha et al [ 19 ] proposed an enhanced encoder–decoder based CNN architecture (UVid-Net) for unmanned aerial vehicle (UAV) video semantic segmentation. This advanced algorithm greatly enhanced the accuracy of the localization.…”
Section: Related Workmentioning
confidence: 99%
“…Based on residual network, it changed the dilated convolution kernels before extracting the correlations between geophysical objects, thus improving the segmentation accuracy and used a pixel-level method to achieve semantic segmentation. Girisha et al [ 19 ] proposed an enhanced encoder–decoder based CNN architecture (UVid-Net) for unmanned aerial vehicle (UAV) video semantic segmentation. This advanced algorithm greatly enhanced the accuracy of the localization.…”
Section: Related Workmentioning
confidence: 99%
“…Moreover, they explore the performance of semantic segmentation algorithms for aerial videos achieved with the Fully Convolution Networks (FCN) and U-net architectures. Girisha et al [119] proposed an enhanced encoder-decoderbased CNN architecture (UVid-Net) for UAV video semantic segmentation. The encoder can embed temporal information in terms of temporally consistent labelling.…”
Section: Semantic Segmentationmentioning
confidence: 99%
“…In 2021, Girisha et al [69] have an improved encoderdecoder-based CNN architecture termed Uvid-Net for semantic segmentation from UAV video frames. is architecture was used to incorporate the temporal smoothness, which has captured the correlation among the sequence of frames using multibranch CNNs.…”
Section: Literature Review On State-of-the-artmentioning
confidence: 99%