2020
DOI: 10.5194/isprs-annals-v-2-2020-321-2020
|View full text |Cite
|
Sign up to set email alerts
|

Y-Shaped Convolutional Neural Network for 3d Roof Elements Extraction to Reconstruct Building Models From a Single Aerial Image

Abstract: Abstract. Fast and efficient detection and reconstruction of buildings have become essential in real-time applications such as navigation, 3D rendering, augmented reality, and 3D smart cities. In this study, a modern Deep Learning (DL)-based framework is proposed for automatic detection, localization, and height estimation of buildings, simultaneously, from a single aerial image. The proposed framework is based on a Y-shaped Convolutional Neural Network (Y-Net) which includes one encoder and two decoders. The … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 28 publications
0
11
0
Order By: Relevance
“…For [AAT19], it outputs 43.4% completeness and 4% correctness for just roof ridges (their completeness and correctness is higher when you also consider the building footprint pixels). In [AAH20], they improved results to 57.7% completeness and 81.3% correctness for roof ridges (using the same metrics as [AAT19]). At satellite‐level resolutions, the correctness and completeness term they provide does not seem suitable.…”
Section: Implementation and Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…For [AAT19], it outputs 43.4% completeness and 4% correctness for just roof ridges (their completeness and correctness is higher when you also consider the building footprint pixels). In [AAH20], they improved results to 57.7% completeness and 81.3% correctness for roof ridges (using the same metrics as [AAT19]). At satellite‐level resolutions, the correctness and completeness term they provide does not seem suitable.…”
Section: Implementation and Resultsmentioning
confidence: 99%
“…Ywata et al [YDSdO21] introduce a method to extract building roof boundaries in object space by integrating a high‐resolution aerial images stereo pair and three‐dimensional roof models reconstructed from LiDAR data. In [AAT19; AAH20], they work on a deep learning‐based approach to detect and reconstruct roof parts of buildings from a single image. However, they require high resolution aerial images and annotate the dataset for roof ridges and building boundaries.…”
Section: Related Workmentioning
confidence: 99%
“…Within the algorithm in (Alidoost et al, 2020), a Y-shaped CNN is applied to detect eave, ridge, and hip lines in a single aerial image. In a post-processing step outside the neural network, eave lines are used to subdivide building footprints into individual roof areas.…”
Section: Related Workmentioning
confidence: 99%
“…Bauchet and Lafarge [2020] adopt a kinetic data structure to partition the 3D space into convex polyhedra from which the underlying surface mesh of the input point cloud can be extracted. Another category of papers use deep learning [Alidoost et al 2020;Yu et al 2021;Zeng et al 2018; to directly output the reconstruction of 3D roof structures. However, these methods do not propose a solution for enforcing geometric constraints, while in our work, we mainly focus on enforcing geometric constraints in an optimization-based formulation during interactive roof reconstruction.…”
Section: Roof Reconstructionmentioning
confidence: 99%