Encoder–Decoder Structure Fusing Depth Information for Outdoor Semantic Segmentation

Chen, Songnan; Tang, Mengxia; Dong, Ruifang; Kan, Jiangming

doi:10.3390/app13179924

Cited by 2 publications

(2 citation statements)

References 40 publications

“…The output results were then resized to the original image size. As shown in Figure 6, from left to right, we have the input single image, from Chen et al [26], and our method. Our network's predictions enable a clearer distinction between trees in outdoor scenes compared with existing methods, and it exhibits better robustness in predicting under varying lighting conditions.…”

Section: Depth Estimation Comparison (mentioning)

confidence: 99%

“…Pivotal contributions in this domain include Lina Liu et al's [24] incorporation of domain separation to address illumination variations between day and night images, as well as Michael et al's [25] application of wavelet decomposition for the efficient generation of depth maps. Chen et al predicted depth maps to facilitate forest scene reconstruction through the utilization of a single image, additionally providing forecasts for DBH [26]. Nevertheless, the current methods of forest scene 3D reconstruction from a single image still suffer from critical issues such as low reconstruction accuracy in the forest scene.…”

Section: Introduction (mentioning)

confidence: 99%

CGAN-Based Forest Scene 3D Reconstruction from a Single Image

Li, Kan

2024

Forests

Self Cite

Forest scene 3D reconstruction serves as the fundamental basis for crucial applications such as forest resource inventory, forestry 3D visualization, and the perceptual capabilities of intelligent forestry robots in operational environments. However, traditional 3D reconstruction methods like LiDAR present challenges primarily because of their lack of portability. Additionally, they encounter complexities related to feature point extraction and matching within multi-view stereo vision sensors. In this research, we propose a new method that not only reconstructs the forest environment but also performs a more detailed tree reconstruction in the scene using conditional generative adversarial networks (CGANs) based on a single RGB image. Firstly, we introduced a depth estimation network based on a CGAN. This network aims to reconstruct forest scenes from images and has demonstrated remarkable performance in accurately reconstructing intricate outdoor environments. Subsequently, we designed a new tree silhouette depth map to represent the tree’s shape as derived from the tree prediction network. This network aims to accomplish a detailed 3D reconstruction of individual trees masked by instance segmentation. Our approach underwent validation using the Cityscapes and Make3D outdoor datasets and exhibited exceptional performance compared with state-of-the-art methods, such as GCNDepth. It achieved a relative error as low as 8% (with an absolute error of 1.76 cm) in estimating diameter at breast height (DBH). Remarkably, our method outperforms existing approaches for single-image reconstruction. It stands as a cost-effective and user-friendly alternative to conventional forest survey methods like LiDAR and SFM techniques. The significance of our method lies in its contribution to technical support, enabling the efficient and detailed utilization of 3D forest scene reconstruction for various applications.
