2021
DOI: 10.1051/e3sconf/202130901070
|View full text |Cite
|
Sign up to set email alerts
|

CNN Based Monocular Depth Estimation

Abstract: In several applications, such as scene interpretation and reconstruction, precise depth measurement from images is a significant challenge. Current depth estimate techniques frequently provide fuzzy, low-resolution estimates. With the use of transfer learning, this research executes a convolutional neural network for generating a high-resolution depth map from a single RGB image. With a typical encoder-decoder architecture, when initializing the encoder, we use features extracted from high-performing pre-train… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 16 publications
0
0
0
Order By: Relevance
“…Swaraja et al [30] conducted experiments using 400 homogeneous and 4600 heterogeneous images in object depth detection. The results indicated that EfficientNet exhibited lower object depth detection accuracy at higher input resolutions than ResNet50.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Swaraja et al [30] conducted experiments using 400 homogeneous and 4600 heterogeneous images in object depth detection. The results indicated that EfficientNet exhibited lower object depth detection accuracy at higher input resolutions than ResNet50.…”
Section: Methodsmentioning
confidence: 99%
“…The model will be loaded after the input image has been divided into numerous frames, and it will then be analyzed to determine an object's depth. This system uses a pre-trained model from Alhashim et al [30], which was developed using the NYU v2 dataset and the DenseNet-169 architectural encoder.…”
Section: F Depth Object Estimationmentioning
confidence: 99%
“…Among them, FPGAbased solutions offer many advantages, including lower power consumption while preserving higher inference speed, customization, and further enhancement possibilities via ASIC implementation. To mention a few, [3] applied two of the most common CNN-based network architectures, namely ResNet-50 and EfficientNet-B0, to solve the depth estimation problem. [9] implemented a depth-from-motion model using an optical flow algorithm on FPGA, which offers high speed and low resource consumption on hardware; however, the results are motion-dependent and show lower accuracy compared to more sophisticated NN-based methods.…”
Section: Related Workmentioning
confidence: 99%