Monocular Depth Perception on Microcontrollers for Edge Applications

Peluso, Valentino; Cipolletta, Antonio; Calimera, Andrea; Poggi, Matteo; Tosi, Fabio; Aleotti, Filippo; Mattoccia, Stefano

doi:10.1109/tcsvt.2021.3077395

Cited by 10 publications

(5 citation statements)

References 66 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Nowadays, to address data confidentiality issues and bandwidth limitations, the trend is to push deep learning based systems from the cloud to edge devices [15,16,17], such as Internet-of-Things (IoTs) devices, given the ever-increasing internet-connected IoTs. One of the principal advantage is that it alleviates the communication latency which is unacceptable for real-time safety-critical decisions, e.g., in autonomous driving.…”

mentioning

confidence: 99%

Investigating data representation for efficient and reliable Convolutional Neural Networks

Ruospo

Sánchez

Traiola

et al. 2021

Microprocessors and Microsystems

View full text Add to dashboard Cite

mentioning

confidence: 99%

Investigating data representation for efficient and reliable Convolutional Neural Networks

Ruospo

Sánchez

Traiola

et al. 2021

Microprocessors and Microsystems

View full text Add to dashboard Cite

“…Further, Peluso et al [47] (2022) propose an efficient monocular depth estimation method for microcontrollers based on a lightweight CNN with a shallow pyramidal architecture. By using optimization strategies to perform calculations on 8-bit data and mapping the high-level description of the network to low-level layers optimized for the target microcontroller architecture, experimental results show that it is possible to obtain depth estimates sufficiently accurate for objects with large overlap areas.…”

Section: Depth Estimationmentioning

confidence: 99%

Application of Machine Vision Techniques in Low-Cost Devices to Improve Efficiency in Precision Farming

Jaramillo-Hernández,

Julian,

Marco-Detchart

et al. 2024

Sensors

View full text Add to dashboard Cite

In the context of recent technological advancements driven by distributed work and open-source resources, computer vision stands out as an innovative force, transforming how machines interact with and comprehend the visual world around us. This work conceives, designs, implements, and operates a computer vision and artificial intelligence method for object detection with integrated depth estimation. With applications ranging from autonomous fruit-harvesting systems to phenotyping tasks, the proposed Depth Object Detector (DOD) is trained and evaluated using the Microsoft Common Objects in Context dataset and the MinneApple dataset for object and fruit detection, respectively. The DOD is benchmarked against current state-of-the-art models. The results demonstrate the proposed method’s efficiency for operation on embedded systems, with a favorable balance between accuracy and speed, making it well suited for real-time applications on edge devices in the context of the Internet of things.

show abstract

“…The model is aimed to overcome cutting-edge design difficulties, which are often deep and complicated, requiring dedicated hardware for their execution such as high-end and power-hungry GPUs. Peluso et al., on their side, propose [ 8 , 10 ]. The first work presents a framework for optimizing inference performance in order to produce a low-latency/high-throughput code.…”

Section: Related Workmentioning

confidence: 99%

“…Regarding monocular depth estimation, only a few works propose a solution for porting such complex tasks on low-resource platforms. There are two main approaches: [ 8 , 9 , 10 ] that focus on MDE on microcontroller and ARM-powered devices without taking into account the inference frequency and [ 11 , 12 , 13 ], which analyze the inference performances of MDE on low-power embedded GPUs. Furthermore, MDE methods are usually trained in supervised learning strategies on indoor and outdoor terrestrial datasets such as [ 14 , 15 ].…”

Section: Introductionmentioning

confidence: 99%

Lightweight and Energy-Aware Monocular Depth Estimation Models for IoT Embedded Devices: Challenges and Performances in Terrestrial and Underwater Scenarios

Papa

Mattia

Russo

et al. 2023

Sensors

View full text Add to dashboard Cite

The knowledge of environmental depth is essential in multiple robotics and computer vision tasks for both terrestrial and underwater scenarios. Moreover, the hardware on which this technology runs, generally IoT and embedded devices, are limited in terms of power consumption, and therefore, models with a low-energy footprint are required to be designed. Recent works aim at enabling depth perception using single RGB images on deep architectures, such as convolutional neural networks and vision transformers, which are generally unsuitable for real-time inferences on low-power embedded hardware. Moreover, such architectures are trained to estimate depth maps mainly on terrestrial scenarios due to the scarcity of underwater depth data. Purposely, we present two lightweight architectures based on optimized MobileNetV3 encoders and a specifically designed decoder to achieve fast inferences and accurate estimations over embedded devices, a feasibility study to predict depth maps over underwater scenarios, and an energy assessment to understand which is the effective energy consumption during the inference. Precisely, we propose the MobileNetV3S75 configuration to infer on the 32-bit ARM CPU and the MobileNetV3LMin for the 8-bit Edge TPU hardware. In underwater settings, the proposed design achieves comparable estimations with fast inference performances compared to state-of-the-art methods. Moreover, we statistically proved that the architecture of the models has an impact on the energy footprint in terms of Watts required by the device during the inference. Then, the proposed architectures would be considered to be a promising approach for real-time monocular depth estimation by offering the best trade-off between inference performances, estimation error and energy consumption, with the aim of improving the environment perception for underwater drones, lightweight robots and Internet of things.

show abstract

Monocular Depth Perception on Microcontrollers for Edge Applications

Cited by 10 publications

References 66 publications

Investigating data representation for efficient and reliable Convolutional Neural Networks

Investigating data representation for efficient and reliable Convolutional Neural Networks

Application of Machine Vision Techniques in Low-Cost Devices to Improve Efficiency in Precision Farming

Lightweight and Energy-Aware Monocular Depth Estimation Models for IoT Embedded Devices: Challenges and Performances in Terrestrial and Underwater Scenarios

Contact Info

Product

Resources

About