2020
DOI: 10.3390/s20020532
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning for Real-Time 3D Multi-Object Detection, Localisation, and Tracking: Application to Smart Mobility

Abstract: In core computer vision tasks, we have witnessed significant advances in object detection, localisation and tracking. However, there are currently no methods to detect, localize and track objects in road environments, and taking into account real-time constraints. In this paper, our objective is to develop a deep learning multi object detection and tracking technique applied to road smart mobility. Firstly, we propose an effective detector-based on YOLOv3 which we adapt to our context. Subsequently, to localiz… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
22
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 48 publications
(24 citation statements)
references
References 28 publications
1
22
0
1
Order By: Relevance
“…For the distance evaluation between the camera (which is mounted on the wheelchair) and the different objects in the indoor environment, we used in previous works different CNN models dedicated to depth estimation like Monodepth [ 12 ], Monodepth2 [ 1 , 44 ], and MadNET [ 17 , 45 ]. In this paper, we have carried out distance measurements by directly using the RealSense camera (without any deep learning model) because of embedded-system constraints related to the wheelchair such as: not enough GPU computational power on the Jetson board, not enough space memory, and the distance estimation of object which should be done in real time.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…For the distance evaluation between the camera (which is mounted on the wheelchair) and the different objects in the indoor environment, we used in previous works different CNN models dedicated to depth estimation like Monodepth [ 12 ], Monodepth2 [ 1 , 44 ], and MadNET [ 17 , 45 ]. In this paper, we have carried out distance measurements by directly using the RealSense camera (without any deep learning model) because of embedded-system constraints related to the wheelchair such as: not enough GPU computational power on the Jetson board, not enough space memory, and the distance estimation of object which should be done in real time.…”
Section: Resultsmentioning
confidence: 99%
“…Object detection methods based on deep-learning are among those giving the best performances on all methods. They can be divided into two main categories: 1. one-stage methods, which perform the object localization and object classification in a single network, and 2. two-stage methods, which have two separated networks for localization and classification [ 1 ]. In the first category, we find the YOLOv3 (You Only Look once) algorithm [ 2 ], in which classification is made on a predefined number of bounding boxes of given sizes at specific layers.…”
Section: State Of the Artmentioning
confidence: 99%
See 1 more Smart Citation
“…These advances in multicore architecture have enabled the use of so-called deep convolutional neural network (CNN) architectures for object detection and classification [17,18]. Mauri et al [2] mentioned that the CNN-based methods have two main categories: the first one is the one-stage methods, this one enables to perform the location and classification of objects in a single network, and the second one is the two-stage methods. The latter contains two separate networks with the purpose that each one of them performs only one task.…”
Section: Convolutional Neural Network Algorithmsmentioning
confidence: 99%
“…One of the aspects to consider for this growth has been to not limit itself only to niches as robotics and manufacturing, but also to other areas such as home automation, intelligent detection, medical image analysis, food industry, autonomous driving, among others [1]. Since the beginning, the objective of computer vision systems has been the automatic processing, analysis and interpretation of images [2], to be precise with some classic algorithms including: local descriptor [3], Haar like features [4], SIFT [5], Shape Contexts [6], Histogram of Gradients (HOG) [7] and Local Binary Patterns (LBP) [8]. In 2012, significant advances were made in image processing methods [9], one of which was the use of deep learning techniques.…”
Section: Introductionmentioning
confidence: 99%