2019
DOI: 10.1016/j.robot.2018.11.010
|View full text |Cite
|
Sign up to set email alerts
|

Traffic scene awareness for intelligent vehicles using ConvNets and stereo vision

Abstract: In this paper, we propose an efficient approach to perform recognition and 3D localization of dynamic objects on images from a stereo camera, with the goal of gaining insight into traffic scenes in urban and road environments. We rely on a deep learning framework able to simultaneously identify a broad range of entities, such as vehicles, pedestrians or cyclists, with a frame rate compatible with the strict requirements of onboard automotive applications. Stereo information is later introduced to enrich the kn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 25 publications
(21 citation statements)
references
References 37 publications
0
21
0
Order By: Relevance
“…Apart from the customary horizontal flipping, scale jittering (from −10% to 15% of the original scale) has been applied to training images to improve the robustness of the resulting model. Additionally, the repeat factor sampling proposed in [ 35 ] is used to mitigate the effects of class imbalance. The resulting deep neural network model is aimed to provide 2D bounding box detections of the objects.…”
Section: Data Mining Process and Dataset Organizationmentioning
confidence: 99%
See 1 more Smart Citation
“…Apart from the customary horizontal flipping, scale jittering (from −10% to 15% of the original scale) has been applied to training images to improve the robustness of the resulting model. Additionally, the repeat factor sampling proposed in [ 35 ] is used to mitigate the effects of class imbalance. The resulting deep neural network model is aimed to provide 2D bounding box detections of the objects.…”
Section: Data Mining Process and Dataset Organizationmentioning
confidence: 99%
“… ‘Dominant orientation’ is obtained in several steps. Firstly, the object’s yaw angle is derived using both the viewpoint estimate given by the neural network and the location computed through the pinhole camera model, as in [ 35 ]. Afterward, following the approach employed for viewpoint estimation, the 360° range of possible yaw angles is discretized into eight bins, and the bin representing the orientation of the closest object becomes the value of the “dominant orientation” parameter.…”
Section: Data Mining Process and Dataset Organizationmentioning
confidence: 99%
“…Applying the operation defined by (4) to each channel of the left color image, a total of six intensity edge maps can be obtained. The intensity edge maps are sent into three sets of downsample blocks to obtain a pyramid of intensity edge features.…”
Section: B Depth Discontinuity Aware Super-resolution Subnetworkmentioning
confidence: 99%
“…Depth estimated from stereo images has been the core information for vision-based practical applications, such as obstacle avoidance for robot navigation [1], 3D scene reconstruction for augmented and virtual reality system [2], and 3D visual object tracking and location [3], [4]. Given a pair of pre-rectified stereo images, the target of stereo matching is to accurately compute a disparity value for each pixel in the reference image.…”
Section: Introductionmentioning
confidence: 99%
“…The extended version of this paper, published in the Robotics and Autonomous Systems journal [114], is partially included in the thesis, as stated before.…”
Section: A C K N O W L E D G M E N T S / a G R A D E C I M I E N T O Smentioning
confidence: 99%