2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) 2021
DOI: 10.1109/iccvw54120.2021.00108
|View full text |Cite
|
Sign up to set email alerts
|

MonoCInIS: Camera Independent Monocular 3D Object Detection using Instance Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 17 publications
(5 citation statements)
references
References 56 publications
0
2
0
Order By: Relevance
“…The association stage of the tracked objects depends on quasi-dense similarity learning to identify objects of interest in different positions and views based on visual features. MonoCInIS [111] presents a method that utilizes instance segmentation in order to estimate the object's pose. The proposed method is camera-independent in order to account for variable camera perspectives.…”
Section: D-data-based Techniquesmentioning
confidence: 99%
“…The association stage of the tracked objects depends on quasi-dense similarity learning to identify objects of interest in different positions and views based on visual features. MonoCInIS [111] presents a method that utilizes instance segmentation in order to estimate the object's pose. The proposed method is camera-independent in order to account for variable camera perspectives.…”
Section: D-data-based Techniquesmentioning
confidence: 99%
“…Monoloco [4] tackles the fundamentally ill-posed problem of 3D human localization from monocular RGB images by predicting confidence intervals through a loss function based on the Laplace distribution with a lightweight 3D localization neural network. MonoIS [6] provides a category-level pose estimation method based on instance segmentation, using camera-independent geometric reasoning to compare with the varying camera viewpoints and intrinsic of different datasets. Additionally, utilizing segmentation masks results in enhanced outcomes for this strategy [17], [18].…”
Section: A Monocular-based Approach For Object Detection and 3d Local...mentioning
confidence: 99%
“…Monoloco achieved outstanding results on the pedestrian localization task with the KITTI-3D dataset benchmark, making real-time predictions of 3D position with a monocular camera as the only sensor. Recent work has exploited the rich semantic information from segmentation masks to leverage 2D object detection and use for 3D position prediction such as in [6]. The accuracy of these methods is limited to the training dataset such as KITTI-3D [7] and nuScenes [8].…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, these representation models impose challenges for generalizing the object shapes to the real‐world scene (Zakharov et al., 2020). Finally, object segmentation information with masks is helpful in shape reconstruction (Beker et al., 2020) and a 3D object location (Heylen et al., 2021). Likewise, these prior‐based methods suffer from the extra task network, domain shift, and 3D GT data as supervisory signals.…”
Section: Introductionmentioning
confidence: 99%