2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022
DOI: 10.1109/cvprw56347.2022.00289

Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous Vehicles

Cited by 9 publications (4 citation statements)
References 18 publications
“…In scene graph generation (SGG), the traditional two-stage paradigm involves object detection and pairwise predicate estimation [49][50][51][52][53][54][55][56][57][58][59][60]. Recent advancements include knowledge graph embeddings, graph-based architectures, energy-based models, and linguistic supervision [56][61][62][63][64][65][66][67][68][69]. To address challenges like long-tailed distribution and visually irrelevant predicates, the field has seen a pivot towards panoptic segmentation-based SGG, inspired by the simultaneous generation of scene graphs and semantic segmentation masks [34].…”
Section: Related Work (mentioning)
confidence: 99%
“…In scene graph generation (SGG), the traditional two-stage paradigm involves object detection and pairwise predicate estimation [49][50][51][52][53][54][55][56][57][58][59][60]. Recent advancements include knowledge graph embeddings, graph-based architectures, energy-based models, and linguistic supervision [56][61][62][63][64][65][66][67][68][69]. To address challenges like long-tailed distribution and visually irrelevant predicates, the field has seen a pivot towards panoptic segmentation-based SGG, inspired by the simultaneous generation of scene graphs and semantic segmentation masks [34].…”
Section: Related Work (mentioning)
confidence: 99%
“…Multiple cameras often have shooting coverage areas in the spatial distribution, which ensures that there are no blind spots in monitoring and can continuously track target objects [23]. To automatically determine the target in the next camera's field of vision, it is necessary to match the target in the overlapping area of the camera.…”
Section: B. Construction of a Multi-Camera Positioning System (mentioning)
confidence: 99%
“…Unlike conventional action recognition methods that focus on identifying individual actions, GAR aims to classify the actions of a group of people in a given video clip as a whole. It requires a deeper understanding of the interactions between multiple actors, including accurate localization of actors and modeling their spatiotemporal relationships [1][2][3][4][5][6][7][8]. As a result, GAR poses fundamental challenges that must be addressed to develop practical solutions for this problem.…”
Section: Introduction (mentioning)
confidence: 99%