2020
DOI: 10.48550/arxiv.2010.00067
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

GCNNMatch: Graph Convolutional Neural Networks for Multi-Object Tracking via Sinkhorn Normalization

Abstract: This paper proposes a novel method for online Multi-Object Tracking (MOT) using Graph Convolutional Neural Network (GCNN) based feature extraction and end-toend feature matching for object association. The Graph based approach incorporates both appearance and geometry of objects at past frames as well as the current frame into the task of feature learning. This new paradigm enables the network to leverage the "context" information of the geometry of objects and allows us to model the interactions among the fea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
31
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(31 citation statements)
references
References 35 publications
0
31
0
Order By: Relevance
“….545 ( 65) 3010 (71) .618 (33) .659 (22) .698 (50) .555 (49) .411 (32) .348 (30) .498 (18) UNS20regress…”
Section: A Extended Results: Mot17mentioning
confidence: 99%
See 1 more Smart Citation
“….545 ( 65) 3010 (71) .618 (33) .659 (22) .698 (50) .555 (49) .411 (32) .348 (30) .498 (18) UNS20regress…”
Section: A Extended Results: Mot17mentioning
confidence: 99%
“….568 (51) 1320 (10) .573 (50) .583 (50) .681 (55) .562 (43) .410 (35) .341 (31) .501 (16) EMT .556 (58) 1361 (11) .558 (57) .571 (59) .667 (73) .539 (55) .406 (40) .333 (32) .499 (17) ALBOD .569 (50) 2011 (31) .572 (51) .587 (47) .721 (37) .576 (36) .407 (37) .328 (33) .456 (39) ISE MOT17R…”
Section: A Extended Results: Mot17mentioning
confidence: 99%
“…GNNs have been applied in point feature matching [22], [37], gesture learning [45], video moment retrieval [46], visual question answering [47] or single-camera single-object tracking [48]. Regarding single-camera multi-object tracking, [49] proposes the use of GNN to extract node and edge embeddings, but computing similarity using the cosine distance and perform data association by using a linear assignment, i.e., Hungarian Algorithm. The first approach of performing feature and similarity learning jointly for associating detections was introduced in [34] by proposing a time-aware MPN variation: detecting associations across time to perform batch-based/offline singlecamera multi-object tracking.…”
Section: Graph Neural Networkmentioning
confidence: 99%
“…MPNTracker [4] formulates sequences as graphs and designs a differentiable message passing network to predict the score for each box link between frames. Li et al [17] and Papakis et al [19] use a graph neural network to model appearance and motion (geometric) features and produce the similarities between tracklets and detections. These parametric association modules are trained based on appearance features and motion features.…”
Section: Related Workmentioning
confidence: 99%
“…Human-designed policies are sub-optimal as it is difficult for them to take full advantage of both appearance and motion cues. Beyond human-designed policies, more recent arts [4,17,28,19] attempt to learn association knowledge directly from data with a parametric model, i.e., s ij = K θ (i, j, F a , F m ). As illustrated in Fig.…”
Section: Definition Of Association Knowledgementioning
confidence: 99%