2021
DOI: 10.1109/tip.2021.3113169
|View full text |Cite
|
Sign up to set email alerts
|

Tracking Beyond Detection: Learning a Global Response Map for End-to-End Multi-Object Tracking

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 15 publications
(5 citation statements)
references
References 56 publications
0
5
0
Order By: Relevance
“…In recent years, with the advancement of deep learning and object detection, online tracking has attracted more and more attention. On contrary to offline methods, online methods usually adopt the Hungarian algorithm for data association, but focus on the joint learning of object detection and some useful priors, such as object motions [1], [7], [35], appearance features [4], [36], [37], occlusion maps [36], object poses [38] and so on. However, except for the annotation of box and category ID, extra annotations are required for the learning of these priors, e.g., object identity for appearance feature learning.…”
Section: A Multi-object Trackingmentioning
confidence: 99%
“…In recent years, with the advancement of deep learning and object detection, online tracking has attracted more and more attention. On contrary to offline methods, online methods usually adopt the Hungarian algorithm for data association, but focus on the joint learning of object detection and some useful priors, such as object motions [1], [7], [35], appearance features [4], [36], [37], occlusion maps [36], object poses [38] and so on. However, except for the annotation of box and category ID, extra annotations are required for the learning of these priors, e.g., object identity for appearance feature learning.…”
Section: A Multi-object Trackingmentioning
confidence: 99%
“…In general, the existing MOT methods either follow the tracking-by-detection [2] or tracking-by-regression [39,40,59], paradigm. The former methods first detect objects in each video frame and then associate detections between adjacent frames to create individual object tracks over time.…”
Section: Introductionmentioning
confidence: 99%
“…To forecast the future location and apply the intersection over union and detection in the computation of association costs, Gao et al [22] dynamically introduce sub-networks for each instance of a person. The detection is used as the positive sample and the areas around it as the negative sample by the authors of [23] when employing the discriminative appearance learning approach for each track. Additionally, they employ spatiotemporal matching based on item size and position, and they multiply these three measurements together along with the pair-wise cost Liu et al [24] to achieve high-performance online tracking, the suggested solution is based on the Gaussian mixture probability hypothesis density (GMPHD) filter, a hierarchical data association (HDA), and a mask-based affinity fusion (MAF) model.…”
Section: Introductionmentioning
confidence: 99%