2021
DOI: 10.48550/arxiv.2107.02156
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Do Different Tracking Tasks Require Different Appearance Models?

Abstract: Tracking objects of interest in a video is one of the most popular and widely applicable problems in computer vision. However, with the years, a Cambrian explosion of use cases and benchmarks has fragmented the problem in a multitude of different experimental setups. As a consequence, the literature has fragmented too, and now the novel approaches proposed by the community are usually specialised to fit only one specific setup. To understand to what extent this specialisation is actually necessary, in this wor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 105 publications
(135 reference statements)
0
5
0
Order By: Relevance
“…The implementation source codes of ByteTrack [46], Deep OC_SORT [47], BoTSORT [48], OC_SORT [49], and Strong_SORT [50] come from [51], while TransTrack [52] and Uni-Track [53] adopt the authors' original codes, and DeepSORT uses an implementation from [54]. These methodologies all rely on YOLOX-x, as we know that the anchor-free weights file (yolox_x.pth, 756 MB) outperformed all other detectors and facilitated a relatively fair comparison among various tracking and association processes.…”
Section: Comparison With Other Methodsmentioning
confidence: 99%
“…The implementation source codes of ByteTrack [46], Deep OC_SORT [47], BoTSORT [48], OC_SORT [49], and Strong_SORT [50] come from [51], while TransTrack [52] and Uni-Track [53] adopt the authors' original codes, and DeepSORT uses an implementation from [54]. These methodologies all rely on YOLOX-x, as we know that the anchor-free weights file (yolox_x.pth, 756 MB) outperformed all other detectors and facilitated a relatively fair comparison among various tracking and association processes.…”
Section: Comparison With Other Methodsmentioning
confidence: 99%
“…These five methods were retrained on 26.6 K images of CrowdHuman, MOT17, Cityperson and ETHZ datasets [7]. TransTrack [24] and Uni-Track [31] are in the authors' original codes, and DeepSORT uses an implementation from [41]. These three methodologies rely on vanilla YOLOX-x (yolox_x.pth, 756 MB).…”
Section: Comparision Of Other Sota Methodsmentioning
confidence: 99%
“…GMPHD_MAF [23] takes instance segmentation results as input and performs data association based on the Gaussian mixture probability hypothesis density filter for position and motion, and kernelized correlation filter for appearance. UniTrack [26] proposes a unified framework for solving five different tasks. It consists of an appearance model that is task agnostic, and several heads which are used for solving each task and do not require training.…”
Section: Multiple Object Tracking and Segmentationmentioning
confidence: 99%