2021
DOI: 10.1109/tpami.2019.2957464
|View full text |Cite
|
Sign up to set email alerts
|

GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild

Abstract: In this work, we introduce a large high-diversity database for generic object tracking, called GOT-10k. GOT-10k is backboned by the semantic hierarchy of WordNet [1]. It populates a majority of 563 object classes and 87 motion patterns in real-world, resulting in a scale of over 10 thousand video segments and 1.5 million bounding boxes. To our knowledge, GOT-10k is by far the richest motion trajectory dataset, and its coverage of object classes is more than a magnitude wider than similar scale counterparts [20… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
863
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 1,178 publications
(868 citation statements)
references
References 91 publications
(208 reference statements)
5
863
0
Order By: Relevance
“…The benchmarks are divided into two categories, the validation set including OTB2013 [43] and OTB2015 [44], and the testing set including VOT2017 [20], VOT2018 [19], and GOT10k [17]. We introduce the implementation details in the next sub-section and then we compare the proposed tracker to the state-ofthe-art trackers.…”
Section: Resultsmentioning
confidence: 99%
“…The benchmarks are divided into two categories, the validation set including OTB2013 [43] and OTB2015 [44], and the testing set including VOT2017 [20], VOT2018 [19], and GOT10k [17]. We introduce the implementation details in the next sub-section and then we compare the proposed tracker to the state-ofthe-art trackers.…”
Section: Resultsmentioning
confidence: 99%
“…To obtain tracking demonstrations, we ran SiamFC on the training set of GOT-10k dataset [18]. The implemented SiamFC was trained on the ImageNet VID dataset [8].…”
Section: Implementation Detailsmentioning
confidence: 99%
“…Training Dataset. To train A3CT and A3CTD we leveraged of the training set of the GOT-10k dataset [18]. This is a large-scale dataset containing 9335 training videos, 180 validation videos and other 180 videos for testing.…”
Section: Implementation Detailsmentioning
confidence: 99%
See 1 more Smart Citation
“…First column: appearance target. Because our method does not require keypoint-annotations or class information, it can be readily applied on video datasets [23,14]. Besides intra-species analogies, our approach can also imagine inter-species analogies: How does a cow look like in a pose specified by a horse?…”
Section: Introductionmentioning
confidence: 99%