2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) 2017
DOI: 10.1109/dicta.2017.8227494
|View full text |Cite
|
Sign up to set email alerts
|

TenniSet: A Dataset for Dense Fine-Grained Event Recognition, Localisation and Description

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 36 publications
0
5
0
Order By: Relevance
“…TenniSet [195] comprises five tennis videos of 2012 London Olympic matches from YouTube and six categories of events are considered, such as set, hit and serve. The time boundary of each event is labeled, therefore, it can be used for both recognition and localization.…”
Section: E Tennismentioning
confidence: 99%
See 2 more Smart Citations
“…TenniSet [195] comprises five tennis videos of 2012 London Olympic matches from YouTube and six categories of events are considered, such as set, hit and serve. The time boundary of each event is labeled, therefore, it can be used for both recognition and localization.…”
Section: E Tennismentioning
confidence: 99%
“…One recent work -Temporal Query Networks (TQN) [283] combines 3D CNNs and transformers. Specifically, 3D CNNs are used as the backbone to extract video features and transformers are adopted as decoders, i.e., given a query, the transformers [190] Lightweight 3D [280] 2022 90.9 TenniSet [195] Two-stream [195] 2017 81.0 output a response, where the queries are texts like the number of flips for diving and the responses are the corresponding attributes, such as a number or a label. The transformer-based decoder models the relevance among visual features, queries and responses.…”
Section: B Deep Modelsmentioning
confidence: 99%
See 1 more Smart Citation
“…• Comparing to the table tennis datasets [14,51], P 2 A has an adequate sample size (10 times ∼ 200 times) and more subtle set of categories. • Comparing to the large datasets in other sports domain [6,38,42,45], P 2 A focuses on the dense (5 times ∼ 50 times) and fastmoving (around 0.1 times) actions.…”
Section: Dataset Analysismentioning
confidence: 99%
“…THETIS [256] includes 1,980 self-recorded videos of 12 tennis actions with RGB, depth, 2D skeleton, and 3D skeleton videos, which can be used for multiple types of action recognition models. TenniSet [257] contains five Olympic tennis match videos with six labeled event categories and textural descriptions, making it suitable for both recognition, localization, and action retrieval tasks.…”
Section: E Tennismentioning
confidence: 99%