2021
DOI: 10.48550/arxiv.2104.11452
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SportsCap: Monocular 3D Human Motion Capture and Fine-grained Understanding in Challenging Sports Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 56 publications
0
7
0
Order By: Relevance
“…Our work can be classified under AQA and SA, which involves the computer vision-based quantification of the quality of movements and actions. Works in AQA and SA have mainly been focused on domains like physiotherapy [6,19,25,33,36], Olympic sports [3,24,28,35,39,41], various types of skills [5,20,26,38]. However, workout form assessment, especially, in real-world conditions, has not received much attention.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Our work can be classified under AQA and SA, which involves the computer vision-based quantification of the quality of movements and actions. Works in AQA and SA have mainly been focused on domains like physiotherapy [6,19,25,33,36], Olympic sports [3,24,28,35,39,41], various types of skills [5,20,26,38]. However, workout form assessment, especially, in real-world conditions, has not received much attention.…”
Section: Related Workmentioning
confidence: 99%
“…This is especially prevalent in non-daily action classes like fitness and sports domains. This can be mitigated, for example, by annotating domain-specific datasets [3], but that requires a considerable amount of manual annotation efforts, financial resources, and 3D annotations can only be obtained in controlled conditions. Therefore, we propose to learn domain-specific pose-sensitive representations from unlabeled videos, which can be finetuned using only a small labeled dataset.…”
Section: Related Workmentioning
confidence: 99%
“…The highend solutions [9,12,13,24] adopt studio-setup with dense cameras to produce high-quality reconstruction and surface motion, but the synchronized and calibrated multi-camera systems are both difficult to deploy and expensive. The recent low-end approaches [10,16,21,66] enable light-weight performance capture under the single-view setup or even hand-held capture setup or drone-based capture setup [68]. However, these methods require a naked human model or pre-scanned template.…”
Section: Related Workmentioning
confidence: 99%
“…The high-end solutions [Dou et al, 2017;Joo et al, 2018;Chen et al, 2019] require studio-setup with the dense view of cameras and a controlled imaging environment to generate high-fidelity reconstruction and high-quality surface motion, which are expensive and difficult to deploy. The recent low-end approaches [Xiang et al, 2019;Chen et al, 2021] enable light-weight performance capture under the single-view setup. However, these methods require a naked human model or pre-scanned template.…”
Section: Related Workmentioning
confidence: 99%