2023
DOI: 10.1109/tmm.2023.3235300
|View full text |Cite
|
Sign up to set email alerts
|

Delving Deep Into One-Shot Skeleton-Based Action Recognition With Diverse Occlusions

Abstract: To integrate action recognition methods into autonomous robotic systems, it is crucial to consider adverse situations involving target occlusions. Such a scenario, despite its practical relevance, is rarely addressed in existing selfsupervised skeleton-based action recognition methods. To empower robots with the capacity to address occlusion, we propose a simple and effective method. We first pre-train using occluded skeleton sequences, then use k-means clustering (KMeans) on sequence embeddings to group seman… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(4 citation statements)
references
References 91 publications
0
4
0
Order By: Relevance
“…Te results are compared with those of similar approaches used in recent years as shown in Table 3. Te method can also compete with deep learningbased methods that have emerged in recent years for action recognition [101][102][103].…”
Section: Comparisonmentioning
confidence: 99%
“…Te results are compared with those of similar approaches used in recent years as shown in Table 3. Te method can also compete with deep learningbased methods that have emerged in recent years for action recognition [101][102][103].…”
Section: Comparisonmentioning
confidence: 99%
“…Additionally, it incorporates a graph neural network (GNN) to improve performance by considering the relationships between samples. Peng et al [28] discuss skeleton-based one-shot action recognition (SOAR), which explicitly addresses occlusions. They generate diverse forms of occlusions using realistic furniture models.…”
Section: Technological Trends In the Video Action Classification Modelmentioning
confidence: 99%
“…Many works focus on the classification setting, such as for images ([4,13,19,20] or for labeling activities in videos [21,22]). In Deep CORAL [13], the authors aim to improve the model performance on the OOD data based on access to the ID data and some OOD data.…”
Section: Domain Adaptationmentioning
confidence: 99%
“…A similar idea of aligning certain views of the samples from the source and target domain, but for videos, is used in [22]. Other works try to address domain adaptation for classification in a way that relies on having access to categorical labels [19][20][21]. Those techniques are not applicable for next-frame prediction, however.…”
Section: Domain Adaptationmentioning
confidence: 99%